Picture for Mengcheng Lan

Mengcheng Lan

GeoGround: A Unified Large Vision-Language Model. for Remote Sensing Visual Grounding

Add code
Nov 16, 2024
Viaarxiv icon

Text4Seg: Reimagining Image Segmentation as Text Generation

Add code
Oct 13, 2024
Viaarxiv icon

Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification

Add code
Sep 17, 2024
Viaarxiv icon

ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Add code
Aug 09, 2024
Viaarxiv icon

ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference

Add code
Jul 17, 2024
Viaarxiv icon

Learning to Discover Knowledge: A Weakly-Supervised Partial Domain Adaptation Approach

Add code
Jun 20, 2024
Viaarxiv icon

SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation

Add code
Oct 27, 2023
Viaarxiv icon

MIMO Is All You Need : A Strong Multi-In-Multi-Out Baseline for Video Prediction

Add code
Dec 09, 2022
Viaarxiv icon

From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting

Add code
Jul 21, 2021
Figure 1 for From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Figure 2 for From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Figure 3 for From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Figure 4 for From Single to Multiple: Leveraging Multi-level Prediction Spaces for Video Forecasting
Viaarxiv icon