Picture for Gangshan Wu

Gangshan Wu

GLAD: Generative Language-Assisted Visual Tracking for Low-Semantic Templates

Add code
Jan 31, 2026
Viaarxiv icon

VMonarch: Efficient Video Diffusion Transformers with Structured Attention

Add code
Jan 29, 2026
Viaarxiv icon

SAM 2++: Tracking Anything at Any Granularity

Add code
Oct 22, 2025
Viaarxiv icon

Pure-Pass: Fine-Grained, Adaptive Masking for Dynamic Token-Mixing Routing in Lightweight Image Super-Resolution

Add code
Oct 02, 2025
Viaarxiv icon

ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models

Add code
Aug 25, 2025
Viaarxiv icon

Spatial-Temporal Human-Object Interaction Detection

Add code
Aug 24, 2025
Viaarxiv icon

MTNet: Learning modality-aware representation with transformer for RGBT tracking

Add code
Aug 24, 2025
Viaarxiv icon

CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

Add code
May 22, 2025
Viaarxiv icon

RGB-D Tracking via Hierarchical Modality Aggregation and Distribution Network

Add code
Apr 24, 2025
Viaarxiv icon

RGB-D Video Object Segmentation via Enhanced Multi-store Feature Memory

Add code
Apr 23, 2025
Viaarxiv icon