Picture for Huchuan Lu

Huchuan Lu

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Add code
Jan 15, 2025
Viaarxiv icon

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding

Add code
Jan 14, 2025
Viaarxiv icon

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

Add code
Jan 14, 2025
Viaarxiv icon

Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation

Add code
Jan 14, 2025
Viaarxiv icon

ReNeg: Learning Negative Embedding with Reward Guidance

Add code
Dec 27, 2024
Viaarxiv icon

SUTrack: Towards Simple and Unified Single Object Tracking

Add code
Dec 26, 2024
Viaarxiv icon

Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification

Add code
Dec 23, 2024
Viaarxiv icon

Autoregressive Video Generation without Vector Quantization

Add code
Dec 18, 2024
Viaarxiv icon

MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt

Add code
Dec 14, 2024
Figure 1 for MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Figure 2 for MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Figure 3 for MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Figure 4 for MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt
Viaarxiv icon

Towards Real-Time Open-Vocabulary Video Instance Segmentation

Add code
Dec 05, 2024
Viaarxiv icon