Yunzhi Zhuge

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Jan 15, 2025

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

Jan 14, 2025

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding

Jan 14, 2025

Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation

Jan 14, 2025

Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation

Dec 27, 2024

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Nov 29, 2024

DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Nov 26, 2024

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Oct 26, 2024

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Jul 10, 2024

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

Mar 18, 2024