Picture for Yunzhi Zhuge

Yunzhi Zhuge

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Add code
Jan 15, 2025
Viaarxiv icon

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

Add code
Jan 14, 2025
Viaarxiv icon

Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation

Add code
Jan 14, 2025
Viaarxiv icon

3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding

Add code
Jan 14, 2025
Viaarxiv icon

Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation

Add code
Dec 27, 2024
Viaarxiv icon

Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding

Add code
Nov 29, 2024
Viaarxiv icon

DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting

Add code
Nov 26, 2024
Viaarxiv icon

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Add code
Oct 26, 2024
Viaarxiv icon

SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning

Add code
Jul 10, 2024
Viaarxiv icon

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

Add code
Mar 18, 2024
Viaarxiv icon