Yunzhi Zhuge

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Feb 04, 2026

Towards Cross-Platform Generalization: Domain Adaptive 3D Detection with Augmentation and Pseudo-Labeling

Jan 13, 2026

The RoboSense Challenge: Sense Anything, Navigate Anywhere, Adapt Across Platforms

Jan 08, 2026

Parameter Aware Mamba Model for Multi-task Dense Prediction

Nov 18, 2025

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

Oct 29, 2025

Regularizing Subspace Redundancy of Low-Rank Adaptation

Jul 28, 2025

Learning Universal Features for Generalizable Image Forgery Localization

Apr 10, 2025

The Devil is in Temporal Token: High Quality Video Reasoning Segmentation

Jan 15, 2025

AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation

Jan 14, 2025

Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation

Jan 14, 2025