Picture for Xin Yu

Xin Yu

Safe and Reliable Diffusion Models via Subspace Projection

Add code
Mar 21, 2025
Viaarxiv icon

Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment

Add code
Mar 17, 2025
Viaarxiv icon

Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics

Add code
Mar 17, 2025
Viaarxiv icon

ObjectMover: Generative Object Movement with Video Prior

Add code
Mar 11, 2025
Viaarxiv icon

7ABAW-Compound Expression Recognition via Curriculum Learning

Add code
Mar 11, 2025
Viaarxiv icon

EasyCraft: A Robust and Efficient Framework for Automatic Avatar Crafting

Add code
Mar 03, 2025
Viaarxiv icon

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Add code
Feb 27, 2025
Viaarxiv icon

Trust-Aware Diversion for Data-Effective Distillation

Add code
Feb 07, 2025
Viaarxiv icon

CMamba: Learned Image Compression with State Space Models

Add code
Feb 07, 2025
Viaarxiv icon

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding

Add code
Dec 18, 2024
Viaarxiv icon