Picture for Xianwei Zhuang

Xianwei Zhuang

VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model

Add code
Jan 21, 2025
Viaarxiv icon

VASparse: Towards Efficient Visual Hallucination Mitigation for Large Vision-Language Model via Visual-Aware Sparsification

Add code
Jan 11, 2025
Viaarxiv icon

Uncertainty-aware sign language video retrieval with probability distribution modeling

Add code
May 30, 2024
Viaarxiv icon