Picture for Shiqian Su

Shiqian Su

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Add code
Dec 20, 2024
Viaarxiv icon

Learning 1D Causal Visual Representation with De-focus Attention Networks

Add code
Jun 06, 2024
Viaarxiv icon