Picture for Luoxin Ye

Luoxin Ye

LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression

Add code
Jun 28, 2024
Viaarxiv icon