Picture for Yongxin Liao

Yongxin Liao

TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

Add code
Apr 14, 2024
Viaarxiv icon