Picture for Ting-Yao E. Hsu

Ting-Yao E. Hsu

Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023

Add code
Jan 31, 2025
Viaarxiv icon