Picture for Se Yeon Kim

Se Yeon Kim

Are Vision-Language Models Truly Understanding Multi-vision Sensor?

Add code
Dec 30, 2024
Viaarxiv icon