Picture for Zihui Cheng

Zihui Cheng

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

Add code
Dec 17, 2024
Viaarxiv icon