Picture for Kalaiarasi Sonai Muthu Anbananthen

Kalaiarasi Sonai Muthu Anbananthen

VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering

Add code
Apr 11, 2025
Viaarxiv icon