Picture for Haonan Luo

Haonan Luo

Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration

Add code
Dec 24, 2024
Viaarxiv icon

Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering

Add code
Mar 14, 2024
Viaarxiv icon