Picture for Gio Paik

Gio Paik

Towards Truly Multilingual ASR: Generalizing Code-Switching ASR to Unseen Language Pairs

Add code
Jun 04, 2026
Viaarxiv icon

MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models

Add code
Jun 05, 2025
Figure 1 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 2 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 3 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 4 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Viaarxiv icon

Improving Fine-grained Visual Understanding in VLMs through Text-Only Training

Add code
Dec 17, 2024
Figure 1 for Improving Fine-grained Visual Understanding in VLMs through Text-Only Training
Figure 2 for Improving Fine-grained Visual Understanding in VLMs through Text-Only Training
Figure 3 for Improving Fine-grained Visual Understanding in VLMs through Text-Only Training
Figure 4 for Improving Fine-grained Visual Understanding in VLMs through Text-Only Training
Viaarxiv icon