Picture for Geewook Kim

Geewook Kim

Natural-Language Temporal Grounding in Hour-Long Videos is a Search Problem: A Benchmark and Empirical Decomposition

Add code
Jun 10, 2026
Viaarxiv icon

KCSAT-ML: Probing Reasoning Models with Nationwide-Cohort Human Difficulty

Add code
Jun 09, 2026
Viaarxiv icon

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Add code
Jun 01, 2026
Viaarxiv icon

Decentralized Instruction Tuning: Conflict-Aware Splitting and Weight Merging

Add code
Jun 01, 2026
Viaarxiv icon

Context-Informed Grounding Supervision

Add code
Jun 18, 2025
Viaarxiv icon

MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models

Add code
Jun 16, 2025
Figure 1 for MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models
Figure 2 for MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models
Figure 3 for MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models
Figure 4 for MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models
Viaarxiv icon

MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models

Add code
Jun 05, 2025
Figure 1 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 2 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 3 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Figure 4 for MMRefine: Unveiling the Obstacles to Robust Refinement in Multimodal Large Language Models
Viaarxiv icon

Evaluating Multimodal Generative AI with Korean Educational Standards

Add code
Feb 21, 2025
Viaarxiv icon

How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?

Add code
Oct 10, 2024
Figure 1 for How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
Figure 2 for How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
Figure 3 for How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
Figure 4 for How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
Viaarxiv icon

On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning

Add code
Jun 17, 2024
Viaarxiv icon