Picture for Eunsu Kim

Eunsu Kim

K-BrowseComp: A Web Browsing Agent Benchmark Grounded in Korean Contexts

Add code
Jun 01, 2026
Viaarxiv icon

LoCar: Localization-Aware Evaluation of In-Vehicle Assistants through Fine-Grained Sociolinguistic Control

Add code
May 20, 2026
Viaarxiv icon

"I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration

Add code
May 20, 2026
Viaarxiv icon

Are they lovers or friends? Evaluating LLMs' Social Reasoning in English and Korean Dialogues

Add code
Oct 21, 2025
Viaarxiv icon

Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation

Add code
Jun 24, 2025
Viaarxiv icon

Flex-TravelPlanner: A Benchmark for Flexible Planning with Language Agents

Add code
Jun 05, 2025
Viaarxiv icon

BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge

Add code
May 27, 2025
Figure 1 for BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge
Figure 2 for BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge
Figure 3 for BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge
Figure 4 for BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge
Viaarxiv icon

MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language

Add code
May 20, 2025
Viaarxiv icon

When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts

Add code
Mar 21, 2025
Figure 1 for When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts
Figure 2 for When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts
Figure 3 for When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts
Figure 4 for When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts
Viaarxiv icon

Diffusion Models Through a Global Lens: Are They Culturally Inclusive?

Add code
Feb 13, 2025
Figure 1 for Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Figure 2 for Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Figure 3 for Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Figure 4 for Diffusion Models Through a Global Lens: Are They Culturally Inclusive?
Viaarxiv icon