Jinyoung Yeo

ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions

May 29, 2025
Via arXiv

LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study

May 26, 2025
Via arXiv

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

May 22, 2025
Via arXiv

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

May 21, 2025
Via arXiv

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization

May 19, 2025
Via arXiv

KULTURE Bench: A Benchmark for Assessing Language Model in Korean Cultural Context

Dec 10, 2024
Via arXiv

Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems

Nov 25, 2024
Via arXiv

Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths

Nov 08, 2024
Via arXiv

Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching

Oct 24, 2024
Via arXiv

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Oct 17, 2024
Via arXiv