Picture for Jinyoung Yeo

Jinyoung Yeo

KULTURE Bench: A Benchmark for Assessing Language Model in Korean Cultural Context

Add code
Dec 10, 2024
Viaarxiv icon

Stop Playing the Guessing Game! Target-free User Simulation for Evaluating Conversational Recommender Systems

Add code
Nov 25, 2024
Viaarxiv icon

Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths

Add code
Nov 08, 2024
Figure 1 for Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths
Figure 2 for Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths
Figure 3 for Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths
Figure 4 for Why These Documents? Explainable Generative Retrieval with Hierarchical Category Paths
Viaarxiv icon

Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching

Add code
Oct 24, 2024
Viaarxiv icon

Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation

Add code
Oct 17, 2024
Viaarxiv icon

Evaluating Robustness of Reward Models for Mathematical Reasoning

Add code
Oct 02, 2024
Figure 1 for Evaluating Robustness of Reward Models for Mathematical Reasoning
Figure 2 for Evaluating Robustness of Reward Models for Mathematical Reasoning
Figure 3 for Evaluating Robustness of Reward Models for Mathematical Reasoning
Figure 4 for Evaluating Robustness of Reward Models for Mathematical Reasoning
Viaarxiv icon

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Add code
Sep 29, 2024
Figure 1 for Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Figure 2 for Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Figure 3 for Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Figure 4 for Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code
Viaarxiv icon

YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion

Add code
Aug 31, 2024
Figure 1 for YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion
Figure 2 for YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion
Figure 3 for YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion
Figure 4 for YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion
Viaarxiv icon

Is Functional Correctness Enough to Evaluate Code Language Models? Exploring Diversity of Generated Codes

Add code
Aug 24, 2024
Viaarxiv icon

Large Language Models Are Self-Taught Reasoners: Enhancing LLM Applications via Tailored Problem-Solving Demonstrations

Add code
Aug 22, 2024
Viaarxiv icon