Taeyoon Kwon

EMBGuard: Constructing Hazard-Aware Guardrails for Safe Planning in Embodied Agents

May 29, 2026
Via arXiv

Towards Direct Evaluation of Harness Optimizers via Priority Ranking

May 21, 2026
Via arXiv

On Training Large Language Models for Long-Horizon Tasks: An Empirical Study of Horizon Length

May 04, 2026
Via arXiv

PAC-BENCH: Evaluating Multi-Agent Collaboration under Privacy Constraints

Apr 13, 2026
Via arXiv

Designing Memory-Augmented AR Agents for Spatiotemporal Reasoning in Personalized Task Assistance

Aug 12, 2025
Via arXiv

Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

May 22, 2025
Via arXiv

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

May 21, 2025
Via arXiv

Rethinking Reward Model Evaluation Through the Lens of Reward Overoptimization

May 19, 2025
Via arXiv

Evaluating Robustness of Reward Models for Mathematical Reasoning

Oct 02, 2024
Via arXiv

Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code

Sep 29, 2024
Via arXiv