Picture for Jieyu Zhao

Jieyu Zhao

Video-Based Reward Modeling for Computer-Use Agents

Add code
Mar 10, 2026
Viaarxiv icon

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

Add code
Feb 28, 2026
Viaarxiv icon

Experiential Reinforcement Learning

Add code
Feb 15, 2026
Viaarxiv icon

CoAct-1: Computer-using Agents with Coding as Actions

Add code
Aug 05, 2025
Figure 1 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 2 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 3 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 4 for CoAct-1: Computer-using Agents with Coding as Actions
Viaarxiv icon

Can LLMs Express Personality Across Cultures? Introducing CulturalPersonas for Evaluating Trait Alignment

Add code
Jun 06, 2025
Viaarxiv icon

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Add code
May 29, 2025
Figure 1 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 2 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 3 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 4 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Viaarxiv icon

Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models

Add code
May 24, 2025
Viaarxiv icon

The Hallucination Tax of Reinforcement Finetuning

Add code
May 20, 2025
Viaarxiv icon

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Add code
Apr 07, 2025
Viaarxiv icon

BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents

Add code
Apr 07, 2025
Viaarxiv icon