Yongbin Li

Beyond Quantity: Trajectory Diversity Scaling for Code Agents

Feb 03, 2026
Via arXiv

ExpSeek: Self-Triggered Experience Seeking for Web Agents

Jan 13, 2026
Via arXiv

Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions

Jan 12, 2026
Via arXiv

Reward Modeling from Natural Language Human Feedback

Jan 12, 2026
Via arXiv

EvoRoute: Experience-Driven Self-Routing LLM Agent Systems

Jan 06, 2026
Via arXiv

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure

Dec 27, 2025
Via arXiv

Understanding Generalization in Role-Playing Models via Information Theory

Dec 19, 2025
Via arXiv

MOA: Multi-Objective Alignment for Role-Playing Agents

Dec 10, 2025
Via arXiv

Selective Weak-to-Strong Generalization

Nov 18, 2025
Via arXiv

CPO: Addressing Reward Ambiguity in Role-playing Dialogue via Comparative Policy Optimization

Aug 12, 2025
Via arXiv