Yuchen Yan

University of Illinois Urbana-Champaign

CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution

Mar 18, 2026

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Mar 16, 2026

Optimal-Horizon Social Robot Navigation in Heterogeneous Crowds

Feb 28, 2026

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Feb 09, 2026

TSAQA: Time Series Analysis Question And Answering Benchmark

Jan 30, 2026

Subspace Alignment for Vision-Language Model Test-time Adaptation

Jan 13, 2026

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Oct 21, 2025

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Aug 07, 2025

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Aug 07, 2025

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Aug 07, 2025