Picture for Juanzi Li

Juanzi Li

EnvRL: Learn from Environment Dynamics in Agentic Reinforcement Learning

Add code
Jun 16, 2026
Viaarxiv icon

EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery

Add code
Jun 11, 2026
Viaarxiv icon

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Add code
Jun 03, 2026
Viaarxiv icon

LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Add code
May 29, 2026
Viaarxiv icon

Guiding LLM Post-training Data Engineering with Model Internals from Sparse Autoencoders

Add code
May 26, 2026
Viaarxiv icon

StoryAlign: Evaluating and Training Reward Models for Story Generation

Add code
May 06, 2026
Viaarxiv icon

MAIC-UI: Making Interactive Courseware with Generative UI

Add code
Apr 28, 2026
Viaarxiv icon

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Add code
Mar 12, 2026
Viaarxiv icon

GLM-5: from Vibe Coding to Agentic Engineering

Add code
Feb 17, 2026
Viaarxiv icon

WildReward: Learning Reward Models from In-the-Wild Human Interactions

Add code
Feb 09, 2026
Viaarxiv icon