Picture for Shusheng Xu

Shusheng Xu

AREAL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Add code
Jan 31, 2026
Viaarxiv icon

From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents

Add code
Jan 30, 2026
Viaarxiv icon

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

Add code
Aug 13, 2025
Viaarxiv icon

How Far Are We from Optimal Reasoning Efficiency?

Add code
Jun 08, 2025
Viaarxiv icon

AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

Add code
May 30, 2025
Viaarxiv icon

On Designing Effective RL Reward at Training Time for LLM Reasoning

Add code
Oct 19, 2024
Figure 1 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 2 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 3 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Figure 4 for On Designing Effective RL Reward at Training Time for LLM Reasoning
Viaarxiv icon

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Add code
Apr 16, 2024
Figure 1 for Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Figure 2 for Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Figure 3 for Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Figure 4 for Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
Viaarxiv icon

Language-Guided Generation of Physically Realistic Robot Motion and Control

Add code
Jun 18, 2023
Figure 1 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 2 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 3 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Figure 4 for Language-Guided Generation of Physically Realistic Robot Motion and Control
Viaarxiv icon

Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension

Add code
Dec 14, 2021
Figure 1 for Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension
Figure 2 for Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension
Figure 3 for Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension
Figure 4 for Native Chinese Reader: A Dataset Towards Native-Level Chinese Machine Reading Comprehension
Viaarxiv icon

A Benchmark for Low-Switching-Cost Reinforcement Learning

Add code
Dec 13, 2021
Figure 1 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 2 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 3 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 4 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Viaarxiv icon