Li Dong

Sherman

The Era of Agentic Organization: Learning to Organize with Language Models

Oct 30, 2025

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Oct 27, 2025

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

Sep 26, 2025

VibeVoice Technical Report

Aug 26, 2025

Data Efficacy for Language Model Training

Jun 26, 2025

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Jun 10, 2025

Reinforcement Pre-Training

Jun 09, 2025

Rectified Sparse Attention

Jun 05, 2025

On-Policy RL with Optimal Reward Baseline

May 29, 2025

From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications

May 28, 2025