Picture for Li Dong

Li Dong

Sherman

VibeVoice Technical Report

Add code
Aug 26, 2025
Viaarxiv icon

Data Efficacy for Language Model Training

Add code
Jun 26, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications

Add code
May 28, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Viaarxiv icon

MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems

Add code
May 16, 2025
Viaarxiv icon