Picture for Yutao Sun

Yutao Sun

Universal YOCO for Efficient Depth Scaling

Add code
Apr 01, 2026
Viaarxiv icon

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

Add code
Mar 26, 2026
Viaarxiv icon

Geometric Autoencoder for Diffusion Models

Add code
Mar 12, 2026
Viaarxiv icon

VIBEVOICE-ASR Technical Report

Add code
Jan 26, 2026
Viaarxiv icon

VibeVoice Technical Report

Add code
Aug 26, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Figure 1 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 2 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 3 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 4 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Figure 1 for Reinforcement Pre-Training
Figure 2 for Reinforcement Pre-Training
Figure 3 for Reinforcement Pre-Training
Figure 4 for Reinforcement Pre-Training
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?

Add code
Feb 19, 2025
Figure 1 for The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Figure 2 for The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Figure 3 for The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Figure 4 for The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?
Viaarxiv icon

Multimodal Latent Language Modeling with Next-Token Diffusion

Add code
Dec 11, 2024
Figure 1 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 2 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 3 for Multimodal Latent Language Modeling with Next-Token Diffusion
Figure 4 for Multimodal Latent Language Modeling with Next-Token Diffusion
Viaarxiv icon