Picture for Ben Athiwaratkun

Ben Athiwaratkun

Introspective Diffusion Language Models

Add code
Apr 13, 2026
Viaarxiv icon

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Add code
Apr 09, 2026
Viaarxiv icon

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Add code
Mar 18, 2026
Viaarxiv icon

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Add code
Mar 04, 2026
Viaarxiv icon

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Add code
Feb 06, 2026
Viaarxiv icon

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Add code
Dec 31, 2025
Viaarxiv icon

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

Add code
Nov 17, 2025
Viaarxiv icon

Intelligence per Watt: Measuring Intelligence Efficiency of Local AI

Add code
Nov 14, 2025
Viaarxiv icon

Staircase Streaming for Low-Latency Multi-Agent Inference

Add code
Oct 06, 2025
Figure 1 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 2 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 3 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 4 for Staircase Streaming for Low-Latency Multi-Agent Inference
Viaarxiv icon

Data Diversification Methods In Alignment Enhance Math Performance In LLMs

Add code
Jul 02, 2025
Figure 1 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 2 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 3 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 4 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Viaarxiv icon