Picture for Ion Stoica

Ion Stoica

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Add code
Feb 19, 2025
Viaarxiv icon

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Viaarxiv icon

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Add code
Feb 11, 2025
Viaarxiv icon

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Add code
Feb 10, 2025
Viaarxiv icon

Fast Video Generation with Sliding Tile Attention

Add code
Feb 06, 2025
Viaarxiv icon

Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning

Add code
Feb 06, 2025
Viaarxiv icon

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Viaarxiv icon

BARE: Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation

Add code
Feb 03, 2025
Viaarxiv icon

Locality-aware Fair Scheduling in LLM Serving

Add code
Jan 24, 2025
Viaarxiv icon

Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards

Add code
Jan 13, 2025
Viaarxiv icon