Ion Stoica

The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More

Mar 25, 2026

M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling

Mar 15, 2026

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

Mar 09, 2026

SageBwd: A Trainable Low-bit Attention

Mar 02, 2026

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

Feb 26, 2026

AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization

Feb 23, 2026

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Feb 13, 2026

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Feb 03, 2026

Qrita: High-performance Top-k and Top-p Algorithm for GPUs using Pivot-based Truncation and Selection

Feb 02, 2026

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

Jan 23, 2026