Picture for Matei Zaharia

Matei Zaharia

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Add code
Jul 03, 2025
Viaarxiv icon

LEANN: A Low-Storage Vector Index

Add code
Jun 09, 2025
Viaarxiv icon

EXP-Bench: Can AI Conduct AI Research Experiments?

Add code
May 30, 2025
Viaarxiv icon

ColBERT-serve: Efficient Multi-Stage Memory-Mapped Scoring

Add code
Apr 21, 2025
Viaarxiv icon

Reasoning Models Can Be Effective Without Thinking

Add code
Apr 14, 2025
Viaarxiv icon

Why Do Multi-Agent LLM Systems Fail?

Add code
Mar 17, 2025
Viaarxiv icon

LangProBe: a Language Programs Benchmark

Add code
Feb 27, 2025
Viaarxiv icon

Optimizing Model Selection for Compound AI Systems

Add code
Feb 20, 2025
Viaarxiv icon

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Add code
Feb 11, 2025
Viaarxiv icon

Adaptive Semantic Prompt Caching with VectorQ

Add code
Feb 06, 2025
Viaarxiv icon