Picture for Archit Patke

Archit Patke

Hierarchical Autoscaling for Large Language Model Serving with Chiron

Add code
Jan 14, 2025
Viaarxiv icon

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Add code
Apr 12, 2024
Viaarxiv icon