Archit Patke

Characterizing GPU Resilience and Impact on AI/HPC Systems

Mar 14, 2025
Via arXiv

Hierarchical Autoscaling for Large Language Model Serving with Chiron

Jan 14, 2025
Via arXiv

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction

Apr 12, 2024
Via arXiv