Picture for Jaehong Cho

Jaehong Cho

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Add code
Aug 10, 2024
Viaarxiv icon