Picture for Jinjie Yang

Jinjie Yang

Topology-aware Preemptive Scheduling for Co-located LLM Workloads

Add code
Nov 18, 2024
Viaarxiv icon