Picture for Chun-Feng Wu

Chun-Feng Wu

S$^{3}$: Increasing GPU Utilization during Generative Inference for Higher Throughput

Add code
Jun 09, 2023
Viaarxiv icon