Kangdi Chen

FlashDecoding++: Faster Large Language Model Inference on GPUs

Nov 10, 2023
Via arXiv