Picture for Karen Khatamifard

Karen Khatamifard

LLM in a flash: Efficient Large Language Model Inference with Limited Memory

Add code
Dec 12, 2023
Viaarxiv icon