Picture for Jonah Wonkyu Yi

Jonah Wonkyu Yi

NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention

Add code
Mar 02, 2024
Viaarxiv icon