Picture for Basil Homer

Basil Homer

CHAI: Clustered Head Attention for Efficient LLM Inference

Add code
Mar 12, 2024
Viaarxiv icon