Picture for Roman Garipov

Roman Garipov

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Add code
Apr 09, 2025
Viaarxiv icon