Peter Conway Humphreys

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Apr 02, 2024
Via arXiv