Mohammed Tolba

Hybrid Dynamic Pruning: A Pathway to Efficient Transformer Inference

Jul 17, 2024
Via arXiv