Shaoguang Yan

Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference

Jun 26, 2023
Via arXiv