Picture for Alfredo Garrachón Ruiz

Alfredo Garrachón Ruiz

TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Add code
Dec 10, 2024
Viaarxiv icon