Minxing Huang

Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management

Oct 29, 2024
Via arXiv