Ruwen Fan

Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management

Oct 29, 2024
Via arXiv