Lukas Cavigelli

Late Breaking Results: The Art of Beating the Odds with Predictor-Guided Random Design Space Exploration

Feb 26, 2025
Via arXiv

Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding

Feb 12, 2025
Via arXiv

PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving

Jan 14, 2025
Via arXiv

SSSD: Simply-Scalable Speculative Decoding

Nov 08, 2024
Via arXiv

On-Device Domain Learning for Keyword Spotting on Low-Power Extreme Edge Embedded Systems

Mar 12, 2024
Via arXiv

Boosting keyword spotting through on-device learnable user speech characteristics

Mar 12, 2024
Via arXiv

Stella Nera: Achieving 161 TOp/s/W with Multiplier-free DNN Acceleration based on Approximate Matrix Multiplication

Nov 16, 2023
Via arXiv

RL-based Stateful Neural Adaptive Sampling and Denoising for Real-Time Path Tracing

Oct 05, 2023
Via arXiv

Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile

Sep 26, 2022
Via arXiv

Vau da muntanialas: Energy-efficient multi-die scalable acceleration of RNN inference

Feb 14, 2022
Via arXiv