Ben Keller

FGMP: Fine-Grained Mixed-Precision Weight and Activation Quantization for Hardware-Accelerated LLM Inference

Apr 19, 2025
Via arXiv

GauRast: Enhancing GPU Triangle Rasterizers to Accelerate 3D Gaussian Splatting

Mar 20, 2025
Via arXiv

HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression

Nov 30, 2022
Via arXiv