Rohan Juneja

HALO: Hardware-aware quantization with low critical-path-delay weights for LLM acceleration

Feb 27, 2025
Via arXiv

NOVA: NoC-based Vector Unit for Mapping Attention Layers on a CNN Accelerator

May 07, 2024
Via arXiv