Picture for Reena Elangovan

Reena Elangovan

BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference

Add code
Feb 07, 2025
Viaarxiv icon

Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration

Add code
Nov 25, 2020
Figure 1 for Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration
Figure 2 for Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration
Figure 3 for Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration
Figure 4 for Ax-BxP: Approximate Blocked Computation for Precision-Reconfigurable Deep Neural Network Acceleration
Viaarxiv icon