Picture for Majid Daliri

Majid Daliri

Unlocking the Theory Behind Scaling 1-Bit Neural Networks

Add code
Nov 03, 2024
Figure 1 for Unlocking the Theory Behind Scaling 1-Bit Neural Networks
Figure 2 for Unlocking the Theory Behind Scaling 1-Bit Neural Networks
Figure 3 for Unlocking the Theory Behind Scaling 1-Bit Neural Networks
Viaarxiv icon

Coupling without Communication and Drafter-Invariant Speculative Decoding

Add code
Aug 15, 2024
Viaarxiv icon

QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

Add code
Jun 05, 2024
Viaarxiv icon

KDEformer: Accelerating Transformers via Kernel Density Estimation

Add code
Feb 05, 2023
Viaarxiv icon