QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

Add code
Jun 05, 2024
Figure 1 for QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Figure 2 for QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Figure 3 for QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead
Figure 4 for QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: