Picture for Tristan Webb

Tristan Webb

Understanding the difficulty of low-precision post-training quantization of large language models

Add code
Oct 18, 2024
Figure 1 for Understanding the difficulty of low-precision post-training quantization of large language models
Figure 2 for Understanding the difficulty of low-precision post-training quantization of large language models
Figure 3 for Understanding the difficulty of low-precision post-training quantization of large language models
Figure 4 for Understanding the difficulty of low-precision post-training quantization of large language models
Viaarxiv icon

Scaling laws for post-training quantized large language models

Add code
Oct 15, 2024
Figure 1 for Scaling laws for post-training quantized large language models
Figure 2 for Scaling laws for post-training quantized large language models
Figure 3 for Scaling laws for post-training quantized large language models
Figure 4 for Scaling laws for post-training quantized large language models
Viaarxiv icon

A Hardware-Aware System for Accelerating Deep Neural Network Optimization

Add code
Feb 25, 2022
Figure 1 for A Hardware-Aware System for Accelerating Deep Neural Network Optimization
Figure 2 for A Hardware-Aware System for Accelerating Deep Neural Network Optimization
Figure 3 for A Hardware-Aware System for Accelerating Deep Neural Network Optimization
Figure 4 for A Hardware-Aware System for Accelerating Deep Neural Network Optimization
Viaarxiv icon