Picture for Chenhao Xue

Chenhao Xue

LLM Inference Unveiled: Survey and Roofline Model Insights

Add code
Mar 11, 2024
Viaarxiv icon

Latency-aware Spatial-wise Dynamic Networks

Add code
Oct 12, 2022
Figure 1 for Latency-aware Spatial-wise Dynamic Networks
Figure 2 for Latency-aware Spatial-wise Dynamic Networks
Figure 3 for Latency-aware Spatial-wise Dynamic Networks
Figure 4 for Latency-aware Spatial-wise Dynamic Networks
Viaarxiv icon

PTQ4ViT: Post-Training Quantization Framework for Vision Transformers

Add code
Nov 24, 2021
Figure 1 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 2 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 3 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 4 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Viaarxiv icon

PTQ-SL: Exploring the Sub-layerwise Post-training Quantization

Add code
Oct 18, 2021
Figure 1 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 2 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 3 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 4 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Viaarxiv icon