Picture for Haoli Bai

Haoli Bai

FlatQuant: Flatness Matters for LLM Quantization

Add code
Oct 12, 2024
Figure 1 for FlatQuant: Flatness Matters for LLM Quantization
Figure 2 for FlatQuant: Flatness Matters for LLM Quantization
Figure 3 for FlatQuant: Flatness Matters for LLM Quantization
Figure 4 for FlatQuant: Flatness Matters for LLM Quantization
Viaarxiv icon

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Add code
Sep 26, 2024
Figure 1 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 2 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 3 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Figure 4 for EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
Viaarxiv icon

S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models

Add code
Jul 02, 2024
Viaarxiv icon

Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Add code
Mar 27, 2024
Viaarxiv icon

MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric

Add code
Mar 12, 2024
Viaarxiv icon

IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Add code
Mar 02, 2024
Figure 1 for IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Figure 2 for IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Figure 3 for IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Figure 4 for IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Viaarxiv icon

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding

Add code
Dec 19, 2022
Viaarxiv icon

Dynamically pruning segformer for efficient semantic segmentation

Add code
Nov 18, 2021
Figure 1 for Dynamically pruning segformer for efficient semantic segmentation
Figure 2 for Dynamically pruning segformer for efficient semantic segmentation
Figure 3 for Dynamically pruning segformer for efficient semantic segmentation
Figure 4 for Dynamically pruning segformer for efficient semantic segmentation
Viaarxiv icon

Towards Efficient Post-training Quantization of Pre-trained Language Models

Add code
Sep 30, 2021
Figure 1 for Towards Efficient Post-training Quantization of Pre-trained Language Models
Figure 2 for Towards Efficient Post-training Quantization of Pre-trained Language Models
Figure 3 for Towards Efficient Post-training Quantization of Pre-trained Language Models
Figure 4 for Towards Efficient Post-training Quantization of Pre-trained Language Models
Viaarxiv icon

Discrete Auto-regressive Variational Attention Models for Text Modeling

Add code
Jun 16, 2021
Figure 1 for Discrete Auto-regressive Variational Attention Models for Text Modeling
Figure 2 for Discrete Auto-regressive Variational Attention Models for Text Modeling
Figure 3 for Discrete Auto-regressive Variational Attention Models for Text Modeling
Figure 4 for Discrete Auto-regressive Variational Attention Models for Text Modeling
Viaarxiv icon