Haoli Bai

WeightedKV: Attention Scores Weighted Key-Value Cache Merging for Large Language Models

Mar 03, 2025
Via arXiv

TreeKV: Smooth Key-Value Cache Compression with Tree Structures

Jan 09, 2025
Via arXiv

FlatQuant: Flatness Matters for LLM Quantization

Oct 12, 2024
Via arXiv

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

Sep 26, 2024
Via arXiv

S2D: Sorted Speculative Decoding For More Efficient Deployment of Nested Large Language Models

Jul 02, 2024
Via arXiv

Visually Guided Generative Text-Layout Pre-training for Document Intelligence

Mar 27, 2024
Via arXiv

MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric

Mar 12, 2024
Via arXiv

IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Mar 02, 2024
Via arXiv

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding

Dec 19, 2022
Via arXiv

Dynamically Pruning SegFormer for Efficient Semantic Segmentation

Nov 18, 2021
Via arXiv