Picture for Yuzhuang Xu

Yuzhuang Xu

Lookahead Q-Cache: Achieving More Consistent KV Cache Eviction via Pseudo Query

Add code
May 24, 2025
Viaarxiv icon

Think Before You Accept: Semantic Reflective Verification for Faster Speculative Decoding

Add code
May 24, 2025
Viaarxiv icon

Perspective Transition of Large Language Models for Solving Subjective Tasks

Add code
Jan 16, 2025
Figure 1 for Perspective Transition of Large Language Models for Solving Subjective Tasks
Figure 2 for Perspective Transition of Large Language Models for Solving Subjective Tasks
Figure 3 for Perspective Transition of Large Language Models for Solving Subjective Tasks
Figure 4 for Perspective Transition of Large Language Models for Solving Subjective Tasks
Viaarxiv icon

CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs

Add code
Dec 12, 2024
Figure 1 for CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs
Figure 2 for CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs
Figure 3 for CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs
Figure 4 for CRVQ: Channel-relaxed Vector Quantization for Extreme Compression of LLMs
Viaarxiv icon

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Add code
Oct 07, 2024
Viaarxiv icon

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models

Add code
Jun 13, 2024
Figure 1 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 2 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 3 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Figure 4 for Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
Viaarxiv icon

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset

Add code
Feb 18, 2024
Figure 1 for UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Figure 2 for UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Figure 3 for UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Figure 4 for UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
Viaarxiv icon

OneBit: Towards Extremely Low-bit Large Language Models

Add code
Feb 17, 2024
Viaarxiv icon

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf

Add code
Sep 09, 2023
Viaarxiv icon

Pluggable Neural Machine Translation Models via Memory-augmented Adapters

Add code
Jul 12, 2023
Figure 1 for Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Figure 2 for Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Figure 3 for Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Figure 4 for Pluggable Neural Machine Translation Models via Memory-augmented Adapters
Viaarxiv icon