Zhisong Zhang

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Dec 23, 2024
Via arXiv

Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models

Dec 21, 2024
Via arXiv

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Nov 26, 2024
Via arXiv

LoGU: Long-form Generation with Uncertainty Expressions

Oct 18, 2024
Via arXiv

Atomic Calibration of LLMs in Long-Form Generations

Oct 17, 2024
Via arXiv

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Jun 24, 2024
Via arXiv

On the Worst Prompt Performance of Large Language Models

Jun 08, 2024
Via arXiv

Self-playing Adversarial Language Game Enhances LLM Reasoning

Apr 16, 2024
Via arXiv

A Thorough Examination of Decoding Methods in the Era of LLMs

Feb 10, 2024
Via arXiv

Reasons to Reject? Aligning Language Models with Judgments

Dec 22, 2023
Via arXiv