Title: Calibrating Language Models with Adaptive Temperature Scaling

Sep 29, 2024

Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn

Abstract: The effectiveness of large language models (LLMs) is measured not only by their ability to generate accurate outputs but also by their calibration, that is, how well their confidence scores reflect the probability that their outputs are correct. While unsupervised pre-training has been shown to yield LLMs with well-calibrated conditional probabilities, recent studies have shown that calibration degrades significantly after fine-tuning with reinforcement learning from human feedback (RLHF). In this work, we introduce Adaptive Temperature Scaling (ATS), a post-hoc calibration method that predicts a temperature scaling parameter for each token prediction. The predicted temperatures adapt to token-level features and are fit on a standard supervised fine-tuning (SFT) dataset. The adaptive nature of ATS addresses the varying degrees of calibration shift that can occur after RLHF fine-tuning. ATS improves calibration by over 10-50% across three downstream natural language evaluation benchmarks compared to prior calibration methods and does not impede performance improvements from RLHF.
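
To make the idea of a per-token temperature concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not specify the feature set, the architecture of the temperature predictor, or the fitting loss, so the linear head in `predict_temperature`, its `weights` and `bias` parameters, and the helper `calibrate_sequence` are all hypothetical names chosen for illustration. The sketch only shows the mechanism the abstract describes, namely dividing each token's logits by its own predicted temperature before the softmax.

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over logits after dividing them by a positive temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def predict_temperature(features, weights, bias):
    """Hypothetical linear head (an assumption, not the paper's model).

    The exponential keeps the predicted temperature strictly positive.
    """
    return math.exp(sum(w * x for w, x in zip(weights, features)) + bias)


def calibrate_sequence(token_logits, token_features, weights, bias):
    """Rescale each token's logits with its own predicted temperature."""
    calibrated = []
    for logits, features in zip(token_logits, token_features):
        temperature = predict_temperature(features, weights, bias)
        calibrated.append(softmax(logits, temperature))
    return calibrated
```

With identical logits `[2.0, 1.0, 0.0]` at two positions and hypothetical features `[0.0]` and `[1.0]` (weights `[1.0]`, bias `0.0`), the first position gets temperature 1 and keeps its original distribution, while the second gets a temperature above 1 and a flatter, less confident distribution. Because dividing by a positive scalar preserves the ordering of the logits, the most likely token at each position is unchanged in this sketch; only the confidence assigned to it moves.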

* EMNLP 2024
