Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Jun 18, 2024

Akshay Paruchuri, Jake Garrison, Shun Liao, John Hernandez, Jacob Sunshine, Tim Althoff, Xin Liu, Daniel McDuff

Figure 1 for What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Figure 2 for What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Figure 3 for What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Figure 4 for What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Share this with someone who'll enjoy it:

Abstract:Language models (LM) are capable of remarkably complex linguistic tasks; however, numerical reasoning is an area in which they frequently struggle. An important but rarely evaluated form of reasoning is understanding probability distributions. In this paper, we focus on evaluating the probabilistic reasoning capabilities of LMs using idealized and real-world statistical distributions. We perform a systematic evaluation of state-of-the-art LMs on three tasks: estimating percentiles, drawing samples, and calculating probabilities. We evaluate three ways to provide context to LMs 1) anchoring examples from within a distribution or family of distributions, 2) real-world context, 3) summary statistics on which to base a Normal approximation. Models can make inferences about distributions, and can be further aided by the incorporation of real-world context, example shots and simplified assumptions, even if these assumptions are incorrect or misspecified. To conduct this work, we developed a comprehensive benchmark distribution dataset with associated question-answer pairs that we will release publicly.

* 21 pages, 9 figures, 2 tables

View paper on

Share this with someone who'll enjoy it:

Title:What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

Paper and Code