Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Jun 04, 2024

Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen

Figure 1 for Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Figure 2 for Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Figure 3 for Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Figure 4 for Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Share this with someone who'll enjoy it:

Abstract:Large Language Models (LLMs) have shown excellent performance in language understanding, text generation, code synthesis, and many other tasks, while they still struggle in complex multi-step reasoning problems, such as mathematical reasoning. In this paper, through a newly proposed arithmetical puzzle problem, we show that the model can perform well on multi-step reasoning tasks via fine-tuning on high-quality synthetic data. Experimental results with the open-llama-3B model on three different test datasets show that not only the model can reach a zero-shot pass@1 at 0.44 on the in-domain dataset, it also demonstrates certain generalization capabilities on the out-of-domain datasets. Specifically, this paper has designed two out-of-domain datasets in the form of extending the numerical range and the composing components of the arithmetical puzzle problem separately. The fine-tuned models have shown encouraging performance on these two far more difficult tasks with the zero-shot pass@1 at 0.33 and 0.35, respectively.

* Accept by Findings of ACL 2024

View paper on

Share this with someone who'll enjoy it:

Title:Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data

Paper and Code