Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:MAmmoTH2: Scaling Instructions from the Web

May 06, 2024

Xiang Yue, Tuney Zheng, Ge Zhang, Wenhu Chen

Figure 1 for MAmmoTH2: Scaling Instructions from the Web

Figure 2 for MAmmoTH2: Scaling Instructions from the Web

Figure 3 for MAmmoTH2: Scaling Instructions from the Web

Figure 4 for MAmmoTH2: Scaling Instructions from the Web

Share this with someone who'll enjoy it:

Abstract:Instruction tuning improves the reasoning abilities of large language models (LLMs), with data quality and scalability being the crucial factors. Most instruction tuning data come from human crowd-sourcing or GPT-4 distillation. We propose a paradigm to efficiently harvest 10 million naturally existing instruction data from the pre-training web corpus to enhance LLM reasoning. Our approach involves (1) recalling relevant documents, (2) extracting instruction-response pairs, and (3) refining the extracted pairs using open-source LLMs. Fine-tuning base LLMs on this dataset, we build MAmmoTH2 models, which significantly boost performance on reasoning benchmarks. Notably, MAmmoTH2-7B's (Mistral) performance increases from 11% to 34% on MATH and from 36% to 67% on GSM8K without training on any in-domain data. Further training MAmmoTH2 on public instruction tuning datasets yields MAmmoTH2-Plus, achieving state-of-the-art performance on several reasoning and chatbot benchmarks. Our work demonstrates how to harvest large-scale, high-quality instruction data without costly human annotation or GPT-4 distillation, providing a new paradigm for building better instruction tuning data.

* Work in Progress

View paper on

Share this with someone who'll enjoy it:

Title:MAmmoTH2: Scaling Instructions from the Web

Paper and Code