Title: Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Feb 06, 2025

Xiao-Wen Yang, Xuan-Yi Zhu, Wen-Da Wei, Ding-Chu Zhang, Jie-Jing Shao, Zhi Zhou, Lan-Zhe Guo, Yu-Feng Li

Abstract: The integration of slow-thinking mechanisms into large language models (LLMs) offers a promising path toward achieving Level 2 AGI Reasoners, as exemplified by systems like OpenAI's o1. However, several significant challenges remain, including inefficient overthinking and an overreliance on auxiliary reward models. We point out that these limitations stem from LLMs' inability to internalize the search process, a key component of effective reasoning. A critical step toward addressing this issue is enabling LLMs to autonomously determine when and where to backtrack, a fundamental operation in traditional search algorithms. To this end, we propose a self-backtracking mechanism that equips LLMs with the ability to backtrack during both training and inference. This mechanism enhances not only reasoning ability but also efficiency, by transforming slow-thinking processes into fast-thinking ones through self-improvement. Empirical evaluations demonstrate that our proposal significantly enhances the reasoning capabilities of LLMs, achieving a performance gain of over 40 percent compared to the optimal-path supervised fine-tuning method. We believe this study introduces a novel and promising pathway for developing more advanced and robust Reasoners.

* This is a preprint under review, 15 pages, 13 figures
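
For readers unfamiliar with the "backtrack" operation that the abstract borrows from traditional search algorithms, the following is a minimal sketch of classic depth-first backtracking on a toy subset-sum problem. It is not the paper's self-backtracking mechanism, which operates inside a language model; the function name `backtracking_search`, the toy task, and the assumption of positive integers are all illustrative choices made here.

```python
from typing import List, Optional


def backtracking_search(numbers: List[int], target: int) -> Optional[List[int]]:
    """Return a subset of `numbers` that sums to `target`, or None.

    Illustrative only: assumes all numbers are positive integers.
    """
    path: List[int] = []  # choices made so far along the current branch

    def extend(start: int, remaining: int) -> bool:
        if remaining == 0:
            return True  # goal reached; `path` holds the solution
        for i in range(start, len(numbers)):
            n = numbers[i]
            if n > remaining:
                continue  # prune: this choice would overshoot the target
            path.append(n)  # step forward along this branch
            if extend(i + 1, remaining - n):
                return True
            path.pop()  # step back: undo the choice, then try the next one
        return False  # every option from this state failed

    return list(path) if extend(0, target) else None


if __name__ == "__main__":
    print(backtracking_search([8, 6, 7, 5], 12))
```

In this sketch the search first commits to 8, finds no way to complete the sum, and steps back before eventually reaching a solution through 7. The "when and where to backtrack" decision is hard-coded in the loop here; the abstract's proposal is for the model itself to learn that decision.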
