Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Guiding Language Model Reasoning with Planning Tokens

Oct 09, 2023

Xinyi Wang, Lucas Caccia, Oleksiy Ostapenko, Xingdi Yuan, Alessandro Sordoni

Figure 1 for Guiding Language Model Reasoning with Planning Tokens

Figure 2 for Guiding Language Model Reasoning with Planning Tokens

Figure 3 for Guiding Language Model Reasoning with Planning Tokens

Figure 4 for Guiding Language Model Reasoning with Planning Tokens

Share this with someone who'll enjoy it:

Abstract:Large language models (LLMs) have recently attracted considerable interest for their ability to perform complex reasoning tasks, such as chain-of-thought reasoning. However, most of the existing approaches to enhance this ability rely heavily on data-driven methods, while neglecting the structural aspects of the model's reasoning capacity. We find that while LLMs can manage individual reasoning steps well, they struggle with maintaining consistency across an entire reasoning chain. To solve this, we introduce 'planning tokens' at the start of each reasoning step, serving as a guide for the model. These token embeddings are then fine-tuned along with the rest of the model parameters. Our approach requires a negligible increase in trainable parameters (just 0.001%) and can be applied through either full fine-tuning or a more parameter-efficient scheme. We demonstrate our method's effectiveness by applying it to three different LLMs, showing notable accuracy improvements across three math word problem datasets w.r.t. plain chain-of-thought fine-tuning baselines.

* 10 pages, 4 figures

View paper on

Share this with someone who'll enjoy it:

Title:Guiding Language Model Reasoning with Planning Tokens

Paper and Code