Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Anders Johan Andreassen

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Nov 30, 2021

Maxwell Nye, Anders Johan Andreassen, Guy Gur-Ari, Henryk Michalewski, Jacob Austin, David Bieber, David Dohan, Aitor Lewkowycz, Maarten Bosma, David Luan(+2 more)

Figure 1 for Show Your Work: Scratchpads for Intermediate Computation with Language Models

Figure 2 for Show Your Work: Scratchpads for Intermediate Computation with Language Models

Figure 3 for Show Your Work: Scratchpads for Intermediate Computation with Language Models

Figure 4 for Show Your Work: Scratchpads for Intermediate Computation with Language Models

Abstract:Large pre-trained language models perform remarkably well on tasks that can be done "in one pass", such as generating realistic text or synthesizing computer programs. However, they struggle with tasks that require unbounded multi-step computation, such as adding integers or executing programs. Surprisingly, we find that these same models are able to perform complex multi-step computations -- even in the few-shot regime -- when asked to perform the operation "step by step", showing the results of intermediate computations. In particular, we train transformers to perform multi-step computations by asking them to emit intermediate computation steps into a "scratchpad". On a series of increasingly complex tasks ranging from long addition to the execution of arbitrary programs, we show that scratchpads dramatically improve the ability of language models to perform multi-step computations.

Via

Access Paper or Ask Questions