Title: Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents

Feb 18, 2024

Renxi Wang, Haonan Li, Xudong Han, Yixuan Zhang, Timothy Baldwin

Abstract: Large language models (LLMs) have achieved success in acting as agents, which interact with environments through tools like search engines. However, LLMs are not optimized specifically for tool use during training or alignment, limiting their effectiveness as agents. To resolve this problem, previous work has collected interaction trajectories between GPT-4 and environments, and fine-tuned smaller models with them. As part of this, the standard approach has been to simply discard trajectories that do not finish the task successfully, which, on the one hand, leads to a significant waste of data and resources, and on the other hand, has the potential to limit the possible optimization paths during fine-tuning. In this paper, we contend that large language models can learn from failures through appropriate data cleaning and fine-tuning strategies. We conduct experiments on mathematical reasoning, multi-hop question answering, and strategic question answering tasks. Experimental results demonstrate that compared to solely using positive examples, incorporating negative examples enhances model performance by a large margin.
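As a rough illustration of the idea in the abstract, the sketch below contrasts discarding failed trajectories with keeping them in the fine-tuning set. The abstract does not describe the actual cleaning rule or fine-tuning strategy, so everything here is an assumption made for illustration: the `Trajectory` type, the step-count filter in `is_clean`, and the two outcome prefixes that mark an example as successful or failed are all hypothetical and not taken from the paper.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Trajectory:
    """One agent-environment interaction (hypothetical structure)."""
    task: str
    steps: List[str]
    success: bool


# Hypothetical outcome markers; the abstract does not specify how
# successful and failed trajectories are distinguished during training.
POSITIVE_PREFIX = "[successful trajectory]"
NEGATIVE_PREFIX = "[failed trajectory]"


def is_clean(traj: Trajectory, max_steps: int = 10) -> bool:
    """Assumed cleaning rule: drop empty or overly long trajectories."""
    return 0 < len(traj.steps) <= max_steps


def build_examples(
    trajectories: List[Trajectory], keep_negative: bool = True
) -> List[Dict[str, str]]:
    """Turn trajectories into prompt/completion pairs for fine-tuning.

    With keep_negative=False this mimics the baseline that discards
    failed trajectories; with keep_negative=True failures are kept and
    tagged so they remain distinguishable from successes.
    """
    examples = []
    for traj in trajectories:
        if not is_clean(traj):
            continue
        if not traj.success and not keep_negative:
            continue
        prefix = POSITIVE_PREFIX if traj.success else NEGATIVE_PREFIX
        examples.append(
            {
                "prompt": f"{prefix}\n{traj.task}",
                "completion": "\n".join(traj.steps),
            }
        )
    return examples
```

Calling `build_examples(trajectories, keep_negative=False)` reproduces the discard-failures baseline, so the two settings can be compared on the same collected data.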

* Agent, LLM, Large Language Model
