Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:ControlLLM: Augment Language Models with Tools by Searching on Graphs

Oct 30, 2023

Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Zhiheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai(+1 more)

Figure 1 for ControlLLM: Augment Language Models with Tools by Searching on Graphs

Figure 2 for ControlLLM: Augment Language Models with Tools by Searching on Graphs

Figure 3 for ControlLLM: Augment Language Models with Tools by Searching on Graphs

Figure 4 for ControlLLM: Augment Language Models with Tools by Searching on Graphs

Share this with someone who'll enjoy it:

Abstract:We present ControlLLM, a novel framework that enables large language models (LLMs) to utilize multi-modal tools for solving complex real-world tasks. Despite the remarkable performance of LLMs, they still struggle with tool invocation due to ambiguous user prompts, inaccurate tool selection and parameterization, and inefficient tool scheduling. To overcome these challenges, our framework comprises three key components: (1) a \textit{task decomposer} that breaks down a complex task into clear subtasks with well-defined inputs and outputs; (2) a \textit{Thoughts-on-Graph (ToG) paradigm} that searches the optimal solution path on a pre-built tool graph, which specifies the parameter and dependency relations among different tools; and (3) an \textit{execution engine with a rich toolbox} that interprets the solution path and runs the tools efficiently on different computational devices. We evaluate our framework on diverse tasks involving image, audio, and video processing, demonstrating its superior accuracy, efficiency, and versatility compared to existing methods. The code is at https://github.com/OpenGVLab/ControlLLM .

* 22 pages, 9 figures, 10 tables

View paper on

Share this with someone who'll enjoy it:

Title:ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper and Code