Title: Large Language Model is a Good Policy Teacher for Training Reinforcement Learning Agents

Nov 29, 2023

Zihao Zhou, Bin Hu, Pu Zhang, Chenyang Zhao, Bin Liu

Abstract: Recent studies have shown that Large Language Models (LLMs) can solve complex sequential decision-making tasks by providing high-level instructions. However, LLM-based agents face limitations in real-time dynamic environments because they are not specialized for the specific target problem. Moreover, deploying such LLM-based agents is both costly and time-consuming in practical scenarios. In this paper, we introduce a novel framework that addresses these challenges by training a smaller-scale, specialized student agent using instructions from an LLM-based teacher agent. By leveraging guided actions provided by the teacher, the prior knowledge of the LLM is distilled into the local student model. Consequently, the student agent can be trained with significantly less data. Furthermore, subsequent training with environment feedback enables the student agent to surpass the capabilities of its teacher. We conducted experiments on three challenging MiniGrid environments to evaluate the effectiveness of our framework. The results demonstrate that our approach enhances sample efficiency and achieves superior performance compared to baseline methods. Our code is available at https://github.com/ZJLAB-AMMI/LLM4Teach.
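
The abstract describes two training signals for the student: guidance from the teacher's suggested actions, and feedback from the environment that later lets the student surpass the teacher. One plausible way to combine them is an RL loss plus a distillation term whose weight decays over training. The sketch below illustrates that idea only. The function names, the KL-divergence form of the distillation term, and the linear decay schedule are all assumptions made for illustration. They are not taken from the abstract and may differ from the LLM4Teach repository.

```python
import math

def softmax(logits):
    """Convert raw student logits into action probabilities."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same action set."""
    return sum(pi * math.log((pi + eps) / (qi + eps))
               for pi, qi in zip(p, q) if pi > 0)

def teacher_weight(step, anneal_steps, initial=1.0):
    """Hypothetical schedule: linearly decay the teacher term to zero."""
    return initial * max(0.0, 1.0 - step / anneal_steps)

def student_loss(rl_loss, teacher_probs, student_logits, step, anneal_steps):
    """Assumed objective: RL loss plus a decaying teacher-imitation penalty.

    rl_loss is whatever scalar the underlying RL algorithm produces;
    teacher_probs is the action distribution suggested by the LLM teacher.
    """
    student_probs = softmax(student_logits)
    distill = kl_divergence(teacher_probs, student_probs)
    return rl_loss + teacher_weight(step, anneal_steps) * distill
```

Early in training the teacher term dominates, so the student imitates the LLM's suggested actions. Once the weight reaches zero, only the environment-driven RL loss remains, which is what would allow the student to improve beyond the teacher.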

* 10 pages
