Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:WizardLM: Empowering Large Language Models to Follow Complex Instructions

Apr 24, 2023

Can Xu, Qingfeng Sun, Kai Zheng, Xiubo Geng, Pu Zhao, Jiazhan Feng, Chongyang Tao, Daxin Jiang

Figure 1 for WizardLM: Empowering Large Language Models to Follow Complex Instructions

Figure 2 for WizardLM: Empowering Large Language Models to Follow Complex Instructions

Figure 3 for WizardLM: Empowering Large Language Models to Follow Complex Instructions

Figure 4 for WizardLM: Empowering Large Language Models to Follow Complex Instructions

Share this with someone who'll enjoy it:

Abstract:Training large language models (LLM) with open-domain instruction following data brings colossal success. However, manually creating such instruction data is very time-consuming and labor-intensive. Moreover, humans may struggle to produce high-complexity instructions. In this paper, we show an avenue for creating large amounts of instruction data with varying levels of complexity using LLM instead of humans. Starting with an initial set of instructions, we use our proposed Evol-Instruct to rewrite them step by step into more complex instructions. Then, we mix all generated instruction data to fine-tune LLaMA. We call the resulting model WizardLM. Human evaluations on a complexity-balanced test bed show that instructions from Evol-Instruct are superior to human-created ones. By analyzing the human evaluation results of the high complexity part, we demonstrate that outputs from our WizardLM model are preferred to outputs from OpenAI ChatGPT. Even though WizardLM still lags behind ChatGPT in some aspects, our findings suggest that fine-tuning with AI-evolved instructions is a promising direction for enhancing large language models. Our codes and generated data are public at https://github.com/nlpxucan/WizardLM

* large language model, instruction fine-tune

View paper on

Share this with someone who'll enjoy it:

Title:WizardLM: Empowering Large Language Models to Follow Complex Instructions

Paper and Code