Title: Black-Box Tuning for Language-Model-as-a-Service

Feb 08, 2022

Tianxiang Sun, Yunfan Shao, Hong Qian, Xuanjing Huang, Xipeng Qiu

Abstract: Extremely large pre-trained language models (PTMs) such as GPT-3 are usually released as a service, which allows users to design task-specific prompts and query the PTMs through black-box APIs. In such a scenario, which we call Language-Model-as-a-Service (LMaaS), the gradients of the PTMs are usually unavailable. Can we optimize the task prompts by accessing only the model inference APIs? This paper proposes a black-box tuning framework that optimizes the continuous prompt prepended to the input text via derivative-free optimization. Instead of optimizing in the original high-dimensional prompt space, which is intractable for traditional derivative-free optimization, we perform optimization in a randomly generated subspace, which is feasible because of the low intrinsic dimensionality of large PTMs. The experimental results show that black-box tuning with RoBERTa on a few labeled samples not only significantly outperforms manual prompts and GPT-3's in-context learning, but also surpasses its gradient-based counterparts, namely prompt tuning and full model tuning.

* 14 pages. Code is available at https://github.com/txsun1997/Black-Box-Tuning
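
The idea in the abstract, searching a small random subspace with a gradient-free optimizer and projecting back to the full prompt space, can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the function names, the dimensions, the projection scaling, the quadratic toy loss standing in for an inference API, and the simple (1+1) evolution strategy used here as a generic derivative-free optimizer.

```python
import random


def make_projection(full_dim, sub_dim, rng):
    """Fixed random matrix A (full_dim x sub_dim); sampled once and never trained."""
    scale = 1.0 / sub_dim ** 0.5  # assumed scaling, chosen only to keep values moderate
    return [[rng.gauss(0.0, scale) for _ in range(sub_dim)] for _ in range(full_dim)]


def project(A, z):
    """Map a low-dimensional vector z to the full prompt space: prompt = A z."""
    return [sum(a * zi for a, zi in zip(row, z)) for row in A]


def make_toy_loss(full_dim, rng):
    """Hypothetical stand-in for an inference API: returns a loss value, no gradients."""
    target = [rng.gauss(0.0, 1.0) for _ in range(full_dim)]

    def loss(prompt):
        return sum((p - t) ** 2 for p, t in zip(prompt, target))

    return loss


def black_box_tune(full_dim=50, sub_dim=5, steps=300, sigma=0.1, seed=0):
    """Minimize the toy loss over z with a (1+1) evolution strategy (illustrative only)."""
    rng = random.Random(seed)
    A = make_projection(full_dim, sub_dim, rng)
    loss_fn = make_toy_loss(full_dim, rng)

    z = [0.0] * sub_dim  # search happens only in the sub_dim-dimensional space
    initial_loss = best_loss = loss_fn(project(A, z))
    for _ in range(steps):
        # Propose a random perturbation of z; no gradient information is used.
        candidate = [zi + rng.gauss(0.0, sigma) for zi in z]
        candidate_loss = loss_fn(project(A, candidate))  # one black-box query
        if candidate_loss < best_loss:  # keep the candidate only if it improves
            z, best_loss = candidate, candidate_loss
    return initial_loss, best_loss, z
```

Calling `black_box_tune()` returns the starting loss, the best loss found, and the optimized low-dimensional vector. Because candidates are accepted only when they improve the loss, the best loss never exceeds the starting one. In a real LMaaS setting each loss evaluation would be a call to the service, so the number of steps corresponds directly to the query budget.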
