Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Is In-Context Learning Sufficient for Instruction Following in LLMs?

May 30, 2024

Hao Zhao, Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion

Figure 1 for Is In-Context Learning Sufficient for Instruction Following in LLMs?

Figure 2 for Is In-Context Learning Sufficient for Instruction Following in LLMs?

Figure 3 for Is In-Context Learning Sufficient for Instruction Following in LLMs?

Figure 4 for Is In-Context Learning Sufficient for Instruction Following in LLMs?

Share this with someone who'll enjoy it:

Abstract:In-context learning (ICL) allows LLMs to learn from examples without changing their weights, which is a particularly promising capability for long-context LLMs that can potentially learn from many examples. Recently, Lin et al. (2024) proposed URIAL, a method using only three in-context examples to align base LLMs, achieving non-trivial instruction following performance. In this work, we show that, while effective, ICL alignment with URIAL still underperforms compared to instruction fine-tuning on established benchmarks such as MT-Bench and AlpacaEval 2.0 (LC), especially with more capable base LMs. Unlike for tasks such as classification, translation, or summarization, adding more ICL demonstrations for long-context LLMs does not systematically improve instruction following performance. To address this limitation, we derive a greedy selection approach for ICL examples that noticeably improves performance, yet without bridging the gap to instruction fine-tuning. Finally, we provide a series of ablation studies to better understand the reasons behind the remaining gap, and we show how some aspects of ICL depart from the existing knowledge and are specific to the instruction tuning setting. Overall, our work advances the understanding of ICL as an alignment technique. We provide our code at https://github.com/tml-epfl/icl-alignment.

* Preprint. Code at https://github.com/tml-epfl/icl-alignment

View paper on

Share this with someone who'll enjoy it:

Title:Is In-Context Learning Sufficient for Instruction Following in LLMs?

Paper and Code