Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Aug 07, 2023

Jannik Kossen, Tom Rainforth, Yarin Gal

Figure 1 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 2 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 3 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Figure 4 for In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Share this with someone who'll enjoy it:

Abstract:The performance of Large Language Models (LLMs) on downstream tasks often improves significantly when including examples of the input-label relationship in the context. However, there is currently no consensus about how this in-context learning (ICL) ability of LLMs works: for example, while Xie et al. (2021) liken ICL to a general-purpose learning algorithm, Min et al. (2022b) argue ICL does not even learn label relationships from in-context examples. In this paper, we study (1) how labels of in-context examples affect predictions, (2) how label relationships learned during pre-training interact with input-label examples provided in-context, and (3) how ICL aggregates label information across in-context examples. Our findings suggests LLMs usually incorporate information from in-context labels, but that pre-training and in-context label relationships are treated differently, and that the model does not consider all in-context information equally. Our results give insights into understanding and aligning LLM behavior.

View paper on

Share this with someone who'll enjoy it:

Title:In-Context Learning in Large Language Models Learns Label Relationships but Is Not Conventional Learning

Paper and Code