Title: Why does in-context learning fail sometimes? Evaluating in-context learning on open and closed questions

Jul 02, 2024

Xiang Li, Haoran Tang, Siyu Chen, Ziwei Wang, Ryan Chen, Marcin Abram

Abstract: We measure the performance of in-context learning as a function of task novelty and difficulty for open and closed questions. For that purpose, we created a novel benchmark consisting of hard scientific questions, each paired with a context of varying relevance. We show that, counter-intuitively, a context that is more aligned with the topic does not always help more than a less relevant context. This effect is especially visible for open questions and for questions of high difficulty or novelty. This result reveals a fundamental difference in how large language models treat closed-form and open-form questions, and it shows the need for more robust evaluation of in-context learning across a variety of question types. It also raises a new question: how to optimally select a context for large language models, especially in Retrieval Augmented Generation (RAG) systems. Our results suggest that the answer can be highly application-dependent and might be contingent on factors including the format of the question, the perceived difficulty level of the questions, and the novelty or popularity of the information we seek.

* 8 pages plus references, 4 main figures, 6 pages of supplementary material
