Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Sep 15, 2023

Rachneet Sachdeva, Martin Tutek, Iryna Gurevych

Figure 1 for CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Figure 2 for CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Figure 3 for CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Figure 4 for CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Share this with someone who'll enjoy it:

Abstract:In recent years, large language models (LLMs) have shown remarkable capabilities at scale, particularly at generating text conditioned on a prompt. In our work, we investigate the use of LLMs to augment training data of small language models~(SLMs) with automatically generated counterfactual~(CF) instances -- i.e. minimally altered inputs -- in order to improve out-of-domain~(OOD) performance of SLMs in the extractive question answering~(QA) setup. We show that, across various LLM generators, such data augmentation consistently enhances OOD performance and improves model calibration for both confidence-based and rationale-augmented calibrator models. Furthermore, these performance improvements correlate with higher diversity of CF instances in terms of their surface form and semantic content. Finally, we show that CF augmented models which are easier to calibrate also exhibit much lower entropy when assigning importance, indicating that rationale-augmented calibrators prefer concise explanations.

* We make our code available at: https://github.com/UKPLab/CATfOOD

View paper on

Share this with someone who'll enjoy it:

Title:CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration

Paper and Code