Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Robust Natural Language Understanding with Residual Attention Debiasing

May 28, 2023

Fei Wang, James Y. Huang, Tianyi Yan, Wenxuan Zhou, Muhao Chen

Figure 1 for Robust Natural Language Understanding with Residual Attention Debiasing

Figure 2 for Robust Natural Language Understanding with Residual Attention Debiasing

Figure 3 for Robust Natural Language Understanding with Residual Attention Debiasing

Figure 4 for Robust Natural Language Understanding with Residual Attention Debiasing

Share this with someone who'll enjoy it:

Abstract:Natural language understanding (NLU) models often suffer from unintended dataset biases. Among bias mitigation methods, ensemble-based debiasing methods, especially product-of-experts (PoE), have stood out for their impressive empirical success. However, previous ensemble-based debiasing methods typically apply debiasing on top-level logits without directly addressing biased attention patterns. Attention serves as the main media of feature interaction and aggregation in PLMs and plays a crucial role in providing robust prediction. In this paper, we propose REsidual Attention Debiasing (READ), an end-to-end debiasing method that mitigates unintended biases from attention. Experiments on three NLU tasks show that READ significantly improves the performance of BERT-based models on OOD data with shortcuts removed, including +12.9% accuracy on HANS, +11.0% accuracy on FEVER-Symmetric, and +2.7% F1 on PAWS. Detailed analyses demonstrate the crucial role of unbiased attention in robust NLU models and that READ effectively mitigates biases in attention. Code is available at https://github.com/luka-group/READ.

* ACL 2023 Findings

View paper on

Share this with someone who'll enjoy it:

Title:Robust Natural Language Understanding with Residual Attention Debiasing

Paper and Code