Yuqing Zhou

Towards Robust Text Classification: Mitigating Spurious Correlations with Causal Learning

Nov 01, 2024

Yuqing Zhou, Ziwei Zhu

Abstract: In text classification tasks, models often rely on spurious correlations for predictions, incorrectly associating irrelevant features with the target labels. This issue limits the robustness and generalization of models, especially when faced with out-of-distribution data where such spurious correlations no longer hold. To address this challenge, we propose the Causally Calibrated Robust Classifier (CCR), which aims to reduce models' reliance on spurious correlations and improve model robustness. Our approach integrates a causal feature selection method based on counterfactual reasoning, along with an unbiased inverse propensity weighting (IPW) loss function. By focusing on selecting causal features, we ensure that the model relies less on spurious features during prediction. We theoretically justify our approach and empirically show that CCR achieves state-of-the-art performance among methods without group labels, and in some cases, it can compete with the models that utilize group labels.
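
For readers unfamiliar with inverse propensity weighting, the general idea behind an IPW loss can be sketched as below. This is a generic illustration, not the CCR loss itself: the abstract does not say how propensities are defined or estimated, so here they are simply taken as inputs, and the function name `ipw_cross_entropy` and the `clip` parameter are assumptions made for this example.

```python
import math

def ipw_cross_entropy(probs, labels, propensities, clip=0.05):
    """Mean cross-entropy with each example weighted by 1 / propensity.

    probs:        per-example lists of predicted class probabilities
    labels:       integer class index for each example
    propensities: assumed-given estimates in (0, 1]; how they are obtained
                  is outside the scope of this sketch
    clip:         hypothetical lower bound that keeps weights from exploding
    """
    if not (len(probs) == len(labels) == len(propensities)):
        raise ValueError("inputs must have the same length")
    total = 0.0
    for p, y, e in zip(probs, labels, propensities):
        weight = 1.0 / max(e, clip)        # rarer examples get larger weights
        total += -weight * math.log(p[y])  # weighted negative log-likelihood
    return total / len(labels)

# Two examples with identical predictions; the second has propensity 0.5,
# so it counts twice as much as the first.
loss = ipw_cross_entropy([[0.5, 0.5], [0.5, 0.5]], [0, 1], [1.0, 0.5])
```

Up-weighting examples with low propensity is what lets the weighted loss approximate the loss under a distribution where those examples are not under-represented. Clipping trades a small bias for lower variance.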

Navigating the Shortcut Maze: A Comprehensive Analysis of Shortcut Learning in Text Classification by Language Models

Sep 26, 2024

Yuqing Zhou, Ruixiang Tang, Ziyu Yao, Ziwei Zhu

Abstract: Language models (LMs), despite their advances, often depend on spurious correlations, undermining their accuracy and generalizability. This study addresses the overlooked impact of subtler, more complex shortcuts that compromise model reliability beyond oversimplified shortcuts. We introduce a comprehensive benchmark that categorizes shortcuts into occurrence, style, and concept, aiming to explore the nuanced ways in which these shortcuts influence the performance of LMs. Through extensive experiments across traditional LMs, large language models, and state-of-the-art robust models, our research systematically investigates models' resilience and susceptibilities to sophisticated shortcuts. Our benchmark and code can be found at: https://github.com/yuqing-zhou/shortcut-learning-in-text-classification.
