Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Smriti Singh

From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Feb 20, 2024

Aishik Rakshit, Smriti Singh, Shuvam Keshari, Arijit Ghosh Chowdhury, Vinija Jain, Aman Chadha

Figure 1 for From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Figure 2 for From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Figure 3 for From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Figure 4 for From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

Abstract:Embeddings play a pivotal role in the efficacy of Large Language Models. They are the bedrock on which these models grasp contextual relationships and foster a more nuanced understanding of language and consequently perform remarkably on a plethora of complex tasks that require a fundamental understanding of human language. Given that these embeddings themselves often reflect or exhibit bias, it stands to reason that these models may also inadvertently learn this bias. In this work, we build on the seminal previous work and propose DeepSoftDebias, an algorithm that uses a neural network to perform 'soft debiasing'. We exhaustively evaluate this algorithm across a variety of SOTA datasets, accuracy metrics, and challenging NLP tasks. We find that DeepSoftDebias outperforms the current state-of-the-art methods at reducing bias across gender, race, and religion.

Via

Access Paper or Ask Questions

Language Models (Mostly) Do Not Consider Emotion Triggers When Predicting Emotion

Nov 16, 2023

Smriti Singh, Cornelia Caragea, Junyi Jessy Li

Abstract:Situations and events evoke emotions in humans, but to what extent do they inform the prediction of emotion detection models? Prior work in emotion trigger or cause identification focused on training models to recognize events that trigger an emotion. Instead, this work investigates how well human-annotated emotion triggers correlate with features that models deemed salient in their prediction of emotions. First, we introduce a novel dataset EmoTrigger, consisting of 900 social media posts sourced from three different datasets; these were annotated by experts for emotion triggers with high agreement. Using EmoTrigger, we evaluate the ability of large language models (LLMs) to identify emotion triggers, and conduct a comparative analysis of the features considered important for these tasks between LLMs and fine-tuned models. Our analysis reveals that emotion triggers are largely not considered salient features for emotion prediction models, instead there is intricate interplay between various features and the task of emotion detection.

Via

Access Paper or Ask Questions