Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hillary Dawkins

WMT24 Test Suite: Gender Resolution in Speaker-Listener Dialogue Roles

Nov 09, 2024

Hillary Dawkins, Isar Nejadgholi, Chi-kiu Lo

Abstract:We assess the difficulty of gender resolution in literary-style dialogue settings and the influence of gender stereotypes. Instances of the test suite contain spoken dialogue interleaved with external meta-context about the characters and the manner of speaking. We find that character and manner stereotypes outside of the dialogue significantly impact the gender agreement of referents within the dialogue.

Via

Access Paper or Ask Questions

Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Jun 21, 2024

Kathleen C. Fraser, Hillary Dawkins, Svetlana Kiritchenko

Figure 1 for Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Figure 2 for Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Figure 3 for Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Figure 4 for Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods

Abstract:Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as combating the spread of misinformation and political propaganda. The task of AI-generated text (AIGT) detection is therefore both very challenging, and highly critical. In this survey, we summarize state-of-the art approaches to AIGT detection, including watermarking, statistical and stylistic analysis, and machine learning classification. We also provide information about existing datasets for this task. Synthesizing the research findings, we aim to provide insight into the salient factors that combine to determine how "detectable" AIGT text is under different scenarios, and to make practical recommendations for future work towards this significant technical and societal challenge.

Via

Access Paper or Ask Questions

Projective Methods for Mitigating Gender Bias in Pre-trained Language Models

Mar 27, 2024

Hillary Dawkins, Isar Nejadgholi, Daniel Gillis, Judi McCuaig

Abstract:Mitigation of gender bias in NLP has a long history tied to debiasing static word embeddings. More recently, attention has shifted to debiasing pre-trained language models. We study to what extent the simplest projective debiasing methods, developed for word embeddings, can help when applied to BERT's internal representations. Projective methods are fast to implement, use a small number of saved parameters, and make no updates to the existing model parameters. We evaluate the efficacy of the methods in reducing both intrinsic bias, as measured by BERT's next sentence prediction task, and in mitigating observed bias in a downstream setting when fine-tuned. To this end, we also provide a critical analysis of a popular gender-bias assessment test for quantifying intrinsic bias, resulting in an enhanced test set and new bias measures. We find that projective methods can be effective at both intrinsic bias and downstream bias mitigation, but that the two outcomes are not necessarily correlated. This finding serves as a warning that intrinsic bias test sets, based either on language modeling tasks or next sentence prediction, should not be the only benchmark in developing a debiased language model.

Via

Access Paper or Ask Questions

Second Order WinoBias Test Set for Latent Gender Bias Detection in Coreference Resolution

Sep 28, 2021

Hillary Dawkins

Figure 1 for Second Order WinoBias Test Set for Latent Gender Bias Detection in Coreference Resolution

Figure 2 for Second Order WinoBias Test Set for Latent Gender Bias Detection in Coreference Resolution

Figure 3 for Second Order WinoBias Test Set for Latent Gender Bias Detection in Coreference Resolution

Figure 4 for Second Order WinoBias Test Set for Latent Gender Bias Detection in Coreference Resolution

Abstract:We observe an instance of gender-induced bias in a downstream application, despite the absence of explicit gender words in the test cases. We provide a test set, SoWinoBias, for the purpose of measuring such latent gender bias in coreference resolution systems. We evaluate the performance of current debiasing methods on the SoWinoBias test set, especially in reference to the method's design and altered embedding space properties. See https://github.com/hillarydawkins/SoWinoBias.

* Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing, 103-111 (2021)
* 7+2 pages. Published in the proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing (co-located with ACL-IJCNLP 2021)

Via

Access Paper or Ask Questions

Marked Attribute Bias in Natural Language Inference

Sep 28, 2021

Hillary Dawkins

Figure 1 for Marked Attribute Bias in Natural Language Inference

Figure 2 for Marked Attribute Bias in Natural Language Inference

Figure 3 for Marked Attribute Bias in Natural Language Inference

Figure 4 for Marked Attribute Bias in Natural Language Inference

Abstract:Reporting and providing test sets for harmful bias in NLP applications is essential for building a robust understanding of the current problem. We present a new observation of gender bias in a downstream NLP application: marked attribute bias in natural language inference. Bias in downstream applications can stem from training data, word embeddings, or be amplified by the model in use. However, focusing on biased word embeddings is potentially the most impactful first step due to their universal nature. Here we seek to understand how the intrinsic properties of word embeddings contribute to this observed marked attribute effect, and whether current post-processing methods address the bias successfully. An investigation of the current debiasing landscape reveals two open problems: none of the current debiased embeddings mitigate the marked attribute error, and none of the intrinsic bias measures are predictive of the marked attribute effect. By noticing that a new type of intrinsic bias measure correlates meaningfully with the marked attribute effect, we propose a new postprocessing debiasing scheme for static word embeddings. The proposed method applied to existing embeddings achieves new best results on the marked attribute bias test set. See https://github.com/hillary-dawkins/MAB.

* Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 4214-4226 (2021)
* 9+4 pages. Published in Findings of the ACL (ACL-IJCNLP 2021)

Via

Access Paper or Ask Questions