Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jan-David Krieger

Exploiting Transformer-based Multitask Learning for the Detection of Media Bias in News Articles

Nov 07, 2022

Timo Spinde, Jan-David Krieger, Terry Ruas, Jelena Mitrović, Franz Götz-Hahn, Akiko Aizawa, Bela Gipp

Abstract:Media has a substantial impact on the public perception of events. A one-sided or polarizing perspective on any topic is usually described as media bias. One of the ways how bias in news articles can be introduced is by altering word choice. Biased word choices are not always obvious, nor do they exhibit high context-dependency. Hence, detecting bias is often difficult. We propose a Transformer-based deep learning architecture trained via Multi-Task Learning using six bias-related data sets to tackle the media bias detection problem. Our best-performing implementation achieves a macro $F_{1}$ of 0.776, a performance boost of 3\% compared to our baseline, outperforming existing methods. Our results indicate Multi-Task Learning as a promising alternative to improve existing baseline models in identifying slanted reporting.

* Proceedings of the iConference 2022

Via

Access Paper or Ask Questions

Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

Sep 29, 2022

Timo Spinde, Manuel Plank, Jan-David Krieger, Terry Ruas, Bela Gipp, Akiko Aizawa

Figure 1 for Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

Figure 2 for Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

Figure 3 for Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

Figure 4 for Neural Media Bias Detection Using Distant Supervision With BABE -- Bias Annotations By Experts

Abstract:Media coverage has a substantial effect on the public perception of events. Nevertheless, media outlets are often biased. One way to bias news articles is by altering the word choice. The automatic identification of bias by word choice is challenging, primarily due to the lack of a gold standard data set and high context dependencies. This paper presents BABE, a robust and diverse data set created by trained experts, for media bias research. We also analyze why expert labeling is essential within this domain. Our data set offers better annotation quality and higher inter-annotator agreement than existing work. It consists of 3,700 sentences balanced among topics and outlets, containing media bias labels on the word and sentence level. Based on our data, we also introduce a way to detect bias-inducing sentences in news articles automatically. Our best performing BERT-based model is pre-trained on a larger corpus consisting of distant labels. Fine-tuning and evaluating the model on our proposed supervised data set, we achieve a macro F1-score of 0.804, outperforming existing methods.

* Findings of the Association for Computational Linguistics: EMNLP 2021
* arXiv admin note: substantial text overlap with arXiv:2112.13352

Via

Access Paper or Ask Questions

A Domain-adaptive Pre-training Approach for Language Bias Detection in News

May 22, 2022

Jan-David Krieger, Timo Spinde, Terry Ruas, Juhi Kulshrestha, Bela Gipp

Figure 1 for A Domain-adaptive Pre-training Approach for Language Bias Detection in News

Figure 2 for A Domain-adaptive Pre-training Approach for Language Bias Detection in News

Figure 3 for A Domain-adaptive Pre-training Approach for Language Bias Detection in News

Abstract:Media bias is a multi-faceted construct influencing individual behavior and collective decision-making. Slanted news reporting is the result of one-sided and polarized writing which can occur in various forms. In this work, we focus on an important form of media bias, i.e. bias by word choice. Detecting biased word choices is a challenging task due to its linguistic complexity and the lack of representative gold-standard corpora. We present DA-RoBERTa, a new state-of-the-art transformer-based model adapted to the media bias domain which identifies sentence-level bias with an F1 score of 0.814. In addition, we also train, DA-BERT and DA-BART, two more transformer models adapted to the bias domain. Our proposed domain-adapted models outperform prior bias detection approaches on the same data.

* Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries 2022 (JCDL)

Via

Access Paper or Ask Questions