Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ange Richard

FRACAS: A FRench Annotated Corpus of Attribution relations in newS

Sep 19, 2023

Ange Richard, Laura Alonzo-Canul, François Portet

Figure 1 for FRACAS: A FRench Annotated Corpus of Attribution relations in newS

Figure 2 for FRACAS: A FRench Annotated Corpus of Attribution relations in newS

Figure 3 for FRACAS: A FRench Annotated Corpus of Attribution relations in newS

Figure 4 for FRACAS: A FRench Annotated Corpus of Attribution relations in newS

Abstract:Quotation extraction is a widely useful task both from a sociological and from a Natural Language Processing perspective. However, very little data is available to study this task in languages other than English. In this paper, we present a manually annotated corpus of 1676 newswire texts in French for quotation extraction and source attribution. We first describe the composition of our corpus and the choices that were made in selecting the data. We then detail the annotation guidelines and annotation process, as well as a few statistics about the final corpus and the obtained balance between quote types (direct, indirect and mixed, which are particularly challenging). We end by detailing our inter-annotator agreement between the 8 annotators who worked on manual labelling, which is substantially high for such a difficult linguistic phenomenon.

Via

Access Paper or Ask Questions

GenderedNews: Une approche computationnelle des écarts de représentation des genres dans la presse française

Mar 07, 2022

Ange Richard, Gilles Bastin, François Portet

Figure 1 for GenderedNews: Une approche computationnelle des écarts de représentation des genres dans la presse française

Figure 2 for GenderedNews: Une approche computationnelle des écarts de représentation des genres dans la presse française

Figure 3 for GenderedNews: Une approche computationnelle des écarts de représentation des genres dans la presse française

Figure 4 for GenderedNews: Une approche computationnelle des écarts de représentation des genres dans la presse française

Abstract:In this article, we present {\it GenderedNews} (\url{https://gendered-news.imag.fr}), an online dashboard which gives weekly measures of gender imbalance in French online press. We use Natural Language Processing (NLP) methods to quantify gender inequalities in the media, in the wake of global projects like the Global Media Monitoring Project. Such projects are instrumental in highlighting gender imbalance in the media and its very slow evolution. However, their generalisation is limited by their sampling and cost in terms of time, data and staff. Automation allows us to offer complementary measures to quantify inequalities in gender representation. We understand representation as the presence and distribution of men and women mentioned and quoted in the news -- as opposed to representation as stereotypification. In this paper, we first review different means adopted by previous studies on gender inequality in the media : qualitative content analysis, quantitative content analysis and computational methods. We then detail the methods adopted by {\it GenderedNews} and the two metrics implemented: the masculinity rate of mentions and the proportion of men quoted in online news. We describe the data collected daily (seven main titles of French online news media) and the methodology behind our metrics, as well as a few visualisations. We finally propose to illustrate possible analysis of our data by conducting an in-depth observation of a sample of two months of our database.

* Paper in French

Via

Access Paper or Ask Questions