Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

May 25, 2022

Qingyu Tan, Lu Xu, Lidong Bing, Hwee Tou Ng

Figure 1 for Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

Figure 2 for Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

Figure 3 for Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

Figure 4 for Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

Share this with someone who'll enjoy it:

Abstract:The DocRED dataset is one of the most popular and widely used benchmarks for document-level relation extraction (RE). It adopts a recommend-revise annotation scheme so as to have a large-scale annotated dataset. However, we find that the annotation of DocRED is incomplete, i.e., the false negative samples are prevalent. We analyze the causes and effects of the overwhelming false negative problem in the DocRED dataset. To address the shortcoming, we re-annotate 4,053 documents in the DocRED dataset by adding the missed relation triples back to the original DocRED. We name our revised DocRED dataset Re-DocRED. We conduct extensive experiments with state-of-the-art neural models on both datasets, and the experimental results show that the models trained and evaluated on our Re-DocRED achieve performance improvements of around 13 F1 points. Moreover, we propose different metrics to comprehensively evaluate the document-level RE task. We make our data publicly available at https://github.com/tonytan48/Re-DocRED.

* 15 pages, 1 figure, 17 tables

View paper on

Share this with someone who'll enjoy it:

Title:Revisiting DocRED -- Addressing the Overlooked False Negative Problem in Relation Extraction

Paper and Code