Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Daniela Stepanov

Crowdsourcing a High-Quality Gold Standard for QA-SRL

Nov 08, 2019

Paul Roit, Ayal Klein, Daniela Stepanov, Jonathan Mamou, Julian Michael, Gabriel Stanovsky, Luke Zettlemoyer, Ido Dagan

Figure 1 for Crowdsourcing a High-Quality Gold Standard for QA-SRL

Figure 2 for Crowdsourcing a High-Quality Gold Standard for QA-SRL

Figure 3 for Crowdsourcing a High-Quality Gold Standard for QA-SRL

Figure 4 for Crowdsourcing a High-Quality Gold Standard for QA-SRL

Abstract:Question-answer driven Semantic Role Labeling (QA-SRL) has been proposed as an attractive open and natural form of SRL, easily crowdsourceable for new corpora. Recently, a large-scale QA-SRL corpus and a trained parser were released, accompanied by a densely annotated dataset for evaluation. Trying to replicate the QA-SRL annotation and evaluation scheme for new texts, we observed that the resulting annotations were lacking in quality and coverage, particularly insufficient for creating gold standards for evaluation. In this paper, we present an improved QA-SRL annotation protocol, involving crowd-worker selection and training, followed by data consolidation. Applying this process, we release a new gold evaluation dataset for QA-SRL, yielding more consistent annotations and greater coverage. We believe that our new annotation protocol and gold standard will facilitate future replicable research of natural semantic annotations.

Via

Access Paper or Ask Questions

HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

Jan 25, 2018

Akari Asai, Sara Evensen, Behzad Golshan, Alon Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan, Yinzhan Xu

Figure 1 for HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

Figure 2 for HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

Figure 3 for HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

Figure 4 for HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments

Abstract:The science of happiness is an area of positive psychology concerned with understanding what behaviors make people happy in a sustainable fashion. Recently, there has been interest in developing technologies that help incorporate the findings of the science of happiness into users' daily lives by steering them towards behaviors that increase happiness. With the goal of building technology that can understand how people express their happy moments in text, we crowd-sourced HappyDB, a corpus of 100,000 happy moments that we make publicly available. This paper describes HappyDB and its properties, and outlines several important NLP problems that can be studied with the help of the corpus. We also apply several state-of-the-art analysis techniques to analyze HappyDB. Our results demonstrate the need for deeper NLP techniques to be developed which makes HappyDB an exciting resource for follow-on research.

* Typos fixed

Via

Access Paper or Ask Questions