Abstract: Reading a document and extracting an answer to a question about its content has recently attracted substantial attention. While most work has focused on the interaction between the question and the document, in this work we evaluate the importance of context when the question and the document are processed independently. We take a standard neural architecture for this task and show that by providing rich contextualized word representations from a large pre-trained language model, and by allowing the model to choose between context-dependent and context-independent word representations, we obtain dramatic improvements and reach performance comparable to the state of the art on the competitive SQuAD dataset.
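To make the idea of choosing between representations concrete, here is a minimal sketch, assuming a learned sigmoid gate that mixes a static embedding (e.g., GloVe) with a contextualized embedding from a pre-trained language model. The class name GatedWordRepresentation, the dimensions, and the gating scheme are illustrative assumptions, not the paper's exact architecture.

```python
# Minimal sketch (assumption: not the paper's implementation) of letting a model
# choose, per word and per dimension, between a context-independent embedding
# and a contextualized embedding from a pre-trained language model.
import torch
import torch.nn as nn

class GatedWordRepresentation(nn.Module):
    def __init__(self, static_dim: int, context_dim: int, out_dim: int):
        super().__init__()
        # Project both representations into a shared space.
        self.proj_static = nn.Linear(static_dim, out_dim)
        self.proj_context = nn.Linear(context_dim, out_dim)
        # The gate is computed from the concatenation of both inputs.
        self.gate = nn.Linear(static_dim + context_dim, out_dim)

    def forward(self, static_emb: torch.Tensor, context_emb: torch.Tensor) -> torch.Tensor:
        # static_emb:  (batch, seq_len, static_dim), e.g. GloVe vectors
        # context_emb: (batch, seq_len, context_dim), e.g. LM hidden states
        g = torch.sigmoid(self.gate(torch.cat([static_emb, context_emb], dim=-1)))
        # Per-dimension convex combination of the two representations.
        return g * self.proj_context(context_emb) + (1 - g) * self.proj_static(static_emb)

if __name__ == "__main__":
    layer = GatedWordRepresentation(static_dim=300, context_dim=1024, out_dim=256)
    glove = torch.randn(2, 12, 300)   # toy static embeddings
    lm = torch.randn(2, 12, 1024)     # toy contextualized embeddings
    print(layer(glove, lm).shape)     # torch.Size([2, 12, 256])
```

A gate of this form lets the model fall back on the context-independent vector when the contextualized one is uninformative, which is one simple way to realize the choice the abstract describes.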
Abstract: The reading comprehension task, in which questions are asked about a given evidence document, is a central problem in natural language understanding. Recent formulations of this task have typically focused on answer selection from a set of candidates defined in advance, either manually or through the use of an external NLP pipeline. However, Rajpurkar et al. (2016) recently released the SQuAD dataset, in which the answers can be arbitrary strings from the supplied text. In this paper, we focus on this answer-extraction task, presenting a novel model architecture that efficiently builds fixed-length representations of all spans in the evidence document with a recurrent network. We show that scoring explicit span representations significantly improves performance over other approaches that factor the prediction into separate predictions about words or about start and end markers. Our approach improves upon the best published results of Wang & Jiang (2016) by 5% and decreases the error of Rajpurkar et al.'s baseline by more than 50%.
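The span-scoring idea can be sketched as follows: encode the passage with a recurrent network, build a fixed-length representation for every span up to a maximum length from its endpoint hidden states, and score each span with a small feed-forward network. This is a simplified sketch of the general technique, not the authors' exact model; the SpanScorer class, the endpoint-concatenation representation, and all dimensions are assumptions.

```python
# Minimal sketch (assumption: simplified, not the paper's exact architecture)
# of scoring explicit span representations built from recurrent hidden states.
import torch
import torch.nn as nn

class SpanScorer(nn.Module):
    def __init__(self, hidden_dim: int, max_span_len: int = 10):
        super().__init__()
        self.max_span_len = max_span_len
        # Encode the passage with a bidirectional LSTM.
        self.encoder = nn.LSTM(hidden_dim, hidden_dim, batch_first=True, bidirectional=True)
        # Score each span from its concatenated endpoint states.
        self.scorer = nn.Sequential(
            nn.Linear(4 * hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, token_embs: torch.Tensor):
        # token_embs: (batch, seq_len, hidden_dim)
        states, _ = self.encoder(token_embs)  # (batch, seq_len, 2*hidden_dim)
        batch, seq_len, _ = states.shape
        spans, scores = [], []
        for start in range(seq_len):
            for end in range(start, min(start + self.max_span_len, seq_len)):
                # Fixed-length span representation: the two endpoint states.
                rep = torch.cat([states[:, start], states[:, end]], dim=-1)
                scores.append(self.scorer(rep))  # (batch, 1)
                spans.append((start, end))
        # (batch, num_spans): a softmax over this gives a distribution over spans.
        return torch.cat(scores, dim=-1), spans

if __name__ == "__main__":
    model = SpanScorer(hidden_dim=64, max_span_len=5)
    scores, spans = model(torch.randn(2, 20, 64))
    best = scores.softmax(dim=-1).argmax(dim=-1)
    print([spans[i] for i in best.tolist()])  # most likely answer span per example
```

Normalizing over whole spans in this way, rather than predicting start and end markers independently, is the contrast the abstract draws with factored prediction approaches.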