Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Eva Schlinger

Open Question Answering over Tables and Text

Oct 20, 2020

Wenhu Chen, Ming-Wei Chang, Eva Schlinger, William Wang, William W. Cohen

Figure 1 for Open Question Answering over Tables and Text

Figure 2 for Open Question Answering over Tables and Text

Figure 3 for Open Question Answering over Tables and Text

Figure 4 for Open Question Answering over Tables and Text

Abstract:In open question answering (QA), the answer to a question is produced by retrieving and then analyzing documents that might contain answers to the question. Most open QA systems have considered only retrieving information from unstructured text. Here we consider for the first time open QA over both tabular and textual data and present a new large-scale dataset Open Table-Text Question Answering (OTT-QA) to evaluate performance on this task. Most questions in OTT-QA require multi-hop inference across tabular data and unstructured text, and the evidence required to answer a question can be distributed in different ways over these two types of input, making evidence retrieval challenging---our baseline model using an iterative retriever and BERT-based reader achieves an exact match score less than 10%. We then propose two novel techniques to address the challenge of retrieving and aggregating evidence for OTT-QA. The first technique is to use "early fusion" to group multiple highly relevant tabular and textual units into a fused block, which provides more context for the retriever to search for. The second technique is to use a cross-block reader to model the cross-dependency between multiple retrieved evidences with global-local sparse attention. Combining these two techniques improves the score significantly, to above 27%.

* Technical Report

Via

Access Paper or Ask Questions

How multilingual is Multilingual BERT?

Jun 04, 2019

Telmo Pires, Eva Schlinger, Dan Garrette

Figure 1 for How multilingual is Multilingual BERT?

Figure 2 for How multilingual is Multilingual BERT?

Figure 3 for How multilingual is Multilingual BERT?

Figure 4 for How multilingual is Multilingual BERT?

Abstract:In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as a single language model pre-trained from monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in which task-specific annotations in one language are used to fine-tune the model for evaluation in another language. To understand why, we present a large number of probing experiments, showing that transfer is possible even to languages in different scripts, that transfer works best between typologically similar languages, that monolingual corpora can train models for code-switching, and that the model can find translation pairs. From these results, we can conclude that M-BERT does create multilingual representations, but that these representations exhibit systematic deficiencies affecting certain language pairs.

Via

Access Paper or Ask Questions