Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:RoR: Read-over-Read for Long Document Machine Reading Comprehension

Sep 14, 2021

Jing Zhao, Junwei Bao, Yifan Wang, Yongwei Zhou, Youzheng Wu, Xiaodong He, Bowen Zhou

Figure 1 for RoR: Read-over-Read for Long Document Machine Reading Comprehension

Figure 2 for RoR: Read-over-Read for Long Document Machine Reading Comprehension

Figure 3 for RoR: Read-over-Read for Long Document Machine Reading Comprehension

Figure 4 for RoR: Read-over-Read for Long Document Machine Reading Comprehension

Share this with someone who'll enjoy it:

Abstract:Transformer-based pre-trained models, such as BERT, have achieved remarkable results on machine reading comprehension. However, due to the constraint of encoding length (e.g., 512 WordPiece tokens), a long document is usually split into multiple chunks that are independently read. It results in the reading field being limited to individual chunks without information collaboration for long document machine reading comprehension. To address this problem, we propose RoR, a read-over-read method, which expands the reading field from chunk to document. Specifically, RoR includes a chunk reader and a document reader. The former first predicts a set of regional answers for each chunk, which are then compacted into a highly-condensed version of the original document, guaranteeing to be encoded once. The latter further predicts the global answers from this condensed document. Eventually, a voting strategy is utilized to aggregate and rerank the regional and global answers for final prediction. Extensive experiments on two benchmarks QuAC and TriviaQA demonstrate the effectiveness of RoR for long document reading. Notably, RoR ranks 1st place on the QuAC leaderboard (https://quac.ai/) at the time of submission (May 17th, 2021).

* Accepted as findings of EMNLP2021

View paper on

Share this with someone who'll enjoy it:

Title:RoR: Read-over-Read for Long Document Machine Reading Comprehension

Paper and Code