We propose a novel method for applying Transformer models to extractive question answering (QA) tasks. Recently, pretrained generative sequence-to-sequence (seq2seq) models have achieved great success in question answering. Contributing to the success of these models are internal attention mechanisms such as cross-attention. We propose a simple strategy to obtain an extractive answer span from the generative model by leveraging the decoder cross-attention patterns. Viewing cross-attention as an architectural prior, we apply joint training to further improve QA performance. Empirical results show that on open-domain question answering datasets such as NaturalQuestions and TriviaQA, our method approaches state-of-the-art performance on both generative and extractive inference, all while using far fewer parameters. Furthermore, this strategy allows us to perform hallucination-free inference while significantly improving the model's ability to rerank relevant passages.
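To make the span-extraction idea concrete, the following is a minimal sketch, not the paper's actual procedure: it assumes the decoder cross-attention weights have been collected while generating the answer, aggregates them into a per-token relevance score over the input passage, and returns the highest-scoring contiguous span. The function name, tensor layout, and the `max_span_len` parameter are illustrative assumptions; the paper's exact aggregation and joint-training details are not specified in this abstract.

```python
import torch

def extract_span_from_cross_attention(cross_attn, max_span_len=20):
    """Illustrative sketch: select an extractive answer span from decoder
    cross-attention weights.

    cross_attn: tensor of shape (num_layers, num_heads, tgt_len, src_len),
                i.e. the decoder-to-encoder attention collected while the
                seq2seq model generates the answer.
    Returns (start, end) token indices into the input passage.
    """
    # Average over layers and heads, then sum over decoder (answer) steps
    # to obtain one relevance score per input (passage) token.
    token_scores = cross_attn.mean(dim=(0, 1)).sum(dim=0)  # (src_len,)

    # Pick the contiguous span with the highest mean token score.
    src_len = token_scores.size(0)
    best_score, best_span = float("-inf"), (0, 0)
    for start in range(src_len):
        for end in range(start + 1, min(start + max_span_len, src_len) + 1):
            score = token_scores[start:end].mean().item()
            if score > best_score:
                best_score, best_span = score, (start, end)
    return best_span
```

Because the returned span is copied verbatim from the input passage, an answer produced this way cannot contain tokens the model invented, which is the sense in which inference is hallucination-free.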