Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Understanding Attention in Machine Reading Comprehension

Aug 26, 2021

Yiming Cui, Wei-Nan Zhang, Wanxiang Che, Ting Liu, Zhigang Chen

Figure 1 for Understanding Attention in Machine Reading Comprehension

Figure 2 for Understanding Attention in Machine Reading Comprehension

Figure 3 for Understanding Attention in Machine Reading Comprehension

Figure 4 for Understanding Attention in Machine Reading Comprehension

Share this with someone who'll enjoy it:

Abstract:Achieving human-level performance on some of Machine Reading Comprehension (MRC) datasets is no longer challenging with the help of powerful Pre-trained Language Models (PLMs). However, the internal mechanism of these artifacts still remains unclear, placing an obstacle for further understanding these models. This paper focuses on conducting a series of analytical experiments to examine the relations between the multi-head self-attention and the final performance, trying to analyze the potential explainability in PLM-based MRC models. We perform quantitative analyses on SQuAD (English) and CMRC 2018 (Chinese), two span-extraction MRC datasets, on top of BERT, ALBERT, and ELECTRA in various aspects. We discover that {\em passage-to-question} and {\em passage understanding} attentions are the most important ones, showing strong correlations to the final performance than other parts. Through visualizations and case studies, we also observe several general findings on the attention maps, which could be helpful to understand how these models solve the questions.

* 11 pages

View paper on

Share this with someone who'll enjoy it:

Title:Understanding Attention in Machine Reading Comprehension

Paper and Code