Xiangyang Mou

A Survey of Machine Narrative Reading Comprehension Assessments

Apr 30, 2022

Yisi Sang, Xiangyang Mou, Jing Li, Jeffrey Stanton, Mo Yu

Abstract: As the body of research on machine narrative comprehension grows, there is a critical need for consideration of performance assessment strategies as well as the depth and scope of different benchmark tasks. Based on narrative theories, reading comprehension theories, and existing machine narrative reading comprehension tasks and datasets, we propose a typology that captures the main similarities and differences among assessment tasks, and we discuss the implications of our typology for new task design and the challenges of narrative reading comprehension.

* Accepted for the IJCAI-ECAI 2022 Survey Track

TVShowGuess: Character Comprehension in Stories as Speaker Guessing

Apr 16, 2022

Yisi Sang, Xiangyang Mou, Mo Yu, Shunyu Yao, Jing Li, Jeffrey Stanton

Abstract: We propose a new task for assessing machines' skill at understanding fictional characters in narrative stories. The task, TVShowGuess, builds on the scripts of TV series and takes the form of guessing the anonymous main characters based on the backgrounds of the scenes and the dialogues. Our human study indicates that this form of task covers comprehension of multiple types of character persona, including understanding characters' personalities, facts, and memories of personal experience, which are well aligned with psychological and literary theories about the theory of mind (ToM) that human beings apply to understand fictional characters during reading. We further propose new model architectures to support the contextualized encoding of long scene texts. Experiments show that our proposed approaches significantly outperform baselines, yet still lag far behind the (nearly perfect) human performance. Our work serves as a first step toward the goal of narrative character comprehension.

* Accepted at NAACL 2022

Efficient Long Sequence Encoding via Synchronization

Mar 15, 2022

Xiangyang Mou, Mo Yu, Bingsheng Yao, Lifu Huang

Abstract: Pre-trained Transformer models have achieved success in a wide range of NLP tasks, but are inefficient when dealing with long input sequences. Existing studies try to overcome this challenge by segmenting the long sequence and then applying hierarchical encoding or post-hoc aggregation. We propose a synchronization mechanism for hierarchical encoding. Our approach first identifies anchor tokens across segments and groups them by their roles in the original input sequence. Then, inside the Transformer layer, anchor embeddings are synchronized within their group via a self-attention module. Our approach is a general and flexible framework: when adapted to a new task, it can easily be enhanced with task-specific anchor definitions. Experiments on two representative tasks with different types of long input texts, the NarrativeQA summary setting and wild multi-hop reasoning from HotpotQA, demonstrate that our approach improves the global information exchange among segments while maintaining efficiency.

* 5 pages, short paper
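
The synchronization step described in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `(segment_index, token_index, group_id)` anchor format, and the use of plain scaled dot-product attention without learned projections are all assumptions made for illustration. It shows only the core idea that anchors sharing a group exchange information across segments while all other tokens are left untouched.

```python
import math
from collections import defaultdict


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def synchronize(segments, anchors):
    """Mix anchor embeddings within each group (hypothetical helper).

    segments: list of segments, each a list of token vectors (lists of floats).
    anchors:  list of (segment_index, token_index, group_id) tuples.
    Returns a new nested list; the input is not modified.
    """
    # Group anchor positions by their role in the original input.
    groups = defaultdict(list)
    for seg_i, tok_i, group_id in anchors:
        groups[group_id].append((seg_i, tok_i))

    # Copy so that non-anchor tokens are carried over unchanged.
    out = [[list(vec) for vec in seg] for seg in segments]

    for members in groups.values():
        vecs = [segments[s][t] for s, t in members]
        scale = math.sqrt(len(vecs[0]))
        for (s, t), query in zip(members, vecs):
            # Assumption: unparameterized scaled dot-product self-attention.
            scores = [sum(q * k for q, k in zip(query, key)) / scale
                      for key in vecs]
            weights = softmax(scores)
            out[s][t] = [sum(w * v[d] for w, v in zip(weights, vecs))
                         for d in range(len(query))]
    return out


# Two segments of two tokens each; the first token of each segment is an
# anchor with the same (made-up) role "question".
segments = [[[1.0, 0.0], [0.0, 1.0]],
            [[0.0, 1.0], [1.0, 0.0]]]
anchors = [(0, 0, "question"), (1, 0, "question")]
synced = synchronize(segments, anchors)
```

After the call, each anchor embedding is a convex combination of the anchors in its group, so information from one segment reaches the other without running full attention over the concatenated sequence.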

Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study

Jun 07, 2021

Xiangyang Mou, Chenghao Yang, Mo Yu, Bingsheng Yao, Xiaoxiao Guo, Saloni Potdar, Hui Su

Abstract: Recent advancements in open-domain question answering (ODQA), i.e., finding answers from a large open-domain corpus like Wikipedia, have led to human-level performance on many datasets. However, progress in QA over book stories (Book QA) lags behind despite its similar task formulation to ODQA. This work provides a comprehensive and quantitative analysis of the difficulty of Book QA: (1) We benchmark the research on the NarrativeQA dataset through extensive experiments with cutting-edge ODQA techniques. This quantifies the challenges Book QA poses and advances the published state of the art with a ~7% absolute improvement on Rouge-L. (2) We further analyze the detailed challenges in Book QA through human studies (https://github.com/gorov/BookQA). Our findings indicate that event-centric questions dominate this task, which exemplifies the inability of existing QA models to handle event-oriented scenarios.

* Accepted to TACL

Complementary Evidence Identification in Open-Domain Question Answering

Apr 05, 2021

Xiangyang Mou, Mo Yu, Shiyu Chang, Yufei Feng, Li Zhang, Hui Su

Abstract: This paper proposes a new problem of complementary evidence identification for open-domain question answering (QA). The problem aims to efficiently find a small set of passages that covers full evidence from multiple aspects so as to answer a complex question. To this end, we propose a method that learns vector representations of passages and models the sufficiency and diversity within the selected set, in addition to the relevance between the question and passages. Our experiments demonstrate that our method considers the dependence within the supporting evidence and significantly improves the accuracy of complementary evidence selection in the QA domain.

* 7 pages, EACL 2021
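
To make the trade-off between relevance and diversity in the abstract above concrete, here is a small sketch. It is not the paper's learned model: it swaps in a simple greedy, maximal-marginal-relevance-style selection over fixed vectors, and the function names, the `redundancy_weight` parameter, and the toy vectors are all assumptions for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def select_complementary(question, passages, k=2, redundancy_weight=0.5):
    """Greedily pick k passage indices (hypothetical helper).

    Each step scores a candidate by its relevance to the question minus a
    penalty for its similarity to passages that were already selected.
    """
    selected = []
    candidates = list(range(len(passages)))
    while candidates and len(selected) < k:
        def score(i):
            relevance = cosine(question, passages[i])
            redundancy = max(
                (cosine(passages[i], passages[j]) for j in selected),
                default=0.0,
            )
            return relevance - redundancy_weight * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Toy vectors: passages 0 and 1 are near-duplicates covering one aspect of
# the question, passage 2 covers the other aspect.
question = [1.0, 1.0]
passages = [[1.0, 0.2], [1.0, 0.25], [0.2, 1.0]]
chosen = select_complementary(question, passages, k=2)
```

In this toy setting the second pick skips the near-duplicate passage in favor of the one covering the other aspect, which is the behavior a relevance-only ranking would miss.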

Multimodal Dialogue State Tracking By QA Approach with Data Augmentation

Jul 20, 2020

Xiangyang Mou, Brandyn Sigouin, Ian Steenstra, Hui Su

Abstract: Recently, a more challenging state tracking task, Audio-Video Scene-Aware Dialogue (AVSD), has been attracting increasing attention among researchers. Unlike purely text-based dialogue state tracking, the dialogue in AVSD contains a sequence of question-answer pairs about a video, and the final answer to the given question requires additional understanding of the video. This paper interprets the AVSD task from an open-domain Question Answering (QA) point of view and proposes a multimodal open-domain QA system to deal with the problem. The proposed QA system uses a common encoder-decoder framework with multimodal fusion and attention. Teacher forcing is applied to train a natural language generator. We also propose a new data augmentation approach specifically under the QA assumption. Our experiments show that our model and techniques bring significant improvements over the baseline model on the DSTC7-AVSD dataset and demonstrate the potential of our data augmentation techniques.

* AAAI DSTC8 Workshop

Frustratingly Hard Evidence Retrieval for QA Over Books

Jul 20, 2020

Xiangyang Mou, Mo Yu, Bingsheng Yao, Chenghao Yang, Xiaoxiao Guo, Saloni Potdar, Hui Su

Abstract: A lot of progress has been made to improve question answering (QA) in recent years, but the special problem of QA over narrative book stories has not been explored in depth. We formulate BookQA as an open-domain QA task given its similar dependency on evidence retrieval. We further investigate how state-of-the-art open-domain QA approaches can help BookQA. Besides achieving state-of-the-art results on the NarrativeQA benchmark, our study also reveals, through a wealth of experiments and analysis, the difficulty of evidence retrieval in books, which necessitates future effort on novel solutions for evidence retrieval in BookQA.

* ACL 2020 NUSE Workshop, 6 pages
