Sheng-Syun Shen

Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine

Aug 23, 2016

Bo-Hsiang Tseng, Sheng-Syun Shen, Hung-Yi Lee, Lin-Shan Lee

Abstract: Multimedia or spoken content presents more attractive information than plain text content, but it is more difficult to display on a screen and for a user to select. As a result, accessing large collections of the former is much more difficult and time-consuming for humans than accessing the latter. It is highly attractive to develop a machine which can automatically understand spoken content and summarize the key information for humans to browse. In this endeavor, we propose a new task of machine comprehension of spoken content. We define the initial goal as the listening comprehension test of TOEFL, a challenging academic English examination for learners whose native language is not English. We further propose an Attention-based Multi-hop Recurrent Neural Network (AMRNN) architecture for this task, achieving encouraging results in the initial tests. Initial results have also shown that word-level attention is probably more robust than sentence-level attention for this task with ASR errors.

* Accepted conference paper: "The Annual Conference of the International Speech Communication Association (Interspeech), 2016"
