Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Isaac Cowhey

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Mar 14, 2018

Peter Clark, Isaac Cowhey, Oren Etzioni, Tushar Khot, Ashish Sabharwal, Carissa Schoenick, Oyvind Tafjord

Figure 1 for Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Figure 2 for Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Figure 3 for Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Figure 4 for Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Abstract:We present a new question set, text corpus, and baselines assembled to encourage AI research in advanced question answering. Together, these constitute the AI2 Reasoning Challenge (ARC), which requires far more powerful knowledge and reasoning than previous challenges such as SQuAD or SNLI. The ARC question set is partitioned into a Challenge Set and an Easy Set, where the Challenge Set contains only questions answered incorrectly by both a retrieval-based algorithm and a word co-occurence algorithm. The dataset contains only natural, grade-school science questions (authored for human tests), and is the largest public-domain set of this kind (7,787 questions). We test several baselines on the Challenge Set, including leading neural models from the SQuAD and SNLI tasks, and find that none are able to significantly outperform a random baseline, reflecting the difficult nature of this task. We are also releasing the ARC Corpus, a corpus of 14M science sentences relevant to the task, and implementations of the three neural baseline models tested. Can your model perform better? We pose ARC as a challenge to the community.

* 10 pages, 7 tables, 2 figures

Via

Access Paper or Ask Questions