Abstract: We are surprised to find that BERT's peak performance of 77% on the Argument Reasoning Comprehension Task reaches just three points below the average untrained human baseline. However, we show that this result is entirely accounted for by exploitation of spurious statistical cues in the dataset. We analyze the nature of these cues and demonstrate that a range of models all exploit them. This analysis informs the construction of an adversarial dataset on which all models achieve random accuracy. Our adversarial dataset provides a more robust assessment of argument comprehension and should be adopted as the standard in future work.
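The cue analysis behind this claim is largely lexical (the paper singles out tokens such as "not"). As a minimal illustrative sketch, not the authors' code, the snippet below estimates a unigram cue's productivity (how often it marks the correct warrant when it applies) and coverage (how often it applies), roughly following the paper's definitions; the toy data, tokenization, and function names are hypothetical.

from collections import Counter

def cue_stats(dataset):
    """Estimate productivity and coverage of unigram cues.

    Each item is assumed to be (warrant0_tokens, warrant1_tokens, label),
    where label in {0, 1} indexes the correct warrant. A cue "applies"
    when it appears in exactly one of the two warrants; it is "productive"
    when that warrant is the correct one.
    """
    applicable = Counter()
    productive = Counter()
    n = len(dataset)
    for w0, w1, label in dataset:
        s0, s1 = set(w0), set(w1)
        for cue in s0 ^ s1:               # tokens in exactly one warrant
            applicable[cue] += 1
            if int(cue in s1) == label:   # cue sits in the correct warrant
                productive[cue] += 1
    return {
        cue: {
            "productivity": productive[cue] / applicable[cue],
            "coverage": applicable[cue] / n,
        }
        for cue in applicable
    }

# Hypothetical toy data: "not" applies in every item and usually
# points to the correct warrant, so it looks like a strong cue.
data = [
    (["it", "is", "not", "fair"], ["it", "is", "fair"], 0),
    (["they", "should", "not", "go"], ["they", "should", "go"], 0),
    (["we", "can", "win"], ["we", "can", "not", "win"], 0),
]
print(cue_stats(data)["not"])  # {'productivity': 0.666..., 'coverage': 1.0}

A model can exploit such a cue without any argument comprehension, which is why rebalancing the cue distribution (the adversarial dataset) drives all models to random accuracy.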
Abstract: This report describes the entry by the Intelligent Knowledge Management (IKM) Lab in the WSDM 2019 Fake News Classification challenge. We treat the task as natural language inference (NLI). We individually train a number of the strongest NLI models as well as BERT. We ensemble these results and retrain with noisy labels in two stages. We analyze transitivity relations in the train and test sets and determine a set of test cases that can be reliably classified on this basis. The remaining test cases are classified by our ensemble. Our entry achieves a test set accuracy of 88.063%, placing 3rd in the competition.
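As a rough sketch of the transitivity step, not the authors' implementation, the snippet below unions titles connected by "agreed" pairs and records "disagreed" relations between the resulting clusters, so some test pairs can be labeled directly and the rest deferred to the ensemble. The label names follow the task's agreed/disagreed/unrelated scheme; the title identifiers and helper functions are hypothetical, and the exact propagation rules used in the entry may differ.

def build_clusters(train_pairs):
    """Union titles connected by 'agreed' edges (simple union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    def union(a, b):
        parent[find(a)] = find(b)

    for a, b, label in train_pairs:
        if label == "agreed":
            union(a, b)
    # Record disagreement at the cluster level after all unions.
    disagreements = set()
    for a, b, label in train_pairs:
        if label == "disagreed":
            disagreements.add(frozenset((find(a), find(b))))
    return find, disagreements

def infer(find, disagreements, a, b):
    """Return a label implied by transitivity, or None if undetermined."""
    ra, rb = find(a), find(b)
    if ra == rb:
        return "agreed"
    if frozenset((ra, rb)) in disagreements:
        return "disagreed"
    return None  # fall back to the model ensemble

train = [("t1", "t2", "agreed"), ("t2", "t3", "agreed"), ("t1", "t4", "disagreed")]
find, dis = build_clusters(train)
print(infer(find, dis, "t1", "t3"))  # agreed (via t2)
print(infer(find, dis, "t3", "t4"))  # disagreed (t3 sits in t1's cluster)
print(infer(find, dis, "t1", "t5"))  # None -> classify with the ensemble

Deciding these cases by graph structure rather than by model prediction removes one source of noise before the two-stage noisy-label retraining described above.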