Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Jul 15, 2021

Ishan Tarunesh, Somak Aditya, Monojit Choudhury

Figure 1 for Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Figure 2 for Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Figure 3 for Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Figure 4 for Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Share this with someone who'll enjoy it:

Abstract:The recent state-of-the-art natural language understanding (NLU) systems often behave unpredictably, failing on simpler reasoning examples. Despite this, there has been limited focus on quantifying progress towards systems with more predictable behavior. We think that reasoning capability-wise behavioral summary is a step towards bridging this gap. We create a CheckList test-suite (184K examples) for the Natural Language Inference (NLI) task, a representative NLU task. We benchmark state-of-the-art NLI systems on this test-suite, which reveals fine-grained insights into the reasoning abilities of BERT and RoBERTa. Our analysis further reveals inconsistencies of the models on examples derived from the same template or distinct templates but pertaining to same reasoning capability, indicating that generalizing the models' behavior through observations made on a CheckList is non-trivial. Through an user-study, we find that users were able to utilize behavioral information to generalize much better for examples predicted from RoBERTa, compared to that of BERT.

* 15 pages, 5 figures and 9 tables

View paper on

Share this with someone who'll enjoy it:

Title:Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task

Paper and Code