Title: Gestalt: a Stacking Ensemble for SQuAD2.0

Apr 02, 2020

Mohamed El-Geish

Abstract: We propose a deep-learning system for the SQuAD2.0 task that finds a correct answer to a question in a context paragraph, or indicates that no such answer exists. Our goal is to learn an ensemble of heterogeneous SQuAD2.0 models that, when blended properly, outperforms the best individual model in the ensemble. We created a stacking ensemble that combines the top-N predictions from two models, one based on ALBERT and one on RoBERTa, and frames the choice of the best answer among those predictions as a multiclass classification task. We explored various ensemble configurations, input representations, and model architectures. For evaluation, we examined test-set EM and F1 scores. Our best-performing ensemble incorporated a CNN-based meta-model and scored 87.117 EM and 90.306 F1, a relative improvement of 0.55% in EM and 0.61% in F1 over the best model in the ensemble, an ALBERT-based baseline that scored 86.644 EM and 89.760 F1.
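To make the "pick the best answer" framing concrete, here is a minimal sketch of how the top-N predictions of two base models could be pooled into candidates and given a multiclass target. It is an illustration under stated assumptions, not the paper's implementation: the helper names (`token_f1`, `build_candidates`, `target_class`), the simplified whitespace tokenization, the rule that the target class is the candidate with the highest F1 against the gold answer, and all example strings and probabilities are hypothetical.

```python
from collections import Counter


def token_f1(prediction: str, gold: str) -> float:
    """Token-overlap F1, simplified to lowercasing and whitespace splitting."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    # An empty string stands for "no answer"; it only matches another empty string.
    if not pred_tokens or not gold_tokens:
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def build_candidates(top_n_a, top_n_b):
    """Pool (text, probability) predictions from two base models, tagging the source."""
    return ([("albert", text, prob) for text, prob in top_n_a]
            + [("roberta", text, prob) for text, prob in top_n_b])


def target_class(candidates, gold: str) -> int:
    """Assumed labeling rule: index of the candidate with the highest F1 (first wins ties)."""
    scores = [token_f1(text, gold) for _, text, _ in candidates]
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical top-2 predictions for one question; probabilities are made up.
albert_top = [("in 1066", 0.62), ("", 0.30)]
roberta_top = [("1066", 0.71), ("the year 1066", 0.20)]

candidates = build_candidates(albert_top, roberta_top)
label = target_class(candidates, gold="1066")
```

A meta-model (CNN-based in the best configuration reported above) would then be trained to predict `label` from features of `candidates`; the feature design is not specified in the abstract and is left out of this sketch.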

* 11 pages, 7 figures, Stanford CS224n Natural Language Processing with Deep Learning
