Mohamed Coulibali

NUBIA: NeUral Based Interchangeability Assessor for Text Generation

May 01, 2020

Hassan Kane, Muhammed Yusuf Kocyigit, Ali Abdalla, Pelkins Ajanoh, Mohamed Coulibali

Abstract: We present NUBIA, a methodology to build automatic evaluation metrics for text generation using only machine learning models as core components. A typical NUBIA model is composed of three modules: a neural feature extractor, an aggregator and a calibrator. We demonstrate an implementation of NUBIA which outperforms metrics currently used to evaluate machine translation and summaries, and slightly exceeds or matches state-of-the-art metrics on correlation with human judgement on the WMT segment-level Direct Assessment task, sentence-level ranking and image captioning evaluation. The model implemented is modular, explainable and set to continuously improve over time.

* 8 pages, 5 tables, and 2 figures
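
The three-module structure named in the abstract (feature extractor, aggregator, calibrator) can be pictured with a minimal sketch. This is not the authors' implementation: the function names, the two toy features, the weights, and the clipping calibrator are all assumptions made for illustration, and simple token statistics stand in for the neural models the paper uses.

```python
from typing import Dict, Optional


def toy_feature_extractor(reference: str, hypothesis: str) -> Dict[str, float]:
    """Stand-in for the neural feature extractor (hypothetical toy features)."""
    ref_tokens = set(reference.lower().split())
    hyp_tokens = set(hypothesis.lower().split())
    union = ref_tokens | hyp_tokens
    # Jaccard overlap of token sets; two empty strings count as identical.
    overlap = len(ref_tokens & hyp_tokens) / len(union) if union else 1.0
    ref_len = len(reference.split())
    hyp_len = len(hypothesis.split())
    longest = max(ref_len, hyp_len)
    # Ratio of shorter to longer length, in [0, 1].
    length_ratio = min(ref_len, hyp_len) / longest if longest else 1.0
    return {"overlap": overlap, "length_ratio": length_ratio}


def linear_aggregator(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Stand-in aggregator: a weighted sum (assumed; a learned model could go here)."""
    return sum(weights[name] * value for name, value in features.items())


def clip_calibrator(raw_score: float) -> float:
    """Stand-in calibrator: bound the aggregated score to [0, 1] (assumed behaviour)."""
    return max(0.0, min(1.0, raw_score))


def score(reference: str, hypothesis: str,
          weights: Optional[Dict[str, float]] = None) -> float:
    """Run the three stages in order: extract features, aggregate, calibrate."""
    if weights is None:
        weights = {"overlap": 0.7, "length_ratio": 0.3}  # illustrative weights only
    features = toy_feature_extractor(reference, hypothesis)
    return clip_calibrator(linear_aggregator(features, weights))
```

Because each stage is a separate function, any one of them can be swapped out without touching the others, which is one way to read the abstract's claim that the model is modular.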

Towards Neural Language Evaluators

Oct 30, 2019

Hassan Kané, Yusuf Kocyigit, Pelkins Ajanoh, Ali Abdalla, Mohamed Coulibali

Abstract: We review three limitations of BLEU and ROUGE, the most popular metrics used to assess reference summaries against hypothesis summaries, propose criteria for how a good metric should behave, and suggest concrete ways to use recent Transformer-based language models to assess reference summaries against hypothesis summaries.

* Accepted to NeurIPS 2019 Document Intelligence Workshop
