Denis Peskov

Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence

Jul 05, 2021

Alexander Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan Boyd-Graber, Philip Resnik

Abstract: Topic model evaluation, like evaluation of other unsupervised methods, can be contentious. However, the field has coalesced around automated estimates of topic coherence, which rely on the frequency of word co-occurrences in a reference corpus. Recent models relying on neural components surpass classical topic models according to these metrics. At the same time, unlike classical models, the practice of neural topic model evaluation suffers from a validation gap: automatic coherence for neural models has not been validated using human experimentation. In addition, as we show via a meta-analysis of topic modeling literature, there is a substantial standardization gap in the use of automated topic modeling benchmarks. We address both the standardization gap and the validation gap. Using two of the most widely used topic model evaluation datasets, we assess a dominant classical model and two state-of-the-art neural models in a systematic, clearly documented, reproducible way. We use automatic coherence along with the two most widely accepted human judgment tasks, namely, topic rating and word intrusion. Automated evaluation will declare one model significantly different from another when corresponding human evaluations do not, calling into question the validity of fully automatic evaluations independent of human judgments.
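
To make "automated estimates of topic coherence" concrete, the sketch below scores a topic by averaging normalized pointwise mutual information (NPMI) over its word pairs, using document-level co-occurrence counts from a reference corpus. The abstract does not say which coherence metric or counting window the paper uses, so NPMI, the function name `npmi_coherence`, and the handling of edge cases are all assumptions made for illustration.

```python
import math
from itertools import combinations


def npmi_coherence(topic_words, reference_docs):
    """Average NPMI over all word pairs of a topic (illustrative sketch).

    topic_words: list of the topic's top words (at least two).
    reference_docs: list of tokenized documents (lists of strings).
    Co-occurrence is counted at the document level, which is an assumption;
    sliding windows are another common choice.
    """
    if len(topic_words) < 2:
        raise ValueError("need at least two topic words to form a pair")
    if not reference_docs:
        raise ValueError("reference corpus is empty")

    doc_sets = [set(doc) for doc in reference_docs]
    n_docs = len(doc_sets)

    def prob(*words):
        # Fraction of documents that contain every word in `words`.
        return sum(all(w in d for w in words) for d in doc_sets) / n_docs

    scores = []
    for w1, w2 in combinations(topic_words, 2):
        p1, p2, p12 = prob(w1), prob(w2), prob(w1, w2)
        if p12 == 0.0:
            # Words never co-occur: use the NPMI lower bound (a convention).
            scores.append(-1.0)
        elif p12 == 1.0:
            # Both words appear in every document, so NPMI is 0/0;
            # scoring this as 0.0 is an assumption of this sketch.
            scores.append(0.0)
        else:
            pmi = math.log(p12 / (p1 * p2))
            scores.append(pmi / -math.log(p12))
    return sum(scores) / len(scores)


# Toy reference corpus, invented purely for illustration.
docs = [
    ["cat", "dog"],
    ["cat", "dog"],
    ["stock", "market"],
    ["stock", "market"],
]
coherent = npmi_coherence(["cat", "dog"], docs)
incoherent = npmi_coherence(["cat", "stock"], docs)
```

On this toy corpus, words that always appear together score higher than words that never do. Scores of this kind depend entirely on the reference corpus and counting scheme, which is one reason the abstract's concerns about standardization and human validation matter.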

Mitigating Noisy Inputs for Question Answering

Aug 08, 2019

Denis Peskov, Joe Barrow, Pedro Rodriguez, Graham Neubig, Jordan Boyd-Graber

Abstract: Natural language processing systems are often downstream of unreliable inputs: machine translation, optical character recognition, or speech recognition. For instance, virtual assistants can only answer your questions after understanding your speech. We investigate and mitigate the effects of noise from Automatic Speech Recognition systems on two factoid Question Answering (QA) tasks. Integrating confidences into the model and forced decoding of unknown words are empirically shown to improve the accuracy of downstream neural QA systems. We create and train models on a synthetic corpus of over 500,000 noisy sentences and evaluate on two human corpora from Quizbowl and Jeopardy! competitions.
