Ondřej Plátek

Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices

Aug 17, 2024
Via arXiv

factgenie: A Framework for Span-based Evaluation of Generated Texts

Jul 25, 2024
Via arXiv

With a Little Help from the Authors: Reproducing Human Evaluation of an MT Error Detector

Aug 12, 2023
Via arXiv

Three Ways of Using Large Language Models to Evaluate Chat

Aug 12, 2023
Via arXiv

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

May 02, 2023
Via arXiv

TabGenie: A Toolkit for Table-to-Text Generation

Feb 27, 2023
Via arXiv

MooseNet: A Trainable Metric for Synthesized Speech with PLDA Backend

Jan 17, 2023
Via arXiv

Recurrent Neural Networks for Dialogue State Tracking

Jul 13, 2016
Via arXiv