Picture for Ondřej Plátek

Ondřej Plátek

Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices

Add code
Aug 17, 2024
Viaarxiv icon

factgenie: A Framework for Span-based Evaluation of Generated Texts

Add code
Jul 25, 2024
Viaarxiv icon

Three Ways of Using Large Language Models to Evaluate Chat

Add code
Aug 12, 2023
Viaarxiv icon

With a Little Help from the Authors: Reproducing Human Evaluation of an MT Error Detector

Add code
Aug 12, 2023
Viaarxiv icon

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

Add code
May 02, 2023
Figure 1 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 2 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 3 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 4 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Viaarxiv icon

TabGenie: A Toolkit for Table-to-Text Generation

Add code
Feb 27, 2023
Viaarxiv icon

MooseNet: A trainable metric for synthesized speech with plda backend

Add code
Jan 17, 2023
Viaarxiv icon

Recurrent Neural Networks for Dialogue State Tracking

Add code
Jul 13, 2016
Figure 1 for Recurrent Neural Networks for Dialogue State Tracking
Figure 2 for Recurrent Neural Networks for Dialogue State Tracking
Figure 3 for Recurrent Neural Networks for Dialogue State Tracking
Figure 4 for Recurrent Neural Networks for Dialogue State Tracking
Viaarxiv icon