Picture for Mark Cieliebak

Mark Cieliebak

A Measure of the System Dependence of Automated Metrics

Add code
Dec 04, 2024
Figure 1 for A Measure of the System Dependence of Automated Metrics
Figure 2 for A Measure of the System Dependence of Automated Metrics
Figure 3 for A Measure of the System Dependence of Automated Metrics
Figure 4 for A Measure of the System Dependence of Automated Metrics
Viaarxiv icon

Error-preserving Automatic Speech Recognition of Young English Learners' Language

Add code
Jun 05, 2024
Viaarxiv icon

Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation

Add code
Jun 03, 2024
Viaarxiv icon

Dialect Transfer for Swiss German Speech Translation

Add code
Oct 13, 2023
Viaarxiv icon

Correction of Errors in Preference Ratings from Automated Metrics for Text Generation

Add code
Jun 06, 2023
Viaarxiv icon

STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions

Add code
May 30, 2023
Figure 1 for STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
Figure 2 for STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
Figure 3 for STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
Figure 4 for STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions
Viaarxiv icon

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

Add code
May 02, 2023
Figure 1 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 2 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 3 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 4 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Viaarxiv icon

On the Effectiveness of Automated Metrics for Text Generation Systems

Add code
Oct 24, 2022
Viaarxiv icon

SDS-200: A Swiss German Speech to Standard German Text Corpus

Add code
May 19, 2022
Figure 1 for SDS-200: A Swiss German Speech to Standard German Text Corpus
Figure 2 for SDS-200: A Swiss German Speech to Standard German Text Corpus
Figure 3 for SDS-200: A Swiss German Speech to Standard German Text Corpus
Figure 4 for SDS-200: A Swiss German Speech to Standard German Text Corpus
Viaarxiv icon

Probing the Robustness of Trained Metrics for Conversational Dialogue Systems

Add code
Feb 28, 2022
Figure 1 for Probing the Robustness of Trained Metrics for Conversational Dialogue Systems
Figure 2 for Probing the Robustness of Trained Metrics for Conversational Dialogue Systems
Figure 3 for Probing the Robustness of Trained Metrics for Conversational Dialogue Systems
Figure 4 for Probing the Robustness of Trained Metrics for Conversational Dialogue Systems
Viaarxiv icon