Markus Freitag

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

Feb 18, 2025
Via arXiv

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

Jan 30, 2025
Via arXiv

From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Nov 23, 2024
Via arXiv

Mitigating Metric Bias in Minimum Bayes Risk Decoding

Nov 05, 2024
Via arXiv

Learning from Others' Mistakes: Finetuning Machine Translation Models with Span-Level Error Annotations

Oct 21, 2024
Via arXiv

Beyond Human-Only: Evaluating Human-Machine Collaboration for Collecting High-Quality Translation Data

Oct 14, 2024
Via arXiv

MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task

Oct 04, 2024
Via arXiv

On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation

Oct 01, 2024
Via arXiv

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

Sep 10, 2024
Via arXiv

Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data

Aug 14, 2024
Via arXiv