Tom Kocmi

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier

Dec 05, 2024

Preliminary WMT24 Ranking of General MT Systems and LLMs

Jul 29, 2024

AI-Assisted Human Evaluation of Machine Translation

Jun 18, 2024

Error Span Annotation: A Balanced Approach for Human Evaluation of Machine Translation

Jun 17, 2024

Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets

Jan 29, 2024

Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies

Jan 12, 2024

GEMBA-MQM: Detecting Translation Quality Error Spans with GPT-4

Oct 21, 2023

SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window

Sep 16, 2023

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

Feb 28, 2023

Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference

Jan 28, 2023