Picture for Colin Cherry

Colin Cherry

SMOL: Professionally translated parallel data for 115 under-represented languages

Add code
Feb 17, 2025
Viaarxiv icon

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

Add code
Jan 30, 2025
Viaarxiv icon

On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation

Add code
Oct 01, 2024
Viaarxiv icon

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

Add code
Sep 10, 2024
Figure 1 for Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
Figure 2 for Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
Figure 3 for Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
Figure 4 for Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
Viaarxiv icon

Don't Throw Away Data: Better Sequence Knowledge Distillation

Add code
Jul 15, 2024
Viaarxiv icon

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Add code
Feb 27, 2024
Viaarxiv icon

To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation

Add code
Jan 02, 2024
Viaarxiv icon

Quality Control at Your Fingertips: Quality-Aware Translation Models

Add code
Oct 10, 2023
Viaarxiv icon

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

Add code
May 24, 2023
Figure 1 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 2 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 3 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 4 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon