Colin Cherry

Dima

Gemma 3 Technical Report

Mar 25, 2025

Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation

Mar 06, 2025

SMOL: Professionally translated parallel data for 115 under-represented languages

Feb 17, 2025

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation

Jan 30, 2025

On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation

Oct 01, 2024

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

Sep 10, 2024

Don't Throw Away Data: Better Sequence Knowledge Distillation

Jul 15, 2024

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Feb 27, 2024

To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation

Jan 02, 2024

Quality Control at Your Fingertips: Quality-Aware Translation Models

Oct 10, 2023