Picture for Mara Finkelstein

Mara Finkelstein

From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Add code
Nov 23, 2024
Figure 1 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 2 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 3 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 4 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Viaarxiv icon

Learning from others' mistakes: Finetuning machine translation models with span-level error annotations

Add code
Oct 21, 2024
Figure 1 for Learning from others' mistakes: Finetuning machine translation models with span-level error annotations
Figure 2 for Learning from others' mistakes: Finetuning machine translation models with span-level error annotations
Figure 3 for Learning from others' mistakes: Finetuning machine translation models with span-level error annotations
Figure 4 for Learning from others' mistakes: Finetuning machine translation models with span-level error annotations
Viaarxiv icon

MetricX-24: The Google Submission to the WMT 2024 Metrics Shared Task

Add code
Oct 04, 2024
Viaarxiv icon

Introducing the NewsPaLM MBR and QE Dataset: LLM-Generated High-Quality Parallel Data Outperforms Traditional Web-Crawled Data

Add code
Aug 14, 2024
Viaarxiv icon

Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms

Add code
Jun 05, 2024
Viaarxiv icon

Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback

Add code
Nov 15, 2023
Viaarxiv icon

There's no Data Like Better Data: Using QE Metrics for MT Data Filtering

Add code
Nov 09, 2023
Viaarxiv icon

Quality Control at Your Fingertips: Quality-Aware Translation Models

Add code
Oct 10, 2023
Viaarxiv icon

MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods

Add code
Sep 28, 2023
Viaarxiv icon

Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level

Add code
Aug 28, 2023
Viaarxiv icon