Picture for Parker Riley

Parker Riley

Enhancing Human Evaluation in Machine Translation with Comparative Judgment

Add code
Feb 25, 2025
Viaarxiv icon

WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects

Add code
Feb 18, 2025
Viaarxiv icon

From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set

Add code
Nov 23, 2024
Figure 1 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 2 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 3 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Figure 4 for From Jack of All Trades to Master of One: Specializing LLM-based Autoraters to a Test Set
Viaarxiv icon

Beyond Human-Only: Evaluating Human-Machine Collaboration for Collecting High-Quality Translation Data

Add code
Oct 14, 2024
Viaarxiv icon

Finding Replicable Human Evaluations via Stable Ranking Probability

Add code
Apr 01, 2024
Viaarxiv icon

The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation

Add code
Aug 14, 2023
Viaarxiv icon

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

Add code
May 24, 2023
Figure 1 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 2 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 3 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Figure 4 for XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon

FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation

Add code
Oct 01, 2022
Figure 1 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 2 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 3 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Figure 4 for FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation
Viaarxiv icon

TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling

Add code
Oct 08, 2020
Figure 1 for TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Figure 2 for TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Figure 3 for TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Figure 4 for TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Viaarxiv icon