Picture for Jordan Lee Boyd-Graber

Jordan Lee Boyd-Graber

Personalized Help for Optimizing Low-Skilled Users' Strategy

Add code
Nov 14, 2024
Viaarxiv icon

ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks

Add code
Jun 24, 2024
Viaarxiv icon

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Add code
Jun 16, 2024
Viaarxiv icon

More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play

Add code
Jun 07, 2024
Viaarxiv icon

PANDA (Pedantic ANswer-correctness Determination and Adjudication):Improving Automatic Evaluation for Question Answering and Text Generation

Add code
Feb 17, 2024
Viaarxiv icon

Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

Add code
Dec 03, 2023
Viaarxiv icon