Picture for Mohit Iyyer

Mohit Iyyer

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Add code
Nov 11, 2024
Viaarxiv icon

Interactive Topic Models with Optimal Transport

Add code
Jun 28, 2024
Viaarxiv icon

VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation

Add code
Jun 27, 2024
Viaarxiv icon

Suri: Multi-constraint Instruction Following for Long-form Text Generation

Add code
Jun 27, 2024
Viaarxiv icon

CaLMQA: Exploring culturally specific long-form question answering across 23 languages

Add code
Jun 25, 2024
Viaarxiv icon

One Thousand and One Pairs: A "novel" challenge for long-context language models

Add code
Jun 24, 2024
Viaarxiv icon

PostMark: A Robust Blackbox Watermark for Large Language Models

Add code
Jun 20, 2024
Viaarxiv icon

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images

Add code
Apr 21, 2024
Viaarxiv icon

FABLES: Evaluating faithfulness and content selection in book-length summarization

Add code
Apr 01, 2024
Figure 1 for FABLES: Evaluating faithfulness and content selection in book-length summarization
Figure 2 for FABLES: Evaluating faithfulness and content selection in book-length summarization
Figure 3 for FABLES: Evaluating faithfulness and content selection in book-length summarization
Figure 4 for FABLES: Evaluating faithfulness and content selection in book-length summarization
Viaarxiv icon

GEE! Grammar Error Explanation with Large Language Models

Add code
Nov 16, 2023
Viaarxiv icon