Mohit Iyyer

BEARCUBS: A benchmark for computer-using web agents

Mar 10, 2025, via arXiv

One ruler to measure them all: Benchmarking multilingual long-context language models

Mar 03, 2025, via arXiv

CLIPPER: Compression enables long-context synthetic data generation

Feb 20, 2025, via arXiv

Whose story is it? Personalizing story generation by inferring author styles

Feb 18, 2025, via arXiv

OverThink: Slowdown Attacks on Reasoning LLMs

Feb 05, 2025, via arXiv

OVERTHINKING: Slowdown Attacks on Reasoning LLMs

Feb 04, 2025, via arXiv

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

Jan 26, 2025, via arXiv

Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations

Nov 11, 2024, via arXiv

Interactive Topic Models with Optimal Transport

Jun 28, 2024, via arXiv

Suri: Multi-constraint Instruction Following for Long-form Text Generation

Jun 27, 2024, via arXiv