Picture for Graham Neubig

Graham Neubig

Carnegie Mellon University

Effective Strategies for Asynchronous Software Engineering Agents

Add code
Mar 23, 2026
Viaarxiv icon

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Add code
Mar 19, 2026
Viaarxiv icon

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents

Add code
Mar 18, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

Mind the Sim2Real Gap in User Simulation for Agentic Tasks

Add code
Mar 11, 2026
Viaarxiv icon

A Rubric-Supervised Critic from Sparse Real-World Outcomes

Add code
Mar 04, 2026
Viaarxiv icon

Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Add code
Mar 03, 2026
Viaarxiv icon

How Well Does Agent Development Reflect Real-World Work?

Add code
Mar 01, 2026
Viaarxiv icon

Modeling Distinct Human Interaction in Web Agents

Add code
Feb 19, 2026
Viaarxiv icon

Hybrid-Gym: Training Coding Agents to Generalize Across Tasks

Add code
Feb 18, 2026
Viaarxiv icon