Picture for Graham Neubig

Graham Neubig

Carnegie Mellon University

On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists

Add code
May 20, 2026
Viaarxiv icon

Reinforcing Human Behavior Simulation via Verbal Feedback

Add code
May 19, 2026
Viaarxiv icon

Recursive Agent Optimization

Add code
May 07, 2026
Viaarxiv icon

Asking What Matters: Reward-Driven Clarification for Software Engineering Tasks

Add code
Apr 16, 2026
Viaarxiv icon

What do Language Models Learn and When? The Implicit Curriculum Hypothesis

Add code
Apr 09, 2026
Viaarxiv icon

Gym-Anything: Turn any Software into an Agent Environment

Add code
Apr 07, 2026
Viaarxiv icon

IDIOLEX: Unified and Continuous Representations for Idiolectal and Stylistic Variation

Add code
Apr 06, 2026
Viaarxiv icon

Effective Strategies for Asynchronous Software Engineering Agents

Add code
Mar 23, 2026
Viaarxiv icon

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Add code
Mar 19, 2026
Viaarxiv icon

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents

Add code
Mar 18, 2026
Viaarxiv icon