Graham Neubig

Carnegie Mellon University

Training Versatile Coding Agents in Synthetic Environments

Dec 13, 2025
Via arXiv

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Dec 08, 2025
Via arXiv

Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities

Nov 18, 2025
Via arXiv

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Nov 05, 2025
Via arXiv

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Oct 29, 2025
Via arXiv

How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations

Oct 26, 2025
Via arXiv

MERLIN: A Testbed for Multilingual Multimodal Entity Recognition and Linking

Oct 16, 2025
Via arXiv

Grounding Multilingual Multimodal LLMs With Cultural Knowledge

Aug 12, 2025
Via arXiv

SimuRA: Towards General Goal-Oriented Agent via Simulative Reasoning Architecture with LLM-Based World Model

Jul 31, 2025
Via arXiv

Checklists Are Better Than Reward Models For Aligning Language Models

Jul 24, 2025
Via arXiv