Picture for Graham Neubig

Graham Neubig

Carnegie Mellon University

Interactive Agents to Overcome Ambiguity in Software Engineering

Add code
Feb 18, 2025
Viaarxiv icon

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Viaarxiv icon

Demystifying Long Chain-of-Thought Reasoning in LLMs

Add code
Feb 05, 2025
Viaarxiv icon

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Add code
Jan 28, 2025
Viaarxiv icon

AutoPresent: Designing Structured Visuals from Scratch

Add code
Jan 01, 2025
Viaarxiv icon

Training Software Engineering Agents and Verifiers with SWE-Gym

Add code
Dec 30, 2024
Viaarxiv icon

Towards Automatic Evaluation for Image Transcreation

Add code
Dec 18, 2024
Figure 1 for Towards Automatic Evaluation for Image Transcreation
Figure 2 for Towards Automatic Evaluation for Image Transcreation
Figure 3 for Towards Automatic Evaluation for Image Transcreation
Figure 4 for Towards Automatic Evaluation for Image Transcreation
Viaarxiv icon

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Add code
Dec 18, 2024
Viaarxiv icon

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Figure 1 for The BrowserGym Ecosystem for Web Agent Research
Figure 2 for The BrowserGym Ecosystem for Web Agent Research
Figure 3 for The BrowserGym Ecosystem for Web Agent Research
Figure 4 for The BrowserGym Ecosystem for Web Agent Research
Viaarxiv icon

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Add code
Dec 06, 2024
Viaarxiv icon