Graham Neubig

Carnegie Mellon University

Benchmarking Failures in Tool-Augmented Language Models

Mar 18, 2025

Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention

Mar 11, 2025

Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

Mar 05, 2025

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Feb 21, 2025

Interactive Agents to Overcome Ambiguity in Software Engineering

Feb 18, 2025

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Feb 12, 2025

Demystifying Long Chain-of-Thought Reasoning in LLMs

Feb 05, 2025

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Jan 28, 2025

AutoPresent: Designing Structured Visuals from Scratch

Jan 01, 2025

Training Software Engineering Agents and Verifiers with SWE-Gym

Dec 30, 2024