Picture for Takumi Yanagawa

Takumi Yanagawa

IBM

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Add code
Feb 07, 2025
Viaarxiv icon