Picture for Hirokuni Kitahara

Hirokuni Kitahara

IBM

ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

Add code
Feb 07, 2025
Viaarxiv icon