VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures

Mar 16, 2025
