Automated test generation to evaluate tool-augmented LLMs as conversational AI agents

Add code
Sep 24, 2024

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: