Julia Shephard

EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments

Mar 24, 2025
Via arXiv