Zhuoyi Huang

MARPLE: A Benchmark for Long-Horizon Inference

Oct 02, 2024
Via arXiv

Mini-BEHAVIOR: A Procedurally Generated Benchmark for Long-horizon Decision-Making in Embodied AI

Oct 03, 2023
Via arXiv