Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:AutoPlanBench: : Automatically generating benchmarks for LLM planners from PDDL

Nov 16, 2023

Katharina Stein, Alexander Koller

Share this with someone who'll enjoy it:

Abstract:LLMs are being increasingly used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present a novel method for automatically converting planning benchmarks written in PDDL into textual descriptions and offer a benchmark dataset created with our method. We show that while the best LLM planners do well on many planning tasks, others remain out of reach of current methods.

View paper on

Share this with someone who'll enjoy it:

Title:AutoPlanBench: : Automatically generating benchmarks for LLM planners from PDDL

Paper and Code