Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PG3: Policy-Guided Planning for Generalized Policy Generation

Apr 21, 2022

Ryan Yang, Tom Silver, Aidan Curtis, Tomas Lozano-Perez, Leslie Pack Kaelbling

Figure 1 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 2 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 3 for PG3: Policy-Guided Planning for Generalized Policy Generation

Figure 4 for PG3: Policy-Guided Planning for Generalized Policy Generation

Share this with someone who'll enjoy it:

Abstract:A longstanding objective in classical planning is to synthesize policies that generalize across multiple problems from the same domain. In this work, we study generalized policy search-based methods with a focus on the score function used to guide the search over policies. We demonstrate limitations of two score functions and propose a new approach that overcomes these limitations. The main idea behind our approach, Policy-Guided Planning for Generalized Policy Generation (PG3), is that a candidate policy should be used to guide planning on training problems as a mechanism for evaluating that candidate. Theoretical results in a simplified setting give conditions under which PG3 is optimal or admissible. We then study a specific instantiation of policy search where planning problems are PDDL-based and policies are lifted decision lists. Empirical results in six domains confirm that PG3 learns generalized policies more efficiently and effectively than several baselines. Code: https://github.com/ryangpeixu/pg3

* IJCAI 2022

View paper on

Share this with someone who'll enjoy it:

Title:PG3: Policy-Guided Planning for Generalized Policy Generation

Paper and Code