Abstract:Hierarchical Task Network (HTN) planning usually requires a domain engineer to provide manual input about how to decompose a planning problem. Even HTN-MAKER, a well-known method-learning algorithm, requires a domain engineer to annotate the tasks with information about what to learn. We introduce CURRICULAMA, an HTN method-learning algorithm that completely automates the learning process. It uses landmark analysis to compose annotated tasks and leverages curriculum learning to order the learning of methods from simpler to more complex. This eliminates the need for manual input, resolving a core issue with HTN-MAKER. We prove CURRICULAMA's soundness, and show experimentally that its convergence rate in learning a complete set of methods is substantially similar to HTN-MAKER's.
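The curriculum idea in this abstract, learning methods for simpler tasks before more complex ones, can be illustrated with a minimal sketch. The sketch below is not CURRICULAMA itself; the names (AnnotatedTask, learn_methods_for) and the use of landmark count as the difficulty measure are assumptions made for illustration.

```python
# Illustrative sketch only: order annotated tasks from simpler to more complex
# (here, by the number of landmarks found by landmark analysis) and learn
# methods in that order, reusing methods learned for earlier tasks.
# AnnotatedTask and learn_methods_for are hypothetical placeholders.
from dataclasses import dataclass
from typing import Callable, FrozenSet, List

@dataclass(frozen=True)
class AnnotatedTask:
    name: str
    landmarks: FrozenSet[str]  # facts every solution to this task must achieve

def curriculum_order(tasks: List[AnnotatedTask]) -> List[AnnotatedTask]:
    """Tasks with fewer landmarks are treated as simpler and come first."""
    return sorted(tasks, key=lambda t: len(t.landmarks))

def learn_curriculum(tasks: List[AnnotatedTask],
                     learn_methods_for: Callable[[AnnotatedTask, list], list],
                     method_library: list) -> list:
    """Learn HTN methods task by task along the curriculum."""
    for task in curriculum_order(tasks):
        method_library += learn_methods_for(task, method_library)
    return method_library
```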
Abstract:The most common representation formalisms for automated planning are descriptive models that abstractly describe what the actions do and are tailored for efficiently computing the next state(s) in a state-transition system. However, real-world acting requires operational models that describe how to do things, with rich control structures for closed-loop online decision-making in a dynamic environment. Using a different action model for planning than the one used for acting causes problems in combining acting and planning, in particular for the development and consistency verification of the different models. As an alternative, we define and implement an integrated acting-and-planning system in which both planning and acting use the same operational models, which are written in a general-purpose hierarchical task-oriented language offering rich control structures. The acting component, called Reactive Acting Engine (RAE), is inspired by the well-known PRS system, except that instead of being purely reactive, it can get advice from the planner. Our planner uses a UCT-like Monte Carlo Tree Search procedure, called UPOM (UCT Procedure for Operational Models), whose rollouts are simulations of the actor's operational models. We also present learning strategies for use with RAE and UPOM that acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. Our experimental results show that UPOM and our learning strategies significantly improve the acting efficiency and robustness of RAE. We discuss the asymptotic convergence of UPOM by mapping its search space to an MDP.
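UPOM is described as a UCT-like procedure whose rollouts simulate the operational models. A minimal sketch of a UCT-style selection rule of that general kind is given below; the function names, statistics layout, and exploration constant are illustrative assumptions, not the actual UPOM implementation.

```python
# Illustrative UCT-style selection at a task node: among the applicable method
# instances, pick the one maximizing the UCB1 score
#     Q(m) + C * sqrt(ln(N) / n(m)),
# where Q(m) is the average rollout utility of method m, n(m) its visit count,
# and N the total visits at this node.  A simplified sketch, not UPOM itself.
import math

def select_method(stats, applicable_methods, C=math.sqrt(2)):
    """stats: dict mapping method -> (total_utility, visits)."""
    N = sum(stats.get(m, (0.0, 0))[1] for m in applicable_methods) or 1
    best, best_score = None, -math.inf
    for m in applicable_methods:
        total, visits = stats.get(m, (0.0, 0))
        if visits == 0:
            return m  # try each untried method once before exploiting
        score = total / visits + C * math.sqrt(math.log(N) / visits)
        if score > best_score:
            best, best_score = m, score
    return best
```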
Abstract:We present new planning and learning algorithms for RAE, the Refinement Acting Engine. RAE uses hierarchical operational models to perform tasks in dynamically changing environments. Our planning procedure, UPOM, does a UCT-like search in the space of operational models in order to find a near-optimal method to use for the task and context at hand. Our learning strategies acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. Our experimental results show that UPOM and our learning strategies significantly improve RAE's performance in four test domains using two different metrics: efficiency and success ratio.
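The learning strategies above are described as acquiring a mapping from decision contexts to method instances. The following sketch shows one simple way such a mapping could be kept and used to rank candidate methods; it is a placeholder made up for illustration, not the learning procedure from the paper.

```python
# Illustrative context-to-method preference learner: count successful uses of
# each method instance per (task, context) pair, from acting or simulated
# planning traces, and rank candidates by those counts.  Hypothetical sketch.
from collections import defaultdict

class MethodPreferenceLearner:
    def __init__(self):
        # (task, context_features) -> {method_instance: successful uses}
        self.counts = defaultdict(lambda: defaultdict(int))

    def record(self, task, context, method, succeeded):
        if succeeded:
            self.counts[(task, context)][method] += 1

    def rank(self, task, context, candidates):
        """Return candidate methods ordered by past success in this context."""
        scores = self.counts.get((task, context), {})
        return sorted(candidates, key=lambda m: scores.get(m, 0), reverse=True)
```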
Abstract:In the field of Artificial Intelligence, traditional approaches to choosing moves in games involve the use of the minimax algorithm. However, recent research results indicate that minimaxing may not always be the best approach. In this paper we summarize the results of some measurements on several model games with several different evaluation functions. These measurements, which are presented in detail in [NPT], show that there are some new algorithms that can make significantly better use of evaluation function values than the minimax algorithm does.
Abstract:The discovery that the minimax decision rule performs poorly in some games has sparked interest in possible alternatives to minimax. Until recently, the only games in which minimax was known to perform poorly were games which were mainly of theoretical interest. However, this paper reports results showing poor performance of minimax in a more common game called kalah. For the kalah games tested, a non-minimax decision rule called the product rule performs significantly better than minimax. This paper also discusses a possible way to predict whether or not minimax will perform well in a game when compared to product. A parameter called the rate of heuristic flaw (rhf) has been found to correlate positively with the performance of product against minimax. Both analytical and experimental results are given that appear to support the predictive power of rhf.
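To make the contrast between the two back-up rules concrete, the sketch below compares minimax back-up with product propagation (the product rule) on a toy game tree; the tree and leaf values are invented for illustration and are not the paper's experimental setup.

```python
# Leaf values are heuristic estimates in [0, 1] of the probability that the
# player to move at the root wins.  Minimax backs up max/min of child values;
# the product rule treats child values as independent win probabilities:
# 1 - prod(1 - v) at nodes where the root player moves, prod(v) at opponent nodes.

def minimax(node, root_to_move=True):
    if isinstance(node, (int, float)):
        return node
    values = [minimax(child, not root_to_move) for child in node]
    return max(values) if root_to_move else min(values)

def product_rule(node, root_to_move=True):
    if isinstance(node, (int, float)):
        return node
    values = [product_rule(child, not root_to_move) for child in node]
    prod = 1.0
    if root_to_move:
        for v in values:
            prod *= (1.0 - v)      # probability that every move fails
        return 1.0 - prod          # probability that at least one move wins
    for v in values:
        prod *= v                  # all opponent replies must still be wins
    return prod

tree = [[0.9, 0.4], [0.6, 0.7]]    # two root moves, each with two replies
print(minimax(tree), product_rule(tree))   # 0.6 vs ~0.63; rankings can differ on deeper trees
```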