Robert Mattmüller

Albert-Ludwigs-Universität Freiburg

Learning Heuristic Selection with Dynamic Algorithm Configuration

Jun 15, 2020

David Speck, André Biedenkapp, Frank Hutter, Robert Mattmüller, Marius Lindauer

Abstract: A key challenge in satisficing planning is to use multiple heuristics within one heuristic search. An aggregation of multiple heuristic estimates, for example by taking the maximum, has the disadvantage that bad estimates of a single heuristic can negatively affect the whole search. Since the performance of a heuristic varies from instance to instance, approaches such as algorithm selection can be successfully applied. In addition, alternating between multiple heuristics during the search makes it possible to use all heuristics equally and improve performance. However, all these approaches ignore the internal search dynamics of a planning system, which can help to select the most helpful heuristics for the current expansion step. We show that dynamic algorithm configuration can be used for dynamic heuristic selection which takes into account the internal search dynamics of a planning system. Furthermore, we prove that this approach generalizes over existing approaches and that it can exponentially improve the performance of the heuristic search. To learn dynamic heuristic selection, we propose an approach based on reinforcement learning and show empirically that domain-wise learned policies, which take the internal search dynamics of a planning system into account, can exceed existing approaches in terms of coverage.

* 9 pages, 4 figures, 3 tables
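
As a rough illustration of the idea in the abstract above (not the authors' implementation), the sketch below runs a greedy best-first search that keeps one open list per heuristic and asks a policy, at every expansion step, which list to expand from. All names here (`greedy_search`, `alternate`, `pick_lowest`, the toy grid task and its heuristics) are hypothetical. `alternate` mimics plain alternation between heuristics, and `pick_lowest` is a hand-written stand-in for a learned policy: it only looks at one simple search statistic, the current minimum heuristic value in each open list, and is not the reinforcement-learning approach the paper proposes.

```python
import heapq
from itertools import count
from typing import Callable, Hashable, Iterable, List, Optional, Sequence, Tuple

State = Hashable
Heuristic = Callable[[State], float]
# A policy maps (step index, minimum h-value of each open list) to a heuristic index.
Policy = Callable[[int, Sequence[float]], int]


def greedy_search(
    start: State,
    is_goal: Callable[[State], bool],
    successors: Callable[[State], Iterable[State]],
    heuristics: Sequence[Heuristic],
    policy: Policy,
) -> Tuple[Optional[List[State]], int]:
    """Greedy best-first search with one open list per heuristic.

    Returns (path of states or None, number of expanded states).
    """
    tie = count()  # tie-breaker so heap entries never compare states directly
    open_lists: List[list] = [[] for _ in heuristics]
    parent = {start: None}
    closed = set()

    def push(state: State) -> None:
        # Every generated state is queued in every open list.
        for queue, h in zip(open_lists, heuristics):
            heapq.heappush(queue, (h(state), next(tie), state))

    push(start)
    step = expansions = 0
    while any(open_lists):
        # Search statistic handed to the policy: best h-value per open list.
        mins = [q[0][0] if q else float("inf") for q in open_lists]
        i = policy(step, mins)
        step += 1
        if not open_lists[i]:  # chosen list is empty: fall back to a non-empty one
            i = next(j for j, q in enumerate(open_lists) if q)
        _, _, state = heapq.heappop(open_lists[i])
        if state in closed:  # already expanded via another open list
            continue
        closed.add(state)
        expansions += 1
        if is_goal(state):
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1], expansions
        for succ in successors(state):
            if succ not in parent:
                parent[succ] = state
                push(succ)
    return None, expansions


def alternate(step: int, mins: Sequence[float]) -> int:
    """Static policy: cycle through the heuristics in a fixed order."""
    return step % len(mins)


def pick_lowest(step: int, mins: Sequence[float]) -> int:
    """Toy state-dependent policy: use the open list whose best entry looks closest."""
    return min(range(len(mins)), key=mins.__getitem__)


# Hypothetical toy task: walk from (0, 0) to (3, 3) on a 4x4 grid, moving right or up.
GOAL = (3, 3)


def grid_successors(state):
    x, y = state
    return [(nx, ny) for nx, ny in ((x + 1, y), (x, y + 1)) if nx <= 3 and ny <= 3]


def h_x(state):  # only informed about the horizontal distance
    return abs(GOAL[0] - state[0])


def h_y(state):  # only informed about the vertical distance
    return abs(GOAL[1] - state[1])


if __name__ == "__main__":
    for policy in (alternate, pick_lowest):
        path, expanded = greedy_search(
            (0, 0), lambda s: s == GOAL, grid_successors, [h_x, h_y], policy
        )
        print(policy.__name__, "expanded", expanded, "states; path:", path)
```

Keeping the policy as a plain function of the step index and a few search statistics makes it easy to swap a fixed schedule for a state-dependent rule, or for a learned model, without touching the search loop. Which statistics a real planner would expose to the policy is left open here.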

Cooperative Epistemic Multi-Agent Planning for Implicit Coordination

Mar 07, 2017

Thorsten Engesser, Thomas Bolander, Robert Mattmüller, Bernhard Nebel

Abstract: Epistemic planning can be used for decision making in multi-agent situations with distributed knowledge and capabilities. Recently, Dynamic Epistemic Logic (DEL) has been shown to provide a very natural and expressive framework for epistemic planning. We extend the DEL-based epistemic planning framework to include perspective shifts, allowing us to define new notions of sequential and conditional planning with implicit coordination. With these, it is possible to solve planning tasks with joint goals in a decentralized manner without the agents having to negotiate about and commit to a joint policy at plan time. First, we define the central planning notions and sketch the implementation of a planning system built on those notions. Afterwards, we provide some case studies in order to evaluate the planner empirically and to show that the concept is useful for multi-agent systems in practice.

* EPTCS 243, 2017, pp. 75-90
* In Proceedings M4M9 2017, arXiv:1703.01736
