Abstract: Most exact algorithms for general partially observable Markov decision processes (POMDPs) use a form of dynamic programming in which a piecewise-linear and convex representation of one value function is transformed into another. We examine variations of the "incremental pruning" method for solving this problem and compare them to earlier algorithms from theoretical and empirical perspectives. We find that incremental pruning is presently the most efficient exact method for solving POMDPs.
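For intuition, here is a minimal Python sketch (numpy assumed) of the operation the abstract describes: a value function represented as a set of alpha-vectors, with pruning interleaved into the cross-sums, which is the defining idea of incremental pruning. The pointwise-dominance test below is only a sufficient filter; the exact algorithms additionally solve a small linear program per vector to find a witness belief state. All names and the toy data are illustrative, not taken from the paper.

import numpy as np

def prune(vectors):
    """Keep only alpha-vectors not pointwise dominated by another vector.
    This is a sufficient filter only: the exact methods also use an LP-based
    test to discard vectors that are useless at every belief state."""
    kept = []
    for i, a in enumerate(vectors):
        dominated = False
        for j, b in enumerate(vectors):
            # b dominates a if b is at least as good everywhere; ties between
            # duplicate vectors are broken by index so one copy survives
            if i != j and np.all(b >= a) and (np.any(b > a) or j < i):
                dominated = True
                break
        if not dominated:
            kept.append(a)
    return kept

def cross_sum(A, B):
    """All pairwise sums of alpha-vectors from two sets (the dynamic
    programming backup combines one such set per observation)."""
    return [a + b for a in A for b in B]

def incremental_cross_sum(sets):
    """The idea behind incremental pruning: interleave pruning with each
    cross-sum, prune(...prune(prune(S1 + S2) + S3)... + Sk), rather than
    pruning the full cross-sum only once at the end."""
    result = prune(sets[0])
    for S in sets[1:]:
        result = prune(cross_sum(result, S))
    return result

# Tiny 2-state example: each set stands in for one observation's contribution.
S1 = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]
S2 = [np.array([0.5, 0.5]), np.array([0.4, 0.4])]   # second vector is dominated
print([v.tolist() for v in incremental_cross_sum([S1, S2])])

Interleaving matters because the intermediate sets stay small: dominated vectors are discarded before they can multiply through the remaining cross-sums.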
Abstract: Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from a restricted set of policies, represented as finite state automata of a given size. This problem is also intractable, but we show that the complexity can be greatly reduced when the POMDP and/or policy are further constrained. We demonstrate good empirical results with a branch-and-bound method for finding globally optimal deterministic policies, and a gradient-ascent method for finding locally optimal stochastic policies.
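As a rough illustration of the second approach, the following Python sketch (numpy assumed) parameterizes a stochastic finite-state controller of fixed size, evaluates it exactly by solving the linear system of the joint (controller node, world state) Markov chain, and improves it by gradient ascent. The finite-difference gradient and the random toy POMDP are stand-ins for the paper's exact gradient computation; all names are illustrative.

import numpy as np

rng = np.random.default_rng(0)

# A tiny random POMDP (all numbers made up purely for illustration).
nS, nA, nZ, nN = 2, 2, 2, 2     # states, actions, observations, controller nodes
gamma = 0.95
b0 = np.array([0.5, 0.5])       # initial belief over states

def random_stochastic(shape):
    m = rng.random(shape)
    return m / m.sum(axis=-1, keepdims=True)

T = random_stochastic((nA, nS, nS))  # T[a, s, s']: transition probabilities
O = random_stochastic((nA, nS, nZ))  # O[a, s', z]: observation probabilities
R = rng.random((nS, nA))             # R[s, a]: expected immediate reward

def controller(params):
    """Map unconstrained logits to a stochastic finite-state controller:
    psi[n, a] = P(action | node), eta[n, z, n'] = P(next node | node, obs)."""
    psi_logits, eta_logits = params
    psi = np.exp(psi_logits - psi_logits.max(axis=1, keepdims=True))
    psi /= psi.sum(axis=1, keepdims=True)
    eta = np.exp(eta_logits - eta_logits.max(axis=2, keepdims=True))
    eta /= eta.sum(axis=2, keepdims=True)
    return psi, eta

def value(params):
    """Exact controller value from b0: solve the linear system for the
    Markov chain over joint (node, state) pairs."""
    psi, eta = controller(params)
    # P[(n,s) -> (n',s')]: choose a by psi, move by T, observe by O, update node by eta
    P = np.einsum('na,ast,atz,nzm->nsmt', psi, T, O, eta).reshape(nN * nS, nN * nS)
    r = np.einsum('na,sa->ns', psi, R).ravel()
    V = np.linalg.solve(np.eye(nN * nS) - gamma * P, r)
    return V.reshape(nN, nS)[0] @ b0    # start in node 0 at belief b0

def grad(params, eps=1e-4):
    """Crude finite-difference gradient; the paper derives exact gradients."""
    base = value(params)
    grads = []
    for p in params:
        g = np.zeros_like(p)
        flat, gflat = p.ravel(), g.ravel()
        for i in range(flat.size):
            old = flat[i]
            flat[i] = old + eps
            gflat[i] = (value(params) - base) / eps
            flat[i] = old
        grads.append(g)
    return grads

params = [rng.normal(size=(nN, nA)), rng.normal(size=(nN, nZ, nN))]
for _ in range(200):
    for p, g in zip(params, grad(params)):
        p += 0.5 * g
print("value of locally optimal stochastic controller:", value(params))

Fixing the controller size in advance is what makes the policy space finite-dimensional, so standard local optimization applies; the branch-and-bound method in the abstract instead searches the discrete space of deterministic controllers of that size.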