Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Martin Chmelík

Stochastic Shortest Path with Energy Constraints in POMDPs

May 11, 2016

Tomáš Brázdil, Krishnendu Chatterjee, Martin Chmelík, Anchit Gupta, Petr Novotný

Figure 1 for Stochastic Shortest Path with Energy Constraints in POMDPs

Abstract:We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition. The traditional optimization objective (stochastic shortest path) asks to minimize the expected total cost until the target set is reached. We extend the traditional framework of POMDPs to model energy consumption, which represents a hard constraint. The energy levels may increase and decrease with transitions, and the hard constraint requires that the energy level must remain positive in all steps till the target is reached. First, we present a novel algorithm for solving POMDPs with energy levels, developing on existing POMDP solvers and using RTDP as its main method. Our second contribution is related to policy representation. For larger POMDP instances the policies computed by existing solvers are too large to be understandable. We present an automated procedure based on machine learning techniques that automatically extracts important decisions of the policy allowing us to compute succinct human readable policies. Finally, we show experimentally that our algorithm performs well and computes succinct policies on a number of POMDP instances from the literature that were naturally enhanced with energy levels.

* Technical report accompanying a paper published in proceedings of AAMAS 2016

Via

Access Paper or Ask Questions

Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Feb 18, 2015

Krishnendu Chatterjee, Martin Chmelík, Raghav Gupta, Ayush Kanodia

Figure 1 for Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Figure 2 for Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Figure 3 for Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Figure 4 for Qualitative Analysis of POMDPs with Temporal Logic Specifications for Robotics Applications

Abstract:We consider partially observable Markov decision processes (POMDPs), that are a standard framework for robotics applications to model uncertainties present in the real world, with temporal logic specifications. All temporal logic specifications in linear-time temporal logic (LTL) can be expressed as parity objectives. We study the qualitative analysis problem for POMDPs with parity objectives that asks whether there is a controller (policy) to ensure that the objective holds with probability 1 (almost-surely). While the qualitative analysis of POMDPs with parity objectives is undecidable, recent results show that when restricted to finite-memory policies the problem is EXPTIME-complete. While the problem is intractable in theory, we present a practical approach to solve the qualitative analysis problem. We designed several heuristics to deal with the exponential complexity, and have used our implementation on a number of well-known POMDP examples for robotics applications. Our results provide the first practical approach to solve the qualitative analysis of robot motion planning with LTL properties in the presence of uncertainty.

Via

Access Paper or Ask Questions

Optimal Cost Almost-sure Reachability in POMDPs

Nov 14, 2014

Krishnendu Chatterjee, Martin Chmelík, Raghav Gupta, Ayush Kanodia

Figure 1 for Optimal Cost Almost-sure Reachability in POMDPs

Figure 2 for Optimal Cost Almost-sure Reachability in POMDPs

Abstract:We consider partially observable Markov decision processes (POMDPs) with a set of target states and every transition is associated with an integer cost. The optimization objective we study asks to minimize the expected total cost till the target set is reached, while ensuring that the target set is reached almost-surely (with probability 1). We show that for integer costs approximating the optimal cost is undecidable. For positive costs, our results are as follows: (i) we establish matching lower and upper bounds for the optimal cost and the bound is double exponential; (ii) we show that the problem of approximating the optimal cost is decidable and present approximation algorithms developing on the existing algorithms for POMDPs with finite-horizon objectives. While the worst-case running time of our algorithm is double exponential, we also present efficient stopping criteria for the algorithm and show experimentally that it performs well in many examples of interest.

* Full Version of Optimal Cost Almost-sure Reachability in POMDPs, AAAI 2015. arXiv admin note: text overlap with arXiv:1207.4166 by other authors

Via

Access Paper or Ask Questions

POMDPs under Probabilistic Semantics

Aug 22, 2013

Krishnendu Chatterjee, Martin Chmelík

Figure 1 for POMDPs under Probabilistic Semantics

Figure 2 for POMDPs under Probabilistic Semantics

Figure 3 for POMDPs under Probabilistic Semantics

Figure 4 for POMDPs under Probabilistic Semantics

Abstract:We consider partially observable Markov decision processes (POMDPs) with limit-average payoff, where a reward value in the interval [0,1] is associated to every transition, and the payoff of an infinite path is the long-run average of the rewards. We consider two types of path constraints: (i) quantitative constraint defines the set of paths where the payoff is at least a given threshold {\lambda} in (0, 1]; and (ii) qualitative constraint which is a special case of quantitative constraint with {\lambda} = 1. We consider the computation of the almost-sure winning set, where the controller needs to ensure that the path constraint is satisfied with probability 1. Our main results for qualitative path constraint are as follows: (i) the problem of deciding the existence of a finite-memory controller is EXPTIME-complete; and (ii) the problem of deciding the existence of an infinite-memory controller is undecidable. For quantitative path constraint we show that the problem of deciding the existence of a finite-memory controller is undecidable.

* Full version of: POMDPs under Probabilistic Semantics, UAI 2013

Via

Access Paper or Ask Questions