Abstract: Strategies for partially observable Markov decision processes (POMDPs) typically require memory. One way to represent this memory is via automata. We present a method to learn an automaton representation of a strategy using a modification of the L*-algorithm. Compared to the tabular representation of a strategy, the resulting automaton is dramatically smaller and thus also more explainable. Moreover, in the learning process, our heuristics may even improve the strategy's performance. In contrast to approaches that synthesize an automaton directly from the POMDP, thereby solving it, our approach is vastly more scalable.
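To make the idea concrete, below is a minimal, hypothetical sketch (not the paper's implementation) of an L*-style observation-table learner that extracts a small finite-state controller from a tabular strategy. The teacher answers membership queries by consulting the tabular strategy (here a toy parity strategy, strategy_action), and the equivalence query is approximated by random sampling; the consistency check of full L* is omitted for brevity. The observation alphabet and all names are illustrative assumptions.

import random

OBS = ["a", "b"]  # observation alphabet (hypothetical)

def strategy_action(trace):
    # Hypothetical tabular strategy standing in for the POMDP strategy:
    # choose the action based on the parity of 'b' observations seen.
    return "left" if trace.count("b") % 2 == 0 else "right"

def row(prefix, suffixes):
    # One row of the L* observation table: the actions the strategy
    # prescribes after prefix extended by each distinguishing suffix.
    return tuple(strategy_action(prefix + s) for s in suffixes)

def learn(max_rounds=10, samples=2000):
    prefixes, suffixes = [""], [""] + OBS
    for _ in range(max_rounds):
        rows = {row(p, suffixes): p for p in prefixes}
        extended = [p + o for p in prefixes for o in OBS]
        # Closedness: every one-step extension must match a known row.
        missing = [e for e in extended if row(e, suffixes) not in rows]
        if missing:
            prefixes.append(missing[0])
            continue
        # Build the hypothesis automaton; states are representative prefixes.
        state_of = {p: rows[row(p, suffixes)] for p in prefixes + extended}
        trans = {(state_of[p], o): state_of[p + o]
                 for p in prefixes for o in OBS}
        out = {s: strategy_action(s) for s in rows.values()}
        start = state_of[""]
        # Approximate the equivalence query by sampling random traces.
        cex = None
        for _ in range(samples):
            t = "".join(random.choices(OBS, k=random.randint(0, 8)))
            s = start
            for o in t:
                s = trans[(s, o)]
            if out[s] != strategy_action(t):
                cex = t
                break
        if cex is None:
            return trans, out, start
        # Refine: add all prefixes of the counterexample to the table.
        prefixes += [cex[:i] for i in range(1, len(cex) + 1)
                     if cex[:i] not in prefixes]
    return None

On this toy strategy the learner converges to a two-state controller (tracking the parity of 'b'), illustrating how the learned automaton can be dramatically smaller than the table it was extracted from.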
Abstract: We present MULTIGAIN 2.0, a major extension to the controller synthesis tool MultiGain, built on top of the probabilistic model checker PRISM. This new version extends MultiGain's multi-objective capabilities by allowing for the formal verification and synthesis of controllers for probabilistic systems with multi-dimensional long-run average reward structures, steady-state constraints, and linear temporal logic properties. Additionally, MULTIGAIN 2.0 provides an approach for finding finite-memory solutions and the capability for two- and three-dimensional visualization of Pareto curves to facilitate trade-off analysis in multi-objective scenarios.
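As a hedged illustration of the trade-off analysis that Pareto-curve visualization supports, the following generic sketch (not MULTIGAIN's API; the reward vectors are invented) filters a set of achievable two-dimensional long-run average reward points down to its Pareto-optimal subset, i.e., the curve one would plot.

def pareto_front(points):
    # Keep the points not weakly dominated by any other point
    # (both objectives are maximized).
    front = []
    for p in points:
        dominated = any(q != p and q[0] >= p[0] and q[1] >= p[1]
                        for q in points)
        if not dominated:
            front.append(p)
    return sorted(front)

# Hypothetical achievable (throughput, quality) long-run averages:
achievable = [(0.2, 0.9), (0.5, 0.7), (0.5, 0.8), (0.7, 0.4), (0.3, 0.3)]
print(pareto_front(achievable))  # [(0.2, 0.9), (0.5, 0.8), (0.7, 0.4)]

Each point on the resulting frontier corresponds to a controller that cannot improve one long-run average objective without degrading the other, which is precisely the trade-off the two- and three-dimensional visualizations are meant to expose.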