Title: Active Learning of Markov Decision Processes using Baum-Welch algorithm (Extended)

Oct 06, 2021

Giovanni Bacci, Anna Ingólfsdóttir, Kim Larsen, Raphaël Reynouard

Abstract: Cyber-physical systems (CPSs) are naturally modelled as reactive systems with nondeterministic and probabilistic dynamics. Model-based verification techniques have proved effective in the deployment of safety-critical CPSs. Central to the successful application of such techniques is the construction of an accurate formal model for the system. Manual construction can be a resource-demanding and error-prone process, thus motivating the design of automata learning algorithms to synthesise a system model from observed system behaviours. This paper revisits and adapts the classic Baum-Welch algorithm for learning Markov decision processes (MDPs) and Markov chains. For the case of MDPs, which typically demand more observations, we present a model-based active learning sampling strategy that chooses examples which are most informative with respect to the current model hypothesis. We empirically compare our approach with state-of-the-art tools and demonstrate that the proposed active learning procedure can significantly reduce the number of observations required to obtain accurate models.

* 7 pages, 7 figures, submitted and accepted (short) to ICMLA 2021
