Hector Kohler

Scool, CRIStAL

Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

Mar 11, 2025

Hector Kohler, Quentin Delfosse, Waris Radji, Riad Akrour, Philippe Preux

Abstract: Some applications of reinforcement learning, such as medicine, require policies that are "interpretable" by humans. User studies have shown that some policy classes might be more interpretable than others. However, it is costly to conduct human studies of policy interpretability. Furthermore, there is no clear definition of policy interpretability, i.e., no clear metrics for interpretability, and thus claims depend on the chosen definition. We tackle the problem of empirically evaluating policy interpretability without humans. Despite this lack of a clear definition, researchers agree on the notion of "simulatability": policy interpretability should relate to how humans understand policy actions given states. To advance research in interpretable reinforcement learning, we contribute a new methodology to evaluate policy interpretability. This new methodology relies on proxies for simulatability that we use to conduct a large-scale empirical evaluation of policy interpretability. We use imitation learning to compute baseline policies by distilling expert neural networks into small programs. We then show that using our methodology to evaluate the interpretability of the baselines leads to conclusions similar to those of user studies. We show that increasing interpretability does not necessarily reduce performance and can sometimes increase it. We also show that no policy class trades off interpretability and performance better than the others across tasks, making it necessary for researchers to have methodologies for comparing policy interpretability.

* 12 pages of main text, under review

Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning

May 23, 2024

Hector Kohler, Quentin Delfosse, Riad Akrour, Kristian Kersting, Philippe Preux

Abstract: Deep reinforcement learning agents are prone to goal misalignments. The black-box nature of their policies hinders the detection and correction of such misalignments, and the trust necessary for real-world deployment. So far, solutions that learn interpretable policies are inefficient or require many human priors. We propose INTERPRETER, a fast distillation method producing INTerpretable Editable tRee Programs for ReinforcEmenT lEaRning. We empirically demonstrate that INTERPRETER's compact tree programs match oracles across a diverse set of sequential decision tasks, and we evaluate the impact of our design choices on interpretability and performance. We show that our policies can be interpreted and edited to correct misalignments on Atari games and to explain real farming strategies.

PID Tuning using Cross-Entropy Deep Learning: a Lyapunov Stability Analysis

Apr 18, 2024

Hector Kohler, Benoit Clement, Thomas Chaffre, Gilles Le Chenadec

Abstract: Underwater Unmanned Vehicles (UUVs) have to constantly compensate for the external disturbing forces acting on their body. Adaptive control theory is commonly used in this setting to give the control law some flexibility in its response to process variation. Today, learning-based (LB) adaptive methods, which combine model-based control structures with deep model-free learning algorithms, are leading the field. This work proposes experiments and metrics to empirically study the stability of such a controller. We perform this stability analysis on an LB adaptive control system whose adaptive parameters are determined using a Cross-Entropy Deep Learning method.

* IFAC-PapersOnLine, Volume 55, Issue 31, 2022

Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop

Apr 16, 2024

Hector Kohler, Quentin Delfosse, Paul Festor, Philippe Preux

Abstract: Embracing the pursuit of intrinsically explainable reinforcement learning raises crucial questions: What distinguishes explainability from interpretability? Should explainable and interpretable agents be developed outside of domains where transparency is imperative? What advantages do interpretable policies offer over neural networks? How can we rigorously define and measure interpretability in policies, without user studies? Which reinforcement learning paradigms are best suited to developing interpretable agents? Can Markov Decision Processes integrate interpretable state representations? In addition to motivating an Interpretable RL community centered around the aforementioned questions, we propose the first venue dedicated to Interpretable RL: the InterpPol Workshop.

Discovering the Interpretability-Performance Pareto Front of Decision Trees with Dynamic Programming

Sep 22, 2023

Hector Kohler, Riad Akrour, Philippe Preux

Abstract: Decision trees are known to be intrinsically interpretable as they can be inspected and interpreted by humans. Furthermore, recent hardware advances have rekindled an interest in optimal decision tree algorithms that produce more accurate trees than the usual greedy approaches. However, these optimal algorithms return a single tree optimizing a hand-defined interpretability-performance trade-off, obtained by specifying a maximum number of decision nodes, and give no further insight into the quality of this trade-off. In this paper, we propose a new Markov Decision Problem (MDP) formulation for finding optimal decision trees. The main interest of this formulation is that we can compute the optimal decision trees for several interpretability-performance trade-offs by solving a single dynamic program, letting the user choose a posteriori the tree that best suits their needs. Empirically, we show that our method is competitive with state-of-the-art algorithms in terms of accuracy and runtime while returning a whole set of trees on the interpretability-performance Pareto front.
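
To make the notion of an interpretability-performance Pareto front concrete, here is a minimal sketch. It is not the paper's dynamic program: it only filters an already available list of candidate trees, each summarized by a hypothetical `(num_nodes, accuracy)` pair, and keeps those that no other candidate beats on both size and accuracy. The function name and the sample numbers are assumptions made for illustration.

```python
from typing import List, Tuple


def pareto_front(trees: List[Tuple[int, float]]) -> List[Tuple[int, float]]:
    """Return the (num_nodes, accuracy) pairs that are not dominated.

    A pair is dominated when another pair has no more nodes and no lower
    accuracy, and is strictly better on at least one of the two.
    """
    front: List[Tuple[int, float]] = []
    best_acc = float("-inf")
    # Sort by size ascending; among equal sizes, highest accuracy first.
    for nodes, acc in sorted(set(trees), key=lambda t: (t[0], -t[1])):
        # A larger tree is only worth keeping if it is strictly more accurate
        # than every smaller (or equally sized) tree seen so far.
        if acc > best_acc:
            front.append((nodes, acc))
            best_acc = acc
    return front


# Made-up candidates: (number of decision nodes, accuracy).
candidates = [(1, 0.70), (3, 0.85), (3, 0.80), (5, 0.84), (7, 0.91), (9, 0.91)]
print(pareto_front(candidates))  # [(1, 0.7), (3, 0.85), (7, 0.91)]
```

A user can then pick a posteriori from the returned list, for instance the smallest tree whose accuracy exceeds a threshold they care about.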

AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents

Jun 19, 2023

Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Centa de Medeiros, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

Abstract: The reproducibility of many experimental results in Deep Reinforcement Learning (RL) is under question. To solve this reproducibility crisis, we propose a theoretically sound methodology to compare multiple Deep RL algorithms. The performance of one execution of a Deep RL algorithm is random, so independent executions are needed to assess it precisely. When comparing several RL algorithms, a major question is how many executions must be made and how we can ensure that the results of such a comparison are theoretically sound. Researchers in Deep RL often use fewer than 5 independent executions to compare algorithms: we claim that this is not enough in general. Moreover, when comparing several algorithms at once, the error of each comparison accumulates and must be taken into account with a multiple tests procedure to preserve low error guarantees. To address this problem in a statistically sound way, we introduce AdaStop, a new statistical test based on multiple group sequential tests. When comparing algorithms, AdaStop adapts the number of executions to stop as early as possible while ensuring that we have enough information to distinguish algorithms that perform better than the others in a statistically significant way. We prove both theoretically and empirically that AdaStop has a low probability of making an error (Family-Wise Error). Finally, we illustrate the effectiveness of AdaStop in multiple use-cases, including toy examples and difficult cases such as Mujoco environments.
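
The remark that the error of each comparison accumulates can be illustrated with a textbook calculation. This is not AdaStop's own procedure; it assumes $m$ independent comparisons, each run at significance level $\alpha$, and shows how the family-wise error rate (FWER) grows with $m$.

```latex
\[
\mathrm{FWER}
  = \Pr(\text{at least one false rejection})
  = 1 - (1 - \alpha)^{m}
  \le m\,\alpha .
\]
% Example: \alpha = 0.05 and m = 10 comparisons give
\[
1 - 0.95^{10} \approx 0.40 ,
\]
% so a false positive somewhere is far more likely than 5%.
% A Bonferroni correction runs each test at level \alpha / m instead:
\[
1 - \Bigl(1 - \tfrac{\alpha}{m}\Bigr)^{m} \le \alpha .
\]
```

Under these assumptions, ten uncorrected comparisons at the 5% level produce at least one spurious difference about 40% of the time, which is why a multiple-testing procedure is needed.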

Optimal Interpretability-Performance Trade-off of Classification Trees with Black-Box Reinforcement Learning

Apr 11, 2023

Hector Kohler, Riad Akrour, Philippe Preux

Abstract: Interpretability of AI models allows for user safety checks to build trust in these models. In particular, decision trees (DTs) provide a global view of the learned model and clearly outline the role of the features that are critical to classifying a given data point. However, interpretability is hindered if the DT is too large. To learn compact trees, a Reinforcement Learning (RL) framework has recently been proposed to explore the space of DTs. A given supervised classification task is modeled as a Markov decision problem (MDP) and then augmented with additional actions that gather information about the features, which is equivalent to building a DT. By appropriately penalizing these actions, the RL agent learns to optimally trade off the size and performance of a DT. However, to do so, this RL agent has to solve a partially observable MDP. The main contribution of this paper is to prove that it is sufficient to solve a fully observable problem to learn a DT optimizing the interpretability-performance trade-off. As such, any planning or RL algorithm can be used. We demonstrate the effectiveness of this approach on a set of classical supervised classification datasets and compare our approach with other interpretability-performance optimizing methods.
