The use of machine learning (ML) models in decision-making contexts, particularly in high-stakes decisions, is fraught with issues and peril, since a person, not a machine, must ultimately be held accountable for the consequences of decisions made using such systems. Machine learning explainability (MLX) promises to provide decision-makers with prediction-specific rationale, assuring them that model-elicited predictions are made for the right reasons and are thus reliable. Few works, however, explicitly consider this key human-in-the-loop (HITL) component. In this work we propose HEX, a human-in-the-loop deep reinforcement learning approach to MLX. HEX incorporates 0-distrust projection to synthesize decider-specific explanation-providing policies from any arbitrary classification model. HEX is also constructed to operate in limited or reduced training data scenarios, such as those employing federated learning. Our formulation explicitly considers the decision boundary of the ML model in question rather than the underlying training data, a reliance that is a shortcoming of many model-agnostic MLX methods. Our proposed methods thus synthesize HITL MLX policies that explicitly capture the decision boundary of the model in question for use in limited-data scenarios.