Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nathan Hunt

Verifiably Safe Exploration for End-to-End Reinforcement Learning

Jul 02, 2020

Nathan Hunt, Nathan Fulton, Sara Magliacane, Nghia Hoang, Subhro Das, Armando Solar-Lezama

Figure 1 for Verifiably Safe Exploration for End-to-End Reinforcement Learning

Figure 2 for Verifiably Safe Exploration for End-to-End Reinforcement Learning

Figure 3 for Verifiably Safe Exploration for End-to-End Reinforcement Learning

Figure 4 for Verifiably Safe Exploration for End-to-End Reinforcement Learning

Abstract:Deploying deep reinforcement learning in safety-critical settings requires developing algorithms that obey hard constraints during exploration. This paper contributes a first approach toward enforcing formal safety constraints on end-to-end policies with visual inputs. Our approach draws on recent advances in object detection and automated reasoning for hybrid dynamical systems. The approach is evaluated on a novel benchmark that emphasizes the challenge of safely exploring in the presence of hard constraints. Our benchmark draws from several proposed problem sets for safe learning and includes problems that emphasize challenges such as reward signals that are not aligned with safety constraints. On each of these benchmark problems, our algorithm completely avoids unsafe behavior while remaining competitive at optimizing for as much reward as is safe. We also prove that our method of enforcing the safety constraints preserves all safe policies from the original environment.

Via

Access Paper or Ask Questions

Formal Verification of End-to-End Learning in Cyber-Physical Systems: Progress and Challenges

Jun 15, 2020

Nathan Fulton, Nathan Hunt, Nghia Hoang, Subhro Das

Figure 1 for Formal Verification of End-to-End Learning in Cyber-Physical Systems: Progress and Challenges

Figure 2 for Formal Verification of End-to-End Learning in Cyber-Physical Systems: Progress and Challenges

Figure 3 for Formal Verification of End-to-End Learning in Cyber-Physical Systems: Progress and Challenges

Figure 4 for Formal Verification of End-to-End Learning in Cyber-Physical Systems: Progress and Challenges

Abstract:Autonomous systems -- such as self-driving cars, autonomous drones, and automated trains -- must come with strong safety guarantees. Over the past decade, techniques based on formal methods have enjoyed some success in providing strong correctness guarantees for large software systems including operating system kernels, cryptographic protocols, and control software for drones. These successes suggest it might be possible to ensure the safety of autonomous systems by constructing formal, computer-checked correctness proofs. This paper identifies three assumptions underlying existing formal verification techniques, explains how each of these assumptions limits the applicability of verification in autonomous systems, and summarizes preliminary work toward improving the strength of evidence provided by formal verification.

* 7 pages, 4 figures. NeurIPS Workshop on Safety and Robustness in Decision Making, 2019

Via

Access Paper or Ask Questions

Clinical Intervention Prediction and Understanding using Deep Networks

May 23, 2017

Harini Suresh, Nathan Hunt, Alistair Johnson, Leo Anthony Celi, Peter Szolovits, Marzyeh Ghassemi

Figure 1 for Clinical Intervention Prediction and Understanding using Deep Networks

Figure 2 for Clinical Intervention Prediction and Understanding using Deep Networks

Figure 3 for Clinical Intervention Prediction and Understanding using Deep Networks

Figure 4 for Clinical Intervention Prediction and Understanding using Deep Networks

Abstract:Real-time prediction of clinical interventions remains a challenge within intensive care units (ICUs). This task is complicated by data sources that are noisy, sparse, heterogeneous and outcomes that are imbalanced. In this paper, we integrate data from all available ICU sources (vitals, labs, notes, demographics) and focus on learning rich representations of this data to predict onset and weaning of multiple invasive interventions. In particular, we compare both long short-term memory networks (LSTM) and convolutional neural networks (CNN) for prediction of five intervention tasks: invasive ventilation, non-invasive ventilation, vasopressors, colloid boluses, and crystalloid boluses. Our predictions are done in a forward-facing manner to enable "real-time" performance, and predictions are made with a six hour gap time to support clinically actionable planning. We achieve state-of-the-art results on our predictive tasks using deep architectures. We explore the use of feature occlusion to interpret LSTM models, and compare this to the interpretability gained from examining inputs that maximally activate CNN outputs. We show that our models are able to significantly outperform baselines in intervention prediction, and provide insight into model learning, which is crucial for the adoption of such models in practice.

Via

Access Paper or Ask Questions