Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hanyang Hu

Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Sep 29, 2024

Hanyang Hu, Xilun Zhang, Xubo Lyu, Mo Chen

Figure 1 for Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Figure 2 for Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Figure 3 for Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Figure 4 for Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Abstract:Deep Reinforcement Learning (RL) has shown remarkable success in robotics with complex and heterogeneous dynamics. However, its vulnerability to unknown disturbances and adversarial attacks remains a significant challenge. In this paper, we propose a robust policy training framework that integrates model-based control principles with adversarial RL training to improve robustness without the need for external black-box adversaries. Our approach introduces a novel Hamilton-Jacobi reachability-guided disturbance for adversarial RL training, where we use interpretable worst-case or near-worst-case disturbances as adversaries against the robust policy. We evaluated its effectiveness across three distinct tasks: a reach-avoid game in both simulation and real-world settings, and a highly dynamic quadrotor stabilization task in simulation. We validate that our learned critic network is consistent with the ground-truth HJ value function, while the policy network shows comparable performance with other learning-based methods.

Via

Access Paper or Ask Questions

Task-Oriented Koopman-Based Control with Contrastive Encoder

Sep 28, 2023

Xubo Lyu, Hanyang Hu, Seth Siriya, Ye Pu, Mo Chen

Abstract:We present task-oriented Koopman-based control that utilizes end-to-end reinforcement learning and contrastive encoder to simultaneously learn the Koopman latent embedding, operator and associated linear controller within an iterative loop. By prioritizing the task cost as main objective for controller learning, we reduce the reliance of controller design on a well-identified model, which extends Koopman control beyond low-dimensional systems to high-dimensional, complex nonlinear systems, including pixel-based scenarios.

* Accepted by the 7th Annual Conference on Robot Learning (CoRL), 2023 (oral spotlight)

Via

Access Paper or Ask Questions