Tomer Arnon

Entropy-regularized Point-based Value Iteration

Feb 14, 2024

Harrison Delecki, Marcell Vazquez-Chanlatte, Esen Yel, Kyle Wray, Tomer Arnon, Stefan Witwicki, Mykel J. Kochenderfer

Abstract: Model-based planners for partially observable problems must accommodate both model uncertainty during planning and goal uncertainty during objective inference. However, model-based planners may be brittle under these types of uncertainty because they rely on an exact model and tend to commit to a single optimal behavior. Inspired by results in the model-free setting, we propose an entropy-regularized model-based planner for partially observable problems. Entropy regularization promotes policy robustness for planning and objective inference by encouraging policies to be no more committed to a single action than necessary. We evaluate the robustness and objective inference performance of entropy-regularized policies in three problem domains. Our results show that entropy-regularized policies outperform non-entropy-regularized baselines in terms of higher expected returns under modeling errors and higher accuracy during objective inference.

Algorithms for Verifying Deep Neural Networks

Mar 15, 2019

Changliu Liu, Tomer Arnon, Christopher Lazarus, Clark Barrett, Mykel J. Kochenderfer

Abstract: Deep neural networks are widely used for nonlinear function approximation with applications ranging from computer vision to control. Although these networks involve the composition of simple arithmetic operations, it can be very challenging to verify whether a particular network satisfies certain input-output properties. This article surveys methods that have emerged recently for soundly verifying such properties. These methods borrow insights from reachability analysis, optimization, and search. We discuss fundamental differences and connections between existing algorithms. In addition, we provide pedagogical implementations of existing methods and compare them on a set of benchmark problems.
