Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander Vladimirsky

Surveillance Evasion Through Bayesian Reinforcement Learning

Sep 30, 2021

Dongping Qi, David Bindel, Alexander Vladimirsky

Figure 1 for Surveillance Evasion Through Bayesian Reinforcement Learning

Figure 2 for Surveillance Evasion Through Bayesian Reinforcement Learning

Figure 3 for Surveillance Evasion Through Bayesian Reinforcement Learning

Abstract:We consider a 2D continuous path planning problem with a completely unknown intensity of random termination: an Evader is trying to escape a domain while minimizing the cumulative risk of detection (termination) by adversarial Observers. Those Observers' surveillance intensity is a priori unknown and has to be learned through repetitive path planning. We propose a new algorithm that utilizes Gaussian process regression to model the unknown surveillance intensity and relies on a confidence bound technique to promote strategic exploration. We illustrate our method through several examples and confirm the convergence of averaged regret experimentally.

* 6 pages, 3 figures

Via

Access Paper or Ask Questions

A bi-criteria path planning algorithm for robotics applications

Jan 08, 2017

Zachary Clawson, Xuchu Ding, Brendan Englot, Thomas A. Frewen, William M. Sisson, Alexander Vladimirsky

Figure 1 for A bi-criteria path planning algorithm for robotics applications

Figure 2 for A bi-criteria path planning algorithm for robotics applications

Figure 3 for A bi-criteria path planning algorithm for robotics applications

Figure 4 for A bi-criteria path planning algorithm for robotics applications

Abstract:Realistic path planning applications often require optimizing with respect to several criteria simultaneously. Here we introduce an efficient algorithm for bi-criteria path planning on graphs. Our approach is based on augmenting the state space to keep track of the "budget" remaining to satisfy the constraints on secondary cost. The resulting augmented graph is acyclic and the primary cost can be then minimized by a simple upward sweep through budget levels. The efficiency and accuracy of our algorithm is tested on Probabilistic Roadmap graphs to minimize the distance of travel subject to a constraint on the overall threat exposure of the robot. We also present the results from field experiments illustrating the use of this approach on realistic robotic systems.

* 19 pages, 12 figures; submitted for publication to IEEE Transactions on Automation Science and Engineering

Via

Access Paper or Ask Questions

Optimal control with reset-renewable resources

Sep 27, 2014

Ryo Takei, Weiyan Chen, Zachary Clawson, Slav Kirov, Alexander Vladimirsky

Figure 1 for Optimal control with reset-renewable resources

Figure 2 for Optimal control with reset-renewable resources

Figure 3 for Optimal control with reset-renewable resources

Figure 4 for Optimal control with reset-renewable resources

Abstract:We consider both discrete and continuous control problems constrained by a fixed budget of some resource, which may be renewed upon entering a preferred subset of the state space. In the discrete case, we consider both deterministic and stochastic shortest path problems with full budget resets in all preferred nodes. In the continuous case, we derive augmented PDEs of optimal control, which are then solved numerically on the extended state space with a full/instantaneous budget reset on the preferred subset. We introduce an iterative algorithm for solving these problems efficiently. The method's performance is demonstrated on a range of computational examples, including the optimal path planning with constraints on prolonged visibility by a static enemy observer. In addition, we also develop an algorithm that works on the original state space to solve a related but simpler problem: finding the subsets of the domain "reachable-within-the-budget". This manuscript is an extended version of the paper accepted for publication by SIAM J. on Control and Optimization. In the journal version, Section 3 and the Appendix were omitted due to space limitations.

* 31 pages, 13 figures; accepted by SIAM J. on Control & Optimization (updated to address reviewers' comments)

Via

Access Paper or Ask Questions