Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pierre-Arnaud Coquelin

INRIA Futurs

Optimal Policies Search for Sensor Management

Mar 19, 2009

Thomas Bréhard, Emmanuel Duflos, Philippe Vanheeghe, Pierre-Arnaud Coquelin

Figure 1 for Optimal Policies Search for Sensor Management

Figure 2 for Optimal Policies Search for Sensor Management

Figure 3 for Optimal Policies Search for Sensor Management

Abstract:This paper introduces a new approach to solve sensor management problems. Classically sensor management problems can be well formalized as Partially-Observed Markov Decision Processes (POMPD). The original approach developped here consists in deriving the optimal parameterized policy based on a stochastic gradient estimation. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environement and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method to approximate the gradient, based on Infinitesimal Perturbation Approximation (IPA). The effectiveness of this general framework is illustrated by the managing of an Electronically Scanned Array Radar. First simulations results are finally proposed.

Via

Access Paper or Ask Questions