Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Girish N. Nair

Entropy-Regularized Partially Observed Markov Decision Processes

Dec 22, 2021

Timothy L. Molloy, Girish N. Nair

Figure 1 for Entropy-Regularized Partially Observed Markov Decision Processes

Figure 2 for Entropy-Regularized Partially Observed Markov Decision Processes

Figure 3 for Entropy-Regularized Partially Observed Markov Decision Processes

Abstract:We investigate partially observed Markov decision processes (POMDPs) with cost functions regularized by entropy terms describing state, observation, and control uncertainty. Standard POMDP techniques are shown to offer bounded-error solutions to these entropy-regularized POMDPs, with exact solutions when the regularization involves the joint entropy of the state, observation, and control trajectories. Our joint-entropy result is particularly surprising since it constitutes a novel, tractable formulation of active state estimation.

* 20 pages, 2 figures, submitted

Via

Access Paper or Ask Questions

Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

Aug 19, 2021

Timothy L. Molloy, Girish N. Nair

Figure 1 for Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

Figure 2 for Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

Figure 3 for Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

Figure 4 for Smoother Entropy for Active State Trajectory Estimation and Obfuscation in POMDPs

Abstract:We study the problem of controlling a partially observed Markov decision process (POMDP) to either aid or hinder the estimation of its state trajectory by optimising the conditional entropy of the state trajectory given measurements and controls, a quantity we dub the smoother entropy. Our consideration of the smoother entropy contrasts with previous active state estimation and obfuscation approaches that instead resort to measures of marginal (or instantaneous) state uncertainty due to tractability concerns. By establishing novel expressions of the smoother entropy in terms of the usual POMDP belief state, we show that our active estimation and obfuscation problems can be reformulated as Markov decision processes (MDPs) that are fully observed in the belief state. Surprisingly, we identify belief-state MDP reformulations of both active estimation and obfuscation with concave cost and cost-to-go functions, which enables the use of standard POMDP techniques to construct tractable bounded-error (approximate) solutions. We show in simulations that optimisation of the smoother entropy leads to superior trajectory estimation and obfuscation compared to alternative approaches.

* 41 pages, 2 figures, submitted (under review)

Via

Access Paper or Ask Questions

Intelligent Reference Curation for Visual Place Recognition via Bayesian Selective Fusion

Oct 19, 2020

Timothy L. Molloy, Tobias Fischer, Michael Milford, Girish N. Nair

Figure 1 for Intelligent Reference Curation for Visual Place Recognition via Bayesian Selective Fusion

Figure 2 for Intelligent Reference Curation for Visual Place Recognition via Bayesian Selective Fusion

Figure 3 for Intelligent Reference Curation for Visual Place Recognition via Bayesian Selective Fusion

Figure 4 for Intelligent Reference Curation for Visual Place Recognition via Bayesian Selective Fusion

Abstract:The key challenge of visual place recognition (VPR) lies in recognizing places despite drastic visual appearance changes due to factors such as time of day, season, or weather or lighting conditions. Numerous approaches based on deep-learnt image descriptors, sequence matching, domain translation, and probabilistic localization have had success in addressing this challenge, but most rely on the availability of carefully curated representative reference images of the possible places. In this paper, we propose a novel approach, dubbed Bayesian Selective Fusion, for actively selecting and fusing informative reference images to determine the best place match for a given query image. The selective element of our approach avoids the counterproductive fusion of every reference image and enables the dynamic selection of informative reference images in environments with changing visual conditions (such as indoors with flickering lights, outdoors during sunshowers or over the day-night cycle). The probabilistic element of our approach provides a means of fusing multiple reference images that accounts for their varying uncertainty via a novel training-free likelihood function for VPR. On difficult query images from two benchmark datasets, we demonstrate that our approach matches and exceeds the performance of several alternative fusion approaches along with state-of-the-art techniques that are provided with a priori (unfair) knowledge of the best reference images. Our approach is well suited for long-term robot autonomy where dynamic visual environments are commonplace since it is training-free, descriptor-agnostic, and complements existing techniques such as sequence matching.

* 8 pages, 10 figures, paper is currently under review

Via

Access Paper or Ask Questions