Jonathan Shapiro

Thompson Sampling in Switching Environments with Bayesian Online Change Point Detection

Feb 15, 2013

Joseph Mellor, Jonathan Shapiro

Abstract: Thompson Sampling has recently been shown to be optimal in the Bernoulli Multi-Armed Bandit setting [Kaufmann et al., 2012]. This bandit problem assumes stationary distributions for the rewards. It is often unrealistic to model the real world as a stationary distribution. In this paper we derive and evaluate algorithms using Thompson Sampling for a Switching Multi-Armed Bandit Problem. We propose a Thompson Sampling strategy equipped with a Bayesian change point mechanism to tackle this problem. We develop algorithms for a variety of cases with constant switching rate: when all arms change together at a switch (Global Switching), when switching occurs independently for each arm (Per-Arm Switching), when the switching rate is known, and when it must be inferred from data. This leads to a family of algorithms we collectively term Change-Point Thompson Sampling (CTS). We show empirical results of the algorithms in 4 artificial environments and 2 derived from real-world data, news click-through [Yahoo!, 2011] and foreign exchange data [Dukascopy, 2012], comparing them to some other bandit algorithms. On the real-world data CTS is the most effective.

* A version will appear in the Sixteenth international conference on Artificial Intelligence and Statistics (AIStats 2013)
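
For orientation, the stationary baseline that the abstract starts from can be sketched as Beta-Bernoulli Thompson Sampling. This is only a minimal illustration under stated assumptions: the function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the simulated arm probabilities are hypothetical choices, and the Bayesian change point mechanism of CTS is not reproduced here.

```python
import random


def thompson_sampling(true_probs, n_rounds, seed=0):
    """Run Beta-Bernoulli Thompson Sampling on a simulated stationary bandit.

    true_probs holds the (hypothetical) reward probability of each arm.
    Returns per-arm counts of observed successes and failures.
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms  # rewards of 1 observed per arm
    failures = [0] * n_arms   # rewards of 0 observed per arm

    for _ in range(n_rounds):
        # Draw one sample per arm from its Beta(1 + s, 1 + f) posterior.
        samples = [
            rng.betavariate(1 + successes[a], 1 + failures[a])
            for a in range(n_arms)
        ]
        # Play the arm whose sampled reward probability is largest.
        arm = max(range(n_arms), key=samples.__getitem__)
        # Simulate a Bernoulli reward and update that arm's counts.
        if rng.random() < true_probs[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures
```

Because the counts never decay, this baseline keeps trusting old evidence after a switch, which is the weakness a change point mechanism is meant to address.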

Bayesian Mixture Models for Frequent Itemset Discovery

Sep 26, 2012

Ruefei He, Jonathan Shapiro

Abstract: In binary-transaction data-mining, traditional frequent itemset mining often produces results which are not straightforward to interpret. To overcome this problem, probability models are often used to produce more compact and conclusive results, albeit with some loss of accuracy. Bayesian statistics have been widely used in the development of probability models in machine learning in recent years and these methods have many advantages, including their ability to avoid overfitting. In this paper, we develop two Bayesian mixture models with the Dirichlet distribution prior and the Dirichlet process (DP) prior to improve the previous non-Bayesian mixture model developed for transaction dataset mining. We implement the inference of both mixture models using two methods: a collapsed Gibbs sampling scheme and a variational approximation algorithm. Experiments on several benchmark problems have shown that both mixture models achieve better performance than a non-Bayesian mixture model. The variational algorithm is the faster of the two approaches while the Gibbs sampling method achieves more accurate results. The Dirichlet process mixture model can automatically grow to a proper complexity for a better approximation. Once the model is built, it can be very fast to query and run analysis on (typically 10 times faster than Eclat, as we will show in the experiment section). However, these approaches also show that mixture models underestimate the probabilities of frequent itemsets. Consequently, these models have a higher sensitivity but a lower specificity.
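
To illustrate why querying a fitted mixture model can be fast, here is a minimal sketch that assumes each mixture component treats items as independent Bernoulli variables (the abstract does not spell out the component form, so this is an assumption). The names `itemset_probability`, `weights`, and `thetas` are hypothetical. Under that assumption, the probability of an itemset is a weighted sum of per-component products, with no pass over the transaction data.

```python
from math import prod


def itemset_probability(weights, thetas, itemset):
    """Probability that all items in `itemset` occur together.

    weights[k]   : mixing weight of component k (weights sum to 1)
    thetas[k][i] : probability of item i under component k (assumed Bernoulli)
    itemset      : iterable of item indices
    """
    items = list(itemset)
    return sum(
        w * prod(theta[i] for i in items)
        for w, theta in zip(weights, thetas)
    )
```

An itemset would then be reported as frequent when this probability exceeds the chosen minimum support threshold.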

Novelty Detection on a Mobile Robot Using Habituation

Jun 02, 2000

Stephen Marsland, Ulrich Nehmzow, Jonathan Shapiro

Abstract: In this paper a novelty filter is introduced which allows a robot operating in an unstructured environment to produce a self-organised model of its surroundings and to detect deviations from the learned model. The environment is perceived using the robot's 16 sonar sensors. The algorithm produces a novelty measure for each sensor scan relative to the model it has learned. This means that it highlights stimuli which have not been previously experienced. The novelty filter proposed uses a model of habituation. Habituation is a decrement in behavioural response when a stimulus is presented repeatedly. Robot experiments are presented which demonstrate the reliable operation of the filter in a number of environments.

* 10 pages, 6 figures. In From Animals to Animats, The Sixth International Conference on Simulation of Adaptive Behaviour, Paris, 2000
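
The decrement in response described in the abstract can be sketched with a simple first-order habituation model. This is an illustration only: the abstract does not state which model the filter uses, so the update rule, the function name `habituate`, and the parameter values `y0`, `alpha`, and `tau` are all assumptions.

```python
def habituate(stimuli, y0=1.0, alpha=1.05, tau=3.33):
    """Return the response strength after each time step.

    Assumed update (Euler step of tau * dy/dt = alpha * (y0 - y) - s):
    the response y drops while a stimulus s > 0 is presented and
    recovers towards its resting value y0 when s = 0.
    """
    y = y0
    responses = []
    for s in stimuli:
        y += (alpha * (y0 - y) - s) / tau
        responses.append(y)
    return responses
```

Calling `habituate([1.0] * 10 + [0.0] * 10)` gives a response that falls during the ten repeated presentations and then climbs back once the stimulus is withdrawn.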

A Real-Time Novelty Detector for a Mobile Robot

Jun 02, 2000

Stephen Marsland, Ulrich Nehmzow, Jonathan Shapiro

Abstract: Recognising new or unusual features of an environment is an ability which is potentially very useful to a robot. This paper demonstrates an algorithm which achieves this task by learning an internal representation of 'normality' from sonar scans taken as a robot explores the environment. This model of the environment is used to evaluate the novelty of each sonar scan presented to it with relation to the model. Stimuli which have not been seen before, and therefore have more novelty, are highlighted by the filter. The filter has the ability to forget about features which have been learned, so that stimuli which are seen only rarely recover their response over time. A number of robot experiments are presented which demonstrate the operation of the filter.

* 8 pages, 6 figures. In Proceedings of EUREL European Advanced Robotics Systems Masterclass and Conference, 2000

Novelty Detection for Robot Neotaxis

Jun 02, 2000

Stephen Marsland, Ulrich Nehmzow, Jonathan Shapiro

Abstract: The ability of a robot to detect and respond to changes in its environment is potentially very useful, as it draws attention to new and potentially important features. We describe an algorithm for learning to filter out previously experienced stimuli to allow further concentration on novel features. The algorithm uses a model of habituation, a biological process which causes a decrement in response with repeated presentation. Experiments with a mobile robot are presented in which the robot detects the most novel stimulus and turns towards it ('neotaxis').

* 7 pages, 5 figures. In Proceedings of the Second International Conference on Neural Computation, 2000
