Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tony Jebara

Selectively Contextual Bandits

May 09, 2022

Claudia Roberts, Maria Dimakopoulou, Qifeng Qiao, Ashok Chandrashekhar, Tony Jebara

Figure 1 for Selectively Contextual Bandits

Figure 2 for Selectively Contextual Bandits

Abstract:Contextual bandits are widely used in industrial personalization systems. These online learning frameworks learn a treatment assignment policy in the presence of treatment effects that vary with the observed contextual features of the users. While personalization creates a rich user experience that reflect individual interests, there are benefits of a shared experience across a community that enable participation in the zeitgeist. Such benefits are emergent through network effects and are not captured in regret metrics typically employed in evaluating bandits. To balance these needs, we propose a new online learning algorithm that preserves benefits of personalization while increasing the commonality in treatments across users. Our approach selectively interpolates between a contextual bandit algorithm and a context-free multi-arm bandit and leverages the contextual information for a treatment decision only if it promises significant gains. Apart from helping users of personalization systems balance their experience between the individualized and shared, simplifying the treatment assignment policy by making it selectively reliant on the context can help improve the rate of learning in some cases. We evaluate our approach in a classification setting using public datasets and show the benefits of the hybrid policy.

Via

Access Paper or Ask Questions

Active Multitask Learning with Committees

Mar 24, 2021

Jingxi Xu, Da Tang, Tony Jebara

Figure 1 for Active Multitask Learning with Committees

Figure 2 for Active Multitask Learning with Committees

Abstract:The cost of annotating training data has traditionally been a bottleneck for supervised learning approaches. The problem is further exacerbated when supervised learning is applied to a number of correlated tasks simultaneously since the amount of labels required scales with the number of tasks. To mitigate this concern, we propose an active multitask learning algorithm that achieves knowledge transfer between tasks. The approach forms a so-called committee for each task that jointly makes decisions and directly shares data across similar tasks. Our approach reduces the number of queries needed during training while maintaining high accuracy on test data. Empirical results on benchmark datasets show significant improvements on both accuracy and number of query requests.

Via

Access Paper or Ask Questions

Learning Correlated Latent Representations with Adaptive Priors

Jul 16, 2019

Da Tang, Dawen Liang, Nicholas Ruozzi, Tony Jebara

Figure 1 for Learning Correlated Latent Representations with Adaptive Priors

Figure 2 for Learning Correlated Latent Representations with Adaptive Priors

Abstract:Variational Auto-Encoders (VAEs) have been widely applied for learning compact low-dimensional latent representations for high-dimensional data. When the correlation structure among data points is available, previous work proposed Correlated Variational Auto-Encoders (CVAEs) which employ a structured mixture model as prior and a structured variational posterior for each mixture component to enforce the learned latent representations to follow the same correlation structure. However, as we demonstrate in this paper, such a choice can not guarantee that CVAEs can capture all of the correlations. Furthermore, it prevents us from obtaining a tractable joint and marginal variational distribution. To address these issues, we propose Adaptive Correlated Variational Auto-Encoders (ACVAEs), which apply an adaptive prior distribution that can be adjusted during training, and learn a tractable joint distribution via a saddle-point optimization procedure. Its tractable form also enables further refinement with belief propagation. Experimental results on two real datasets show that ACVAEs outperform other benchmarks significantly.

* 12pages, 2 figures

Via

Access Paper or Ask Questions

A New Distribution on the Simplex with Auto-Encoding Applications

May 28, 2019

Andrew Stirn, Tony Jebara, David A Knowles

Figure 1 for A New Distribution on the Simplex with Auto-Encoding Applications

Figure 2 for A New Distribution on the Simplex with Auto-Encoding Applications

Figure 3 for A New Distribution on the Simplex with Auto-Encoding Applications

Figure 4 for A New Distribution on the Simplex with Auto-Encoding Applications

Abstract:We construct a new distribution for the simplex using the Kumaraswamy distribution and an ordered stick-breaking process. We explore and develop the theoretical properties of this new distribution and prove that it exhibits symmetry under the same conditions as the well-known Dirichlet. Like the Dirichlet, the new distribution is adept at capturing sparsity but, unlike the Dirichlet, has an exact and closed form reparameterization--making it well suited for deep variational Bayesian modeling. We demonstrate the distribution's utility in a variety of semi-supervised auto-encoding tasks. In all cases, the resulting models achieve competitive performance commensurate with their simplicity, use of explicit probability models, and abstinence from adversarial training.

* 15 pages, 5 figures, 2 tables

Via

Access Paper or Ask Questions

Correlated Variational Auto-Encoders

May 16, 2019

Da Tang, Dawen Liang, Tony Jebara, Nicholas Ruozzi

Figure 1 for Correlated Variational Auto-Encoders

Figure 2 for Correlated Variational Auto-Encoders

Figure 3 for Correlated Variational Auto-Encoders

Abstract:Variational Auto-Encoders (VAEs) are capable of learning latent representations for high dimensional data. However, due to the i.i.d. assumption, VAEs only optimize the singleton variational distributions and fail to account for the correlations between data points, which might be crucial for learning latent representations from dataset where a priori we know correlations exist. We propose Correlated Variational Auto-Encoders (CVAEs) that can take the correlation structure into consideration when learning latent representations with VAEs. CVAEs apply a prior based on the correlation structure. To address the intractability introduced by the correlated prior, we develop an approximation by average of a set of tractable lower bounds over all maximal acyclic subgraphs of the undirected correlation graph. Experimental results on matching and link prediction on public benchmark rating datasets and spectral clustering on a synthetic dataset show the effectiveness of the proposed method over baseline algorithms.

* International Conference on Machine Learning (ICML), 2019

Via

Access Paper or Ask Questions

Beta Survival Models

May 09, 2019

David Hubbard, Benoit Rostykus, Yves Raimond, Tony Jebara

Abstract:This article analyzes the problem of estimating the time until an event occurs, also known as survival modeling. We observe through substantial experiments on large real-world datasets and use-cases that populations are largely heterogeneous. Sub-populations have different mean and variance in their survival rates requiring flexible models that capture heterogeneity. We leverage a classical extension of the logistic function into the survival setting to characterize unobserved heterogeneity using the beta distribution. This yields insights into the geometry of the problem as well as efficient estimation methods for linear, tree and neural network models that adjust the beta distribution based on observed covariates. We also show that the additional information captured by the beta distribution leads to interesting ranking implications as we determine who is most-at-risk. We show theoretically that the ranking is variable as we forecast forward in time and prove that pairwise comparisons of survival remain transitive. Empirical results using large-scale datasets across two use-cases (online conversions and retention modeling), demonstrate the competitiveness of the method. The simplicity of the method and its ability to capture skew in the data makes it a viable alternative to standard techniques particularly when we are interested in the time to event and when the underlying probabilities are heterogeneous.

* 11 pages, 9 figures

Via

Access Paper or Ask Questions

Thompson Sampling for Noncompliant Bandits

Dec 03, 2018

Andrew Stirn, Tony Jebara

Figure 1 for Thompson Sampling for Noncompliant Bandits

Figure 2 for Thompson Sampling for Noncompliant Bandits

Figure 3 for Thompson Sampling for Noncompliant Bandits

Figure 4 for Thompson Sampling for Noncompliant Bandits

Abstract:Thompson sampling, a Bayesian method for balancing exploration and exploitation in bandit problems, has theoretical guarantees and exhibits strong empirical performance in many domains. Traditional Thompson sampling, however, assumes perfect compliance, where an agent's chosen action is treated as the implemented action. This article introduces a stochastic noncompliance model that relaxes this assumption. We prove that any noncompliance in a 2-armed Bernoulli bandit increases existing regret bounds. With our noncompliance model, we derive Thompson sampling variants that explicitly handle both observed and latent noncompliance. With extensive empirical analysis, we demonstrate that our algorithms either match or outperform traditional Thompson sampling in both compliant and noncompliant environments.

* 21 pages, 5 figures

Via

Access Paper or Ask Questions

Item Recommendation with Variational Autoencoders and Heterogenous Priors

Oct 07, 2018

Giannis Karamanolakis, Kevin Raji Cherian, Ananth Ravi Narayan, Jie Yuan, Da Tang, Tony Jebara

Figure 1 for Item Recommendation with Variational Autoencoders and Heterogenous Priors

Figure 2 for Item Recommendation with Variational Autoencoders and Heterogenous Priors

Figure 3 for Item Recommendation with Variational Autoencoders and Heterogenous Priors

Figure 4 for Item Recommendation with Variational Autoencoders and Heterogenous Priors

Abstract:In recent years, Variational Autoencoders (VAEs) have been shown to be highly effective in both standard collaborative filtering applications and extensions such as incorporation of implicit feedback. We extend VAEs to collaborative filtering with side information, for instance when ratings are combined with explicit text feedback from the user. Instead of using a user-agnostic standard Gaussian prior, we incorporate user-dependent priors in the latent VAE space to encode users' preferences as functions of the review text. Taking into account both the rating and the text information to represent users in this multimodal latent space is promising to improve recommendation quality. Our proposed model is shown to outperform the existing VAE models for collaborative filtering (up to 29.41% relative improvement in ranking metric) along with other baselines that incorporate both user ratings and text for item recommendation.

* Accepted for the 3rd Workshop on Deep Learning for Recommender Systems (DLRS 2018), held in conjunction with the 12th ACM Conference on Recommender Systems (RecSys 2018) in Vancouver, Canada

Via

Access Paper or Ask Questions

Subgoal Discovery for Hierarchical Dialogue Policy Learning

Sep 22, 2018

Da Tang, Xiujun Li, Jianfeng Gao, Chong Wang, Lihong Li, Tony Jebara

Figure 1 for Subgoal Discovery for Hierarchical Dialogue Policy Learning

Figure 2 for Subgoal Discovery for Hierarchical Dialogue Policy Learning

Figure 3 for Subgoal Discovery for Hierarchical Dialogue Policy Learning

Figure 4 for Subgoal Discovery for Hierarchical Dialogue Policy Learning

Abstract:Developing agents to engage in complex goal-oriented dialogues is challenging partly because the main learning signals are very sparse in long conversations. In this paper, we propose a divide-and-conquer approach that discovers and exploits the hidden structure of the task to enable efficient policy learning. First, given successful example dialogues, we propose the Subgoal Discovery Network (SDN) to divide a complex goal-oriented task into a set of simpler subgoals in an unsupervised fashion. We then use these subgoals to learn a multi-level policy by hierarchical reinforcement learning. We demonstrate our method by building a dialogue agent for the composite task of travel planning. Experiments with simulated and real users show that our approach performs competitively against a state-of-the-art method that requires human-defined subgoals. Moreover, we show that the learned subgoals are often human comprehensible.

* 11 pages, 6 figures, EMNLP 2018

Via

Access Paper or Ask Questions

A refinement of Bennett's inequality with applications to portfolio optimization

Apr 16, 2018

Tony Jebara

Figure 1 for A refinement of Bennett's inequality with applications to portfolio optimization

Figure 2 for A refinement of Bennett's inequality with applications to portfolio optimization

Figure 3 for A refinement of Bennett's inequality with applications to portfolio optimization

Figure 4 for A refinement of Bennett's inequality with applications to portfolio optimization

Abstract:A refinement of Bennett's inequality is introduced which is strictly tighter than the classical bound. The new bound establishes the convergence of the average of independent random variables to its expected value. It also carefully exploits information about the potentially heterogeneous mean, variance, and ceiling of each random variable. The bound is strictly sharper in the homogeneous setting and very often significantly sharper in the heterogeneous setting. The improved convergence rates are obtained by leveraging Lambert's W function. We apply the new bound in a portfolio optimization setting to allocate a budget across investments with heterogeneous returns.

Via

Access Paper or Ask Questions