Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sanjay Thakur

Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Oct 23, 2019

Sanjay Thakur, Herke Van Hoof, Gunshi Gupta, David Meger

Figure 1 for Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Figure 2 for Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Figure 3 for Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Figure 4 for Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

Abstract:Neural Network based controllers hold enormous potential to learn complex, high-dimensional functions. However, they are prone to overfitting and unwarranted extrapolations. PAC Bayes is a generalized framework which is more resistant to overfitting and that yields performance bounds that hold with arbitrarily high probability even on the unjustified extrapolations. However, optimizing to learn such a function and a bound is intractable for complex tasks. In this work, we propose a method to simultaneously learn such a function and estimate performance bounds that scale organically to high-dimensions, non-linear environments without making any explicit assumptions about the environment. We build our approach on a parallel that we draw between the formulations called ELBO and PAC Bayes when the risk metric is negative log likelihood. Through our experiments on multiple high dimensional MuJoCo locomotion tasks, we validate the correctness of our theory, show its ability to generalize better, and investigate the factors that are important for its learning. The code for all the experiments is available at https://bit.ly/2qv0JjA.

Via

Access Paper or Ask Questions

Time2Vec: Learning a Vector Representation of Time

Jul 11, 2019

Seyed Mehran Kazemi, Rishab Goel, Sepehr Eghbali, Janahan Ramanan, Jaspreet Sahota, Sanjay Thakur, Stella Wu, Cathal Smyth, Pascal Poupart, Marcus Brubaker

Figure 1 for Time2Vec: Learning a Vector Representation of Time

Figure 2 for Time2Vec: Learning a Vector Representation of Time

Figure 3 for Time2Vec: Learning a Vector Representation of Time

Figure 4 for Time2Vec: Learning a Vector Representation of Time

Abstract:Time is an important feature in many applications involving events that occur synchronously and/or asynchronously. To effectively consume time information, recent studies have focused on designing new architectures. In this paper, we take an orthogonal but complementary approach by providing a model-agnostic vector representation for time, called Time2Vec, that can be easily imported into many existing and future architectures and improve their performances. We show on a range of models and problems that replacing the notion of time with its Time2Vec representation improves the performance of the final model.

Via

Access Paper or Ask Questions

Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Mar 13, 2019

Sanjay Thakur, Herke van Hoof, Juan Camilo Gamboa Higuera, Doina Precup, David Meger

Figure 1 for Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Figure 2 for Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Figure 3 for Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Figure 4 for Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks

Abstract:Diversity of environments is a key challenge that causes learned robotic controllers to fail due to the discrepancies between the training and evaluation conditions. Training from demonstrations in various conditions can mitigate---but not completely prevent---such failures. Learned controllers such as neural networks typically do not have a notion of uncertainty that allows to diagnose an offset between training and testing conditions, and potentially intervene. In this work, we propose to use Bayesian Neural Networks, which have such a notion of uncertainty. We show that uncertainty can be leveraged to consistently detect situations in high-dimensional simulated and real robotic domains in which the performance of the learned controller would be sub-par. Also, we show that such an uncertainty based solution allows making an informed decision about when to invoke a fallback strategy. One fallback strategy is to request more data. We empirically show that providing data only when requested results in increased data-efficiency.

* Copyright 20XX IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Via

Access Paper or Ask Questions