Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andreas Munk

Assisting the Adversary to Improve GAN Training

Oct 03, 2020

Andreas Munk, William Harvey, Frank Wood

Figure 1 for Assisting the Adversary to Improve GAN Training

Figure 2 for Assisting the Adversary to Improve GAN Training

Figure 3 for Assisting the Adversary to Improve GAN Training

Figure 4 for Assisting the Adversary to Improve GAN Training

Abstract:We propose a method for improved training of generative adversarial networks (GANs). Some of the most popular methods for improving the stability and performance of GANs involve constraining or regularizing the discriminator. Our method, on the other hand, involves regularizing the generator. It can be used alongside existing approaches to GAN training and is simple and straightforward to implement. Our method is motivated by a common mismatch between theoretical analysis and practice: analysis often assumes that the discriminator reaches its optimum on each iteration. In practice, this is essentially never true, often leading to poor gradient estimates for the generator. To address this, we introduce the Adversary's Assistant (AdvAs). It is a theoretically motivated penalty imposed on the generator based on the norm of the gradients used to train the discriminator. This encourages the generator to move towards points where the discriminator is optimal. We demonstrate the effect of applying AdvAs to several GAN objectives, datasets and network architectures. The results indicate a reduction in the mismatch between theory and practice and that AdvAs can lead to improvement of GAN training, as measured by FID scores.

Via

Access Paper or Ask Questions

Amortized Rejection Sampling in Universal Probabilistic Programming

Nov 30, 2019

Saeid Naderiparizi, Adam Ścibior, Andreas Munk, Mehrdad Ghadiri, Atılım Güneş Baydin, Bradley Gram-Hansen, Christian Schroeder de Witt, Robert Zinkov, Philip H. S. Torr, Tom Rainforth(+2 more)

Figure 1 for Amortized Rejection Sampling in Universal Probabilistic Programming

Figure 2 for Amortized Rejection Sampling in Universal Probabilistic Programming

Figure 3 for Amortized Rejection Sampling in Universal Probabilistic Programming

Figure 4 for Amortized Rejection Sampling in Universal Probabilistic Programming

Abstract:Existing approaches to amortized inference in probabilistic programs with unbounded loops can produce estimators with infinite variance. An instance of this is importance sampling inference in programs that explicitly include rejection sampling as part of the user-programmed generative procedure. In this paper we develop a new and efficient amortized importance sampling estimator. We prove finite variance of our estimator and empirically demonstrate our method's correctness and efficiency compared to existing alternatives on generative programs containing rejection sampling loops and discuss how to implement our method in a generic probabilistic programming framework.

Via

Access Paper or Ask Questions

Attention for Inference Compilation

Oct 25, 2019

William Harvey, Andreas Munk, Atılım Güneş Baydin, Alexander Bergholm, Frank Wood

Figure 1 for Attention for Inference Compilation

Figure 2 for Attention for Inference Compilation

Figure 3 for Attention for Inference Compilation

Figure 4 for Attention for Inference Compilation

Abstract:We present a new approach to automatic amortized inference in universal probabilistic programs which improves performance compared to current methods. Our approach is a variation of inference compilation (IC) which leverages deep neural networks to approximate a posterior distribution over latent variables in a probabilistic program. A challenge with existing IC network architectures is that they can fail to model long-range dependencies between latent variables. To address this, we introduce an attention mechanism that attends to the most salient variables previously sampled in the execution of a probabilistic program. We demonstrate that the addition of attention allows the proposal distributions to better match the true posterior, enhancing inference about latent variables in simulators.

Via

Access Paper or Ask Questions

Deep Probabilistic Surrogate Networks for Universal Simulator Approximation

Oct 25, 2019

Andreas Munk, Adam Ścibior, Atılım Güneş Baydin, Andrew Stewart, Goran Fernlund, Anoush Poursartip, Frank Wood

Figure 1 for Deep Probabilistic Surrogate Networks for Universal Simulator Approximation

Figure 2 for Deep Probabilistic Surrogate Networks for Universal Simulator Approximation

Figure 3 for Deep Probabilistic Surrogate Networks for Universal Simulator Approximation

Figure 4 for Deep Probabilistic Surrogate Networks for Universal Simulator Approximation

Abstract:We present a framework for automatically structuring and training fast, approximate, deep neural surrogates of existing stochastic simulators. Unlike traditional approaches to surrogate modeling, our surrogates retain the interpretable structure of the reference simulators. The particular way we achieve this allows us to replace the reference simulator with the surrogate when undertaking amortized inference in the probabilistic programming sense. The fidelity and speed of our surrogates allow for not only faster "forward" stochastic simulation but also for accurate and substantially faster inference. We support these claims via experiments that involve a commercial composite-materials curing simulator. Employing our surrogate modeling technique makes inference an order of magnitude faster, opening up the possibility of doing simulator-based, non-invasive, just-in-time parts quality testing; in this case inferring safety-critical latent internal temperature profiles of composite materials undergoing curing from surface temperature profile measurements.

Via

Access Paper or Ask Questions

Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Jul 08, 2019

Atılım Güneş Baydin, Lei Shao, Wahid Bhimji, Lukas Heinrich, Lawrence Meadows, Jialin Liu, Andreas Munk, Saeid Naderiparizi, Bradley Gram-Hansen, Gilles Louppe(+7 more)

Figure 1 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 2 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 3 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Figure 4 for Etalumis: Bringing Probabilistic Programming to Scientific Simulators at Scale

Abstract:Probabilistic programming languages (PPLs) are receiving widespread attention for performing Bayesian inference in complex generative models. However, applications to science remain limited because of the impracticability of rewriting complex scientific simulators in a PPL, the computational cost of inference, and the lack of scalable implementations. To address these, we present a novel PPL framework that couples directly to existing scientific simulators through a cross-platform probabilistic execution protocol and provides Markov chain Monte Carlo (MCMC) and deep-learning-based inference compilation (IC) engines for tractable inference. To guide IC inference, we perform distributed training of a dynamic 3DCNN--LSTM architecture with a PyTorch-MPI-based framework on 1,024 32-core CPU nodes of the Cori supercomputer with a global minibatch size of 128k: achieving a performance of 450 Tflop/s through enhancements to PyTorch. We demonstrate a Large Hadron Collider (LHC) use-case with the C++ Sherpa simulator and achieve the largest-scale posterior inference in a Turing-complete PPL.

* 14 pages, 8 figures

Via

Access Paper or Ask Questions