Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Juan Pavez

Effective LHC measurements with matrix elements and machine learning

Jun 04, 2019

Johann Brehmer, Kyle Cranmer, Irina Espejo, Felix Kling, Gilles Louppe, Juan Pavez

Figure 1 for Effective LHC measurements with matrix elements and machine learning

Abstract:One major challenge for the legacy measurements at the LHC is that the likelihood function is not tractable when the collected data is high-dimensional and the detector response has to be modeled. We review how different analysis strategies solve this issue, including the traditional histogram approach used in most particle physics analyses, the Matrix Element Method, Optimal Observables, and modern techniques based on neural density estimation. We then discuss powerful new inference methods that use a combination of matrix element information and machine learning to accurately estimate the likelihood function. The MadMiner package automates all necessary data-processing steps. In first studies we find that these new techniques have the potential to substantially improve the sensitivity of the LHC legacy measurements.

* Keynote at the 19th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2019)

Via

Access Paper or Ask Questions

Mining gold from implicit models to improve likelihood-free inference

Oct 09, 2018

Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

Figure 1 for Mining gold from implicit models to improve likelihood-free inference

Figure 2 for Mining gold from implicit models to improve likelihood-free inference

Figure 3 for Mining gold from implicit models to improve likelihood-free inference

Figure 4 for Mining gold from implicit models to improve likelihood-free inference

Abstract:Simulators often provide the best description of real-world phenomena. However, they also lead to challenging inverse problems because the density they implicitly define is often intractable. We present a new suite of simulation-based inference techniques that go beyond the traditional Approximate Bayesian Computation approach, which struggles in a high-dimensional setting, and extend methods that use surrogate models based on neural networks. We show that additional information, such as the joint likelihood ratio and the joint score, can often be extracted from simulators and used to augment the training data for these surrogate models. Finally, we demonstrate that these new techniques are more sample efficient and provide higher-fidelity inference than traditional methods.

* Code available at https://github.com/johannbrehmer/simulator-mining-example . v2: Fixed typos. v3: Expanded discussion, added Lotka-Volterra example

Via

Access Paper or Ask Questions

Likelihood-free inference with an improved cross-entropy estimator

Aug 02, 2018

Markus Stoye, Johann Brehmer, Gilles Louppe, Juan Pavez, Kyle Cranmer

Figure 1 for Likelihood-free inference with an improved cross-entropy estimator

Figure 2 for Likelihood-free inference with an improved cross-entropy estimator

Figure 3 for Likelihood-free inference with an improved cross-entropy estimator

Figure 4 for Likelihood-free inference with an improved cross-entropy estimator

Abstract:We extend recent work (Brehmer, et. al., 2018) that use neural networks as surrogate models for likelihood-free inference. As in the previous work, we exploit the fact that the joint likelihood ratio and joint score, conditioned on both observed and latent variables, can often be extracted from an implicit generative model or simulator to augment the training data for these surrogate models. We show how this augmented training data can be used to provide a new cross-entropy estimator, which provides improved sample efficiency compared to previous loss functions exploiting this augmented training data.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

A Guide to Constraining Effective Field Theories with Machine Learning

Jul 26, 2018

Johann Brehmer, Kyle Cranmer, Gilles Louppe, Juan Pavez

Figure 1 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 2 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 3 for A Guide to Constraining Effective Field Theories with Machine Learning

Figure 4 for A Guide to Constraining Effective Field Theories with Machine Learning

Abstract:We develop, discuss, and compare several inference techniques to constrain theory parameters in collider experiments. By harnessing the latent-space structure of particle physics processes, we extract extra information from the simulator. This augmented data can be used to train neural networks that precisely estimate the likelihood ratio. The new methods scale well to many observables and high-dimensional parameter spaces, do not require any approximations of the parton shower and detector response, and can be evaluated in microseconds. Using weak-boson-fusion Higgs production as an example process, we compare the performance of several techniques. The best results are found for likelihood ratio estimators trained with extra information about the score, the gradient of the log likelihood function with respect to the theory parameters. The score also provides sufficient statistics that contain all the information needed for inference in the neighborhood of the Standard Model. These methods enable us to put significantly stronger bounds on effective dimension-six operators than the traditional approach based on histograms. They also outperform generic machine learning methods that do not make use of the particle physics structure, demonstrating their potential to substantially improve the new physics reach of the LHC legacy results.

* Phys. Rev. D 98, 052004 (2018)
* See also the companion publication "Constraining Effective Field Theories with Machine Learning" at arXiv:1805.00013, a brief introduction presenting the key ideas. The code for these studies is available at https://github.com/johannbrehmer/higgs_inference . v2: Added references. v3: Improved description of algorithms, added references. v4: Clarified text, added references

Via

Access Paper or Ask Questions

Constraining Effective Field Theories with Machine Learning

Jul 26, 2018

Johann Brehmer, Kyle Cranmer, Gilles Louppe, Juan Pavez

Figure 1 for Constraining Effective Field Theories with Machine Learning

Figure 2 for Constraining Effective Field Theories with Machine Learning

Abstract:We present powerful new analysis techniques to constrain effective field theories at the LHC. By leveraging the structure of particle physics processes, we extract extra information from Monte-Carlo simulations, which can be used to train neural network models that estimate the likelihood ratio. These methods scale well to processes with many observables and theory parameters, do not require any approximations of the parton shower or detector response, and can be evaluated in microseconds. We show that they allow us to put significantly stronger bounds on dimension-six operators than existing methods, demonstrating their potential to improve the precision of the LHC legacy constraints.

* Phys. Rev. Lett. 121, 111801 (2018)
* See also the companion publication "A Guide to Constraining Effective Field Theories with Machine Learning" at arXiv:1805.00020, an in-depth analysis of machine learning techniques for LHC measurements. The code for these studies is available at https://github.com/johannbrehmer/higgs_inference . v2: New schematic figure explaining the new algorithms, added references. v3, v4: Added references

Via

Access Paper or Ask Questions

Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

May 23, 2018

Juan Pavez, Héctor Allende, Héctor Allende-Cid

Figure 1 for Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

Figure 2 for Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

Figure 3 for Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

Figure 4 for Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

Abstract:During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention mechanisms. These architectures, however, lack of more complex reasoning mechanisms that could allow, for instance, relational reasoning. Relation Networks (RNs), on the other hand, have shown outstanding results in relational reasoning tasks. Unfortunately, their computational cost grows quadratically with the number of memories, something prohibitive for larger problems. To solve these issues, we introduce the Working Memory Network, a MemNN architecture with a novel working memory storage and reasoning module. Our model retains the relational reasoning abilities of the RN while reducing its computational complexity from quadratic to linear. We tested our model on the text QA dataset bAbI and the visual QA dataset NLVR. In the jointly trained bAbI-10k, we set a new state-of-the-art, achieving a mean error of less than 0.5%. Moreover, a simple ensemble of two of our models solves all 20 tasks in the joint version of the benchmark.

* Published in ACL 2018. The code for these studies is available at https://github.com/jgpavez/Working-Memory-Networks

Via

Access Paper or Ask Questions

Approximating Likelihood Ratios with Calibrated Discriminative Classifiers

Mar 18, 2016

Kyle Cranmer, Juan Pavez, Gilles Louppe

Figure 1 for Approximating Likelihood Ratios with Calibrated Discriminative Classifiers

Figure 2 for Approximating Likelihood Ratios with Calibrated Discriminative Classifiers

Figure 3 for Approximating Likelihood Ratios with Calibrated Discriminative Classifiers

Figure 4 for Approximating Likelihood Ratios with Calibrated Discriminative Classifiers

Abstract:In many fields of science, generalized likelihood ratio tests are established tools for statistical inference. At the same time, it has become increasingly common that a simulator (or generative model) is used to describe complex processes that tie parameters $\theta$ of an underlying theory and measurement apparatus to high-dimensional observations $\mathbf{x}\in \mathbb{R}^p$. However, simulator often do not provide a way to evaluate the likelihood function for a given observation $\mathbf{x}$, which motivates a new class of likelihood-free inference algorithms. In this paper, we show that likelihood ratios are invariant under a specific class of dimensionality reduction maps $\mathbb{R}^p \mapsto \mathbb{R}$. As a direct consequence, we show that discriminative classifiers can be used to approximate the generalized likelihood ratio statistic when only a generative model for the data is available. This leads to a new machine learning-based approach to likelihood-free inference that is complementary to Approximate Bayesian Computation, and which does not require a prior on the model parameters. Experimental results on artificial problems with known exact likelihoods illustrate the potential of the proposed method.

* 35 pages, 5 figures

Via

Access Paper or Ask Questions