Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maria Spiropulu

California Institute of Technology

Fast Jet Tagging with MLP-Mixers on FPGAs

Mar 05, 2025

Chang Sun, Jennifer Ngadiuba, Maurizio Pierini, Maria Spiropulu

Abstract:We explore the innovative use of MLP-Mixer models for real-time jet tagging and establish their feasibility on resource-constrained hardware like FPGAs. MLP-Mixers excel in processing sequences of jet constituents, achieving state-of-the-art performance on datasets mimicking Large Hadron Collider conditions. By using advanced optimization techniques such as High-Granularity Quantization and Distributed Arithmetic, we achieve unprecedented efficiency. These models match or surpass the accuracy of previous architectures, reduce hardware resource usage by up to 97%, double the throughput, and half the latency. Additionally, non-permutation-invariant architectures enable smart feature prioritization and efficient FPGA deployment, setting a new benchmark for machine learning in real-time data processing at particle colliders.

Via

Access Paper or Ask Questions

Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

May 01, 2024

Chang Sun, Thea K. Årrestad, Vladimir Loncar, Jennifer Ngadiuba, Maria Spiropulu

Figure 1 for Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

Figure 2 for Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

Figure 3 for Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

Figure 4 for Gradient-based Automatic Per-Weight Mixed Precision Quantization for Neural Networks On-Chip

Abstract:Model size and inference speed at deployment time, are major challenges in many deep learning applications. A promising strategy to overcome these challenges is quantization. However, a straightforward uniform quantization to very low precision can result in significant accuracy loss. Mixed-precision quantization, based on the idea that certain parts of the network can accommodate lower precision without compromising performance compared to other parts, offers a potential solution. In this work, we present High Granularity Quantization (HGQ), an innovative quantization-aware training method designed to fine-tune the per-weight and per-activation precision in an automatic way for ultra-low latency and low power neural networks which are to be deployed on FPGAs. We demonstrate that HGQ can outperform existing methods by a substantial margin, achieving resource reduction by up to a factor of 20 and latency improvement by a factor of 5 while preserving accuracy.

Via

Access Paper or Ask Questions

Fast Particle-based Anomaly Detection Algorithm with Variational Autoencoder

Nov 28, 2023

Ryan Liu, Abhijith Gandrakota, Jennifer Ngadiuba, Maria Spiropulu, Jean-Roch Vlimant

Figure 1 for Fast Particle-based Anomaly Detection Algorithm with Variational Autoencoder

Figure 2 for Fast Particle-based Anomaly Detection Algorithm with Variational Autoencoder

Figure 3 for Fast Particle-based Anomaly Detection Algorithm with Variational Autoencoder

Figure 4 for Fast Particle-based Anomaly Detection Algorithm with Variational Autoencoder

Abstract:Model-agnostic anomaly detection is one of the promising approaches in the search for new beyond the standard model physics. In this paper, we present Set-VAE, a particle-based variational autoencoder (VAE) anomaly detection algorithm. We demonstrate a 2x signal efficiency gain compared with traditional subjettiness-based jet selection. Furthermore, with an eye to the future deployment to trigger systems, we propose the CLIP-VAE, which reduces the inference-time cost of anomaly detection by using the KL-divergence loss as the anomaly score, resulting in a 2x acceleration in latency and reducing the caching requirement.

* 7 pages, 4 figures, accepted at the Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

Via

Access Paper or Ask Questions

Efficient and Robust Jet Tagging at the LHC with Knowledge Distillation

Nov 23, 2023

Ryan Liu, Abhijith Gandrakota, Jennifer Ngadiuba, Maria Spiropulu, Jean-Roch Vlimant

Abstract:The challenging environment of real-time data processing systems at the Large Hadron Collider (LHC) strictly limits the computational complexity of algorithms that can be deployed. For deep learning models, this implies that only models with low computational complexity that have weak inductive bias are feasible. To address this issue, we utilize knowledge distillation to leverage both the performance of large models and the reduced computational complexity of small ones. In this paper, we present an implementation of knowledge distillation, demonstrating an overall boost in the student models' performance for the task of classifying jets at the LHC. Furthermore, by using a teacher model with a strong inductive bias of Lorentz symmetry, we show that we can induce the same inductive bias in the student model which leads to better robustness against arbitrary Lorentz boost.

* 7 pages, 3 figures, accepted at the Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

Via

Access Paper or Ask Questions

Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders

Aug 13, 2021

Eric A. Moreno, Jean-Roch Vlimant, Maria Spiropulu, Bartlomiej Borzyszkowski, Maurizio Pierini

Figure 1 for Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders

Figure 2 for Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders

Figure 3 for Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders

Figure 4 for Source-Agnostic Gravitational-Wave Detection with Recurrent Autoencoders

Abstract:We present an application of anomaly detection techniques based on deep recurrent autoencoders to the problem of detecting gravitational wave signals in laser interferometers. Trained on noise data, this class of algorithms could detect signals using an unsupervised strategy, i.e., without targeting a specific kind of source. We develop a custom architecture to analyze the data from two interferometers. We compare the obtained performance to that obtained with other autoencoder architectures and with a convolutional classifier. The unsupervised nature of the proposed strategy comes with a cost in terms of accuracy, when compared to more traditional supervised techniques. On the other hand, there is a qualitative gain in generalizing the experimental sensitivity beyond the ensemble of pre-computed signal templates. The recurrent autoencoder outperforms other autoencoders based on different architectures. The class of recurrent autoencoders presented in this paper could complement the search strategy employed for gravitational wave detection and extend the reach of the ongoing detection campaigns.

* 16 pages, 6 figures

Via

Access Paper or Ask Questions

Physics and Computing Performance of the Exa.TrkX TrackML Pipeline

Mar 11, 2021

Xiangyang Ju, Daniel Murnane, Paolo Calafiura, Nicholas Choma, Sean Conlon, Steve Farrell, Yaoyuan Xu, Maria Spiropulu, Jean-Roch Vlimant, Adam Aurisano(+14 more)

Figure 1 for Physics and Computing Performance of the Exa.TrkX TrackML Pipeline

Figure 2 for Physics and Computing Performance of the Exa.TrkX TrackML Pipeline

Figure 3 for Physics and Computing Performance of the Exa.TrkX TrackML Pipeline

Figure 4 for Physics and Computing Performance of the Exa.TrkX TrackML Pipeline

Abstract:The Exa.TrkX project has applied geometric learning concepts such as metric learning and graph neural networks to HEP particle tracking. The Exa.TrkX tracking pipeline clusters detector measurements to form track candidates and filters them. The pipeline, originally developed using the TrackML dataset (a simulation of an LHC-like tracking detector), has been demonstrated on various detectors, including the DUNE LArTPC and the CMS High-Granularity Calorimeter. This paper documents new developments needed to study the physics and computing performance of the Exa.TrkX pipeline on the full TrackML dataset, a first step towards validating the pipeline using ATLAS and CMS data. The pipeline achieves tracking efficiency and purity similar to production tracking algorithms. Crucially for future HEP applications, the pipeline benefits significantly from GPU acceleration, and its computational requirements scale close to linearly with the number of particles in the event.

Via

Access Paper or Ask Questions

MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Jan 21, 2021

Joosep Pata, Javier Duarte, Jean-Roch Vlimant, Maurizio Pierini, Maria Spiropulu

Figure 1 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 2 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 3 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Figure 4 for MLPF: Efficient machine-learned particle-flow reconstruction using graph neural networks

Abstract:In general-purpose particle detectors, the particle flow algorithm may be used to reconstruct a coherent particle-level view of the event by combining information from the calorimeters and the trackers, significantly improving the detector resolution for jets and the missing transverse momentum. In view of the planned high-luminosity upgrade of the CERN Large Hadron Collider, it is necessary to revisit existing reconstruction algorithms and ensure that both the physics and computational performance are sufficient in a high-pileup environment. Recent developments in machine learning may offer a prospect for efficient event reconstruction based on parametric models. We introduce MLPF, an end-to-end trainable machine-learned particle flow algorithm for reconstructing particle flow candidates based on parallelizable, computationally efficient, scalable graph neural networks and a multi-task objective. We report the physics and computational performance of the MLPF algorithm on on a synthetic dataset of ttbar events in HL-LHC running conditions, including the simulation of multiple interaction effects, and discuss potential next steps and considerations towards ML-based reconstruction in a general purpose particle detector.

* 12 pages, 10 figures

Via

Access Paper or Ask Questions

Track Seeding and Labelling with Embedded-space Graph Neural Networks

Jun 30, 2020

Nicholas Choma, Daniel Murnane, Xiangyang Ju, Paolo Calafiura, Sean Conlon, Steven Farrell, Prabhat, Giuseppe Cerati, Lindsey Gray, Thomas Klijnsma(+9 more)

Figure 1 for Track Seeding and Labelling with Embedded-space Graph Neural Networks

Figure 2 for Track Seeding and Labelling with Embedded-space Graph Neural Networks

Figure 3 for Track Seeding and Labelling with Embedded-space Graph Neural Networks

Figure 4 for Track Seeding and Labelling with Embedded-space Graph Neural Networks

Abstract:To address the unprecedented scale of HL-LHC data, the Exa.TrkX project is investigating a variety of machine learning approaches to particle track reconstruction. The most promising of these solutions, graph neural networks (GNN), process the event as a graph that connects track measurements (detector hits corresponding to nodes) with candidate line segments between the hits (corresponding to edges). Detector information can be associated with nodes and edges, enabling a GNN to propagate the embedded parameters around the graph and predict node-, edge- and graph-level observables. Previously, message-passing GNNs have shown success in predicting doublet likelihood, and we here report updates on the state-of-the-art architectures for this task. In addition, the Exa.TrkX project has investigated innovations in both graph construction, and embedded representations, in an effort to achieve fully learned end-to-end track finding. Hence, we present a suite of extensions to the original model, with encouraging results for hitgraph classification. In addition, we explore increased performance by constructing graphs from learned representations which contain non-linear metric structure, allowing for efficient clustering and neighborhood queries of data points. We demonstrate how this framework fits in with both traditional clustering pipelines, and GNN approaches. The embedded graphs feed into high-accuracy doublet and triplet classifiers, or can be used as an end-to-end track classifier by clustering in an embedded space. A set of post-processing methods improve performance with knowledge of the detector physics. Finally, we present numerical results on the TrackML particle tracking challenge dataset, where our framework shows favorable results in both seeding and track finding.

* Proceedings submission in Connecting the Dots Workshop 2020, 10 pages

Via

Access Paper or Ask Questions

Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics

Jan 08, 2020

Dawit Belayneh, Federico Carminati, Amir Farbin, Benjamin Hooberman, Gulrukh Khattak, Miaoyuan Liu, Junze Liu, Dominick Olivito, Vitória Barin Pacela, Maurizio Pierini(+6 more)

Figure 1 for Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics

Figure 2 for Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics

Figure 3 for Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics

Figure 4 for Calorimetry with Deep Learning: Particle Simulation and Reconstruction for Collider Physics

Abstract:Using detailed simulations of calorimeter showers as training data, we investigate the use of deep learning algorithms for the simulation and reconstruction of particles produced in high-energy physics collisions. We train neural networks on shower data at the calorimeter-cell level, and show significant improvements for simulation and reconstruction when using these networks compared to methods which rely on currently-used state-of-the-art algorithms. We define two models: an end-to-end reconstruction network which performs simultaneous particle identification and energy regression of particles when given calorimeter shower data, and a generative network which can provide reasonable modeling of calorimeter showers for different particle types at specified angles and energies. We investigate the optimization of our models with hyperparameter scans. Furthermore, we demonstrate the applicability of the reconstruction model to shower inputs from other detector geometries, specifically ATLAS-like and CMS-like geometries. These networks can serve as fast and computationally light methods for particle shower simulation and reconstruction for current and future experiments at particle colliders.

* 26 pages, 38 figures. Corrected typos and added additional references in v2. Extended Acknowledgements section in v3

Via

Access Paper or Ask Questions

Quantum adiabatic machine learning with zooming

Aug 13, 2019

Alexander Zlokapa, Alex Mott, Joshua Job, Jean-Roch Vlimant, Daniel Lidar, Maria Spiropulu

Figure 1 for Quantum adiabatic machine learning with zooming

Figure 2 for Quantum adiabatic machine learning with zooming

Figure 3 for Quantum adiabatic machine learning with zooming

Figure 4 for Quantum adiabatic machine learning with zooming

Abstract:Recent work has shown that quantum annealing for machine learning (QAML) can perform comparably to state-of-the-art machine learning methods with a specific application to Higgs boson classification. We propose a variant algorithm (QAML-Z) that iteratively zooms in on a region of the energy surface by mapping the problem to a continuous space and sequentially applying quantum annealing to an augmented set of weak classifiers. Results on a programmable quantum annealer show that QAML-Z increases the performance difference between QAML and classical deep neural networks by over 40% as measured by area under the ROC curve for small training set sizes. Furthermore, QAML-Z reduces the advantage of deep neural networks over QAML for large training sets by around 50%, indicating that QAML-Z produces stronger classifiers that retain the robustness of the original QAML algorithm.

* 7 pages, 4 figures

Via

Access Paper or Ask Questions