Andreas Ipp

Strategic White Paper on AI Infrastructure for Particle, Nuclear, and Astroparticle Physics: Insights from JENA and EuCAIF

Mar 18, 2025

Sascha Caron, Andreas Ipp, Gert Aarts, Gábor Bíró, Daniele Bonacorsi, Elena Cuoco, Caterina Doglioni, Tommaso Dorigo, Julián García Pardiñas, Stefano Giagu (+9 more)

Abstract: Artificial intelligence (AI) is transforming scientific research, with deep learning methods playing a central role in data analysis, simulations, and signal detection across particle, nuclear, and astroparticle physics. Within the JENA communities (ECFA, NuPECC, and APPEC) and as part of the EuCAIF initiative, AI integration is advancing steadily. However, broader adoption remains constrained by challenges such as limited computational resources, a lack of expertise, and difficulties in transitioning from research and development (R&D) to production. This white paper provides a strategic roadmap, informed by a community survey, to address these barriers. It outlines critical infrastructure requirements, prioritizes training initiatives, and proposes funding strategies to scale AI capabilities across fundamental physics over the next five years.

* 19 pages, 5 figures

Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics

Jan 09, 2025

Gert Aarts, Kenji Fukushima, Tetsuo Hatsuda, Andreas Ipp, Shuzhe Shi, Lingxiao Wang, Kai Zhou

Abstract: The integration of deep learning techniques and physics-driven designs is reshaping the way we address inverse problems, in which accurate physical properties are extracted from complex data sets. This is particularly relevant for quantum chromodynamics (QCD), the theory of strong interactions, with its inherent limitations in observational data and demanding computational approaches. This perspective highlights the advances and potential of physics-driven learning methods, focusing on predictions of physical quantities in QCD physics and drawing connections to machine learning (ML). It is shown that the fusion of ML and physics can lead to more efficient and reliable problem-solving strategies. Key ideas of ML, the methodology of embedding physics priors, and generative models as inverse modelling of physical probability distributions are introduced. Specific applications cover first-principle lattice calculations and the QCD physics of hadrons, neutron stars, and heavy-ion collisions. These examples provide a structured and concise overview of how incorporating prior knowledge such as symmetry, continuity, and equations into deep learning designs can address diverse inverse problems across different physical sciences.

* Nature Reviews Physics (2025)
* 14 pages, 5 figures, submitted version to Nat Rev Phys

Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models

Jan 09, 2025

Kristian G. Barman, Sascha Caron, Emily Sullivan, Henk W. de Regt, Roberto Ruiz de Austri, Mieke Boon, Michael Färber, Stefan Fröse, Faegheh Hasibi, Andreas Ipp (+12 more)

Abstract: This paper explores ideas and provides a potential roadmap for the development and evaluation of physics-specific large-scale AI models, which we call Large Physics Models (LPMs). These models, based on foundation models such as Large Language Models (LLMs) trained on broad data, are tailored to address the demands of physics research. LPMs can function independently or as part of an integrated framework. This framework can incorporate specialized tools, including symbolic reasoning modules for mathematical manipulations, frameworks to analyse specific experimental and simulated data, and mechanisms for synthesizing theories and scientific literature. We begin by examining whether the physics community should actively develop and refine dedicated models, rather than relying solely on commercial LLMs. We then outline how LPMs can be realized through interdisciplinary collaboration among experts in physics, computer science, and philosophy of science. To integrate these models effectively, we identify three key pillars: Development, Evaluation, and Philosophical Reflection. Development focuses on constructing models capable of processing physics texts, mathematical formulations, and diverse physical data. Evaluation assesses accuracy and reliability by testing and benchmarking. Finally, Philosophical Reflection encompasses the analysis of broader implications of LLMs in physics, including their potential to generate new scientific understanding and the novel collaboration dynamics that might arise in research. Inspired by the organizational structure of experimental collaborations in particle physics, we propose a similarly interdisciplinary and collaborative approach to building and refining Large Physics Models. This roadmap provides specific objectives, defines pathways to achieve them, and identifies challenges that must be addressed to realise physics-specific large-scale AI models.

Machine learning a fixed point action for SU(3) gauge theory with a gauge equivariant convolutional neural network

Jan 12, 2024

Kieran Holland, Andreas Ipp, David I. Müller, Urs Wenger

Abstract: Fixed point lattice actions are designed to have continuum classical properties unaffected by discretization effects and reduced lattice artifacts at the quantum level. They provide a possible way to extract continuum physics with coarser lattices, thereby making it possible to circumvent problems with critical slowing down and topological freezing toward the continuum limit. A crucial ingredient for practical applications is to find an accurate and compact parametrization of a fixed point action, since many of its properties are only implicitly defined. Here we use machine learning methods to revisit the question of how to parametrize fixed point actions. In particular, we obtain a fixed point action for four-dimensional SU(3) gauge theory using convolutional neural networks with exact gauge invariance. The large operator space allows us to find superior parametrizations compared to previous studies, a necessary first step for future Monte Carlo simulations.

* 22 pages, 15 figures, 6 tables

Applications of Lattice Gauge Equivariant Neural Networks

Dec 01, 2022

Matteo Favoni, Andreas Ipp, David I. Müller

Abstract: The introduction of relevant physical information into neural network architectures has become a widely used and successful strategy for improving their performance. In lattice gauge theories, such information can be identified with gauge symmetries, which are incorporated into the network layers of our recently proposed Lattice Gauge Equivariant Convolutional Neural Networks (L-CNNs). L-CNNs can generalize better to differently sized lattices than traditional neural networks and are by construction equivariant under lattice gauge transformations. In these proceedings, we present our progress on possible applications of L-CNNs to Wilson flow or continuous normalizing flow. Our methods are based on neural ordinary differential equations which allow us to modify link configurations in a gauge equivariant manner. For simplicity, we focus on simple toy models to test these ideas in practice.

* 8 pages, 4 figures, proceedings of XVth Quark Confinement and the Hadron Spectrum conference

Equivariance and generalization in neural networks

Dec 23, 2021

Srinath Bulusu, Matteo Favoni, Andreas Ipp, David I. Müller, Daniel Schuh

Abstract: The crucial role played by the underlying symmetries of high energy physics and lattice field theories calls for the implementation of such symmetries in the neural network architectures that are applied to the physical system under consideration. In these proceedings, we focus on the consequences of incorporating translational equivariance among the network properties, particularly in terms of performance and generalization. The benefits of equivariant networks are exemplified by studying a complex scalar field theory, on which various regression and classification tasks are examined. For a meaningful comparison, promising equivariant and non-equivariant architectures are identified by means of a systematic search. The results indicate that in most of the tasks our best equivariant architectures can perform and generalize significantly better than their non-equivariant counterparts, which applies not only to physical parameters beyond those represented in the training set, but also to different lattice sizes.

* 8 pages, 7 figures, proceedings for the 14th Quark Confinement and the Hadron Spectrum Conference (vConf2021)

Generalization capabilities of neural networks in lattice applications

Dec 23, 2021

Srinath Bulusu, Matteo Favoni, Andreas Ipp, David I. Müller, Daniel Schuh

Abstract: In recent years, the use of machine learning has become increasingly popular in the context of lattice field theories. An essential element of such theories is represented by symmetries, whose inclusion in the neural network properties can lead to high reward in terms of performance and generalizability. A fundamental symmetry that usually characterizes physical systems on a lattice with periodic boundary conditions is equivariance under spacetime translations. Here we investigate the advantages of adopting translationally equivariant neural networks instead of non-equivariant ones. The system we consider is a complex scalar field with quartic interaction on a two-dimensional lattice in the flux representation, on which the networks carry out various regression and classification tasks. Promising equivariant and non-equivariant architectures are identified with a systematic search. We demonstrate that in most of these tasks our best equivariant architectures can perform and generalize significantly better than their non-equivariant counterparts, which applies not only to physical parameters beyond those represented in the training set, but also to different lattice sizes.

* 10 pages, 7 figures, proceedings for the 38th International Symposium on Lattice Field Theory (LATTICE21)

Preserving gauge invariance in neural networks

Dec 21, 2021

Matteo Favoni, Andreas Ipp, David I. Müller, Daniel Schuh

Abstract: In these proceedings we present lattice gauge equivariant convolutional neural networks (L-CNNs) which are able to process data from lattice gauge theory simulations while exactly preserving gauge symmetry. We review aspects of the architecture and show how L-CNNs can represent a large class of gauge invariant and equivariant functions on the lattice. We compare the performance of L-CNNs and non-equivariant networks using a non-linear regression problem and demonstrate how gauge invariance is broken for non-equivariant models.

* 8 pages, 3 figures, proceedings for vConf 2021

Lattice gauge symmetry in neural networks

Nov 08, 2021

Matteo Favoni, Andreas Ipp, David I. Müller, Daniel Schuh

Abstract: We review a novel neural network architecture called lattice gauge equivariant convolutional neural networks (L-CNNs), which can be applied to generic machine learning problems in lattice gauge theory while exactly preserving gauge symmetry. We discuss the concept of gauge equivariance which we use to explicitly construct a gauge equivariant convolutional layer and a bilinear layer. The performance of L-CNNs and non-equivariant CNNs is compared using seemingly simple non-linear regression tasks, where L-CNNs demonstrate generalizability and achieve a high degree of accuracy in their predictions compared to their non-equivariant counterparts.

* 10 pages, 3 figures, proceedings for the 38th International Symposium on Lattice Field Theory (LATTICE21)
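
The notion of gauge equivariance mentioned in the abstract above can be illustrated with a small numerical check. The following is only a sketch under stated assumptions, not the L-CNN implementation from the paper: it uses a two-dimensional periodic lattice with random SU(2) link variables, and every function name (`random_su2`, `plaquettes`, `gauge_transform`) is hypothetical. It verifies that plaquettes transform locally, P(x) → Ω(x) P(x) Ω†(x), so their traces are gauge invariant, whereas the trace of a single link is not.

```python
import numpy as np

def random_su2(rng, shape):
    """Random SU(2) matrices a0*I + i*(a1*s1 + a2*s2 + a3*s3) with |a| = 1."""
    a = rng.normal(size=shape + (4,))
    a /= np.linalg.norm(a, axis=-1, keepdims=True)
    u = np.empty(shape + (2, 2), dtype=complex)
    u[..., 0, 0] = a[..., 0] + 1j * a[..., 3]
    u[..., 0, 1] = a[..., 2] + 1j * a[..., 1]
    u[..., 1, 0] = -a[..., 2] + 1j * a[..., 1]
    u[..., 1, 1] = a[..., 0] - 1j * a[..., 3]
    return u

def dagger(m):
    """Hermitian conjugate of the trailing 2x2 matrix indices."""
    return np.conj(np.swapaxes(m, -1, -2))

def plaquettes(links):
    """P(x) = U_0(x) U_1(x+e0) U_0(x+e1)^dagger U_1(x)^dagger on a periodic lattice."""
    u0, u1 = links[0], links[1]
    u1_fwd0 = np.roll(u1, -1, axis=0)  # U_1 at x + e0
    u0_fwd1 = np.roll(u0, -1, axis=1)  # U_0 at x + e1
    return u0 @ u1_fwd0 @ dagger(u0_fwd1) @ dagger(u1)

def gauge_transform(links, omega):
    """U_mu(x) -> Omega(x) U_mu(x) Omega(x + e_mu)^dagger."""
    out = np.empty_like(links)
    for mu in range(2):
        omega_fwd = np.roll(omega, -1, axis=mu)  # Omega at x + e_mu
        out[mu] = omega @ links[mu] @ dagger(omega_fwd)
    return out

def trace(m):
    return np.trace(m, axis1=-2, axis2=-1)

rng = np.random.default_rng(0)
L = 6
links = random_su2(rng, (2, L, L))   # index order: direction, x, y, matrix
omega = random_su2(rng, (L, L))      # one gauge matrix per lattice site
links_t = gauge_transform(links, omega)

p, p_t = plaquettes(links), plaquettes(links_t)
# Plaquettes transform locally (equivariance) ...
plaquette_equivariant = np.allclose(p_t, omega @ p @ dagger(omega))
# ... so their traces are unchanged (invariance) ...
plaquette_trace_invariant = np.allclose(trace(p_t), trace(p))
# ... while a quantity built from bare links changes under the transformation.
link_trace_changed = not np.allclose(trace(links_t), trace(links))
```

A network that only combines locally transforming objects such as `p` through operations that respect this transformation law stays equivariant by construction, which is the general idea the abstract refers to; a generic CNN acting on the raw link entries carries no such guarantee.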

Generalization capabilities of translationally equivariant neural networks

Apr 14, 2021

Srinath Bulusu, Matteo Favoni, Andreas Ipp, David I. Müller, Daniel Schuh

Abstract: The rising adoption of machine learning in high energy physics and lattice field theory necessitates the re-evaluation of common methods that are widely used in computer vision, which, when applied to problems in physics, can lead to significant drawbacks in terms of performance and generalizability. One particular example of this is the use of neural network architectures that do not reflect the underlying symmetries of the given physical problem. In this work, we focus on complex scalar field theory on a two-dimensional lattice and investigate the benefits of using group equivariant convolutional neural network architectures based on the translation group. For a meaningful comparison, we conduct a systematic search for equivariant and non-equivariant neural network architectures and apply them to various regression and classification tasks. We demonstrate that in most of these tasks our best equivariant architectures can perform and generalize significantly better than their non-equivariant counterparts, which applies not only to physical parameters beyond those represented in the training set, but also to different lattice sizes.

* 25 pages, 18 figures, v2: updated references
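
Translational equivariance on a periodic lattice, as discussed in the abstract above, can be checked numerically in a few lines. This is a minimal sketch, assuming a single 3x3 convolution with periodic (wrap-around) boundaries and a real-valued random field; it is not one of the architectures studied in the paper, and the names `periodic_conv2d` and `shift` are hypothetical.

```python
import numpy as np

def periodic_conv2d(field, kernel):
    """Cross-correlate a 2D field with a kernel using periodic boundaries."""
    out = np.zeros_like(field, dtype=float)
    for i in range(kernel.shape[0]):
        for j in range(kernel.shape[1]):
            # np.roll wraps around the lattice edges, so no padding is needed
            out += kernel[i, j] * np.roll(field, shift=(-i, -j), axis=(0, 1))
    return out

def shift(field, dx, dy):
    """Translate the field by (dx, dy) sites on the periodic lattice."""
    return np.roll(field, shift=(dx, dy), axis=(0, 1))

rng = np.random.default_rng(0)
kernel = rng.normal(size=(3, 3))
field = rng.normal(size=(8, 8))

# Equivariance: convolving a shifted field equals shifting the convolved field.
lhs = periodic_conv2d(shift(field, 2, 5), kernel)
rhs = shift(periodic_conv2d(field, kernel), 2, 5)
equivariant = np.allclose(lhs, rhs)

# A pointwise nonlinearity followed by a global average gives an invariant output.
def pooled(f):
    return np.maximum(periodic_conv2d(f, kernel), 0.0).mean()

invariant = np.isclose(pooled(shift(field, 2, 5)), pooled(field))

# The same kernel can be applied unchanged to a lattice of a different size.
big_output = periodic_conv2d(rng.normal(size=(16, 12)), kernel)
```

Because the kernel never refers to the lattice extent, a layer of this kind can be evaluated on lattice sizes it was not trained on, which is one plausible reason why equivariant architectures are reported to generalize across lattice sizes.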
