Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul Fuchs

chemtrain-deploy: A parallel and scalable framework for machine learning potentials in million-atom MD simulations

Jun 04, 2025

Paul Fuchs, Weilong Chen, Stephan Thaler, Julija Zavadlav

Abstract:Machine learning potentials (MLPs) have advanced rapidly and show great promise to transform molecular dynamics (MD) simulations. However, most existing software tools are tied to specific MLP architectures, lack integration with standard MD packages, or are not parallelizable across GPUs. To address these challenges, we present chemtrain-deploy, a framework that enables model-agnostic deployment of MLPs in LAMMPS. chemtrain-deploy supports any JAX-defined semi-local potential, allowing users to exploit the functionality of LAMMPS and perform large-scale MLP-based MD simulations on multiple GPUs. It achieves state-of-the-art efficiency and scales to systems containing millions of atoms. We validate its performance and scalability using graph neural network architectures, including MACE, Allegro, and PaiNN, applied to a variety of systems, such as liquid-vapor interfaces, crystalline materials, and solvated peptides. Our results highlight the practical utility of chemtrain-deploy for real-world, high-performance simulations and provide guidance for MLP architecture selection and future design.

* Source code available at: https://github.com/tummfm/chemtrain

Via

Access Paper or Ask Questions

JaxSGMC: Modular stochastic gradient MCMC in JAX

May 16, 2025

Stephan Thaler, Paul Fuchs, Ana Cukarska, Julija Zavadlav

Abstract:We present JaxSGMC, an application-agnostic library for stochastic gradient Markov chain Monte Carlo (SG-MCMC) in JAX. SG-MCMC schemes are uncertainty quantification (UQ) methods that scale to large datasets and high-dimensional models, enabling trustworthy neural network predictions via Bayesian deep learning. JaxSGMC implements several state-of-the-art SG-MCMC samplers to promote UQ in deep learning by reducing the barriers of entry for switching from stochastic optimization to SG-MCMC sampling. Additionally, JaxSGMC allows users to build custom samplers from standard SG-MCMC building blocks. Due to this modular structure, we anticipate that JaxSGMC will accelerate research into novel SG-MCMC schemes and facilitate their application across a broad range of domains.

* SoftwareX, Volume 26, 2024, 101722, ISSN 2352-7110

Via

Access Paper or Ask Questions

Learning Non-Local Molecular Interactions via Equivariant Local Representations and Charge Equilibration

Jan 31, 2025

Paul Fuchs, Michał Sanocki, Julija Zavadlav

Figure 1 for Learning Non-Local Molecular Interactions via Equivariant Local Representations and Charge Equilibration

Figure 2 for Learning Non-Local Molecular Interactions via Equivariant Local Representations and Charge Equilibration

Figure 3 for Learning Non-Local Molecular Interactions via Equivariant Local Representations and Charge Equilibration

Figure 4 for Learning Non-Local Molecular Interactions via Equivariant Local Representations and Charge Equilibration

Abstract:Graph Neural Network (GNN) potentials relying on chemical locality offer near-quantum mechanical accuracy at significantly reduced computational costs. By propagating local information to distance particles, Message-passing neural networks (MPNNs) extend the locality concept to model interactions beyond their local neighborhood. Still, this locality precludes modeling long-range effects, such as charge transfer, electrostatic interactions, and dispersion effects, which are critical to adequately describe many real-world systems. In this work, we propose the Charge Equilibration Layer for Long-range Interactions (CELLI) to address the challenging modeling of non-local interactions and the high computational cost of MPNNs. This novel architecture generalizes the fourth-generation high-dimensional neural network (4GHDNN) concept, integrating the charge equilibration (Qeq) method into a model-agnostic building block for modern equivariant GNN potentials. A series of benchmarks show that CELLI can extend the strictly local Allegro architecture to model highly non-local interactions and charge transfer. Our architecture generalizes to diverse datasets and large structures, achieving an accuracy comparable to MPNNs at about twice the computational efficiency.

Via

Access Paper or Ask Questions

chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Aug 28, 2024

Paul Fuchs, Stephan Thaler, Sebastien Röcken, Julija Zavadlav

Figure 1 for chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Figure 2 for chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Figure 3 for chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Figure 4 for chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics

Abstract:Neural Networks (NNs) are promising models for refining the accuracy of molecular dynamics, potentially opening up new fields of application. Typically trained bottom-up, atomistic NN potential models can reach first-principle accuracy, while coarse-grained implicit solvent NN potentials surpass classical continuum solvent models. However, overcoming the limitations of costly generation of accurate reference data and data inefficiency of common bottom-up training demands efficient incorporation of data from many sources. This paper introduces the framework chemtrain to learn sophisticated NN potential models through customizable training routines and advanced training algorithms. These routines can combine multiple top-down and bottom-up algorithms, e.g., to incorporate both experimental and simulation data or pre-train potentials with less costly algorithms. chemtrain provides an object-oriented high-level interface to simplify the creation of custom routines. On the lower level, chemtrain relies on JAX to compute gradients and scale the computations to use available resources. We demonstrate the simplicity and importance of combining multiple algorithms in the examples of parametrizing an all-atomistic model of titanium and a coarse-grained implicit solvent model of alanine dipeptide.

* Package source code published at http://github.com/tummfm/chemtrain

Via

Access Paper or Ask Questions