Nilin Abrahamsen

A Kaczmarz-inspired approach to accelerate the optimization of neural network wavefunctions

Jan 18, 2024

Gil Goldshlager, Nilin Abrahamsen, Lin Lin

Abstract: Neural network wavefunctions optimized using the variational Monte Carlo method have been shown to produce highly accurate results for the electronic structure of atoms and small molecules, but the high cost of optimizing such wavefunctions prevents their application to larger systems. We propose the Subsampled Projected-Increment Natural Gradient Descent (SPRING) optimizer to reduce this bottleneck. SPRING combines ideas from the recently introduced minimum-step stochastic reconfiguration optimizer (MinSR) and the classical randomized Kaczmarz method for solving linear least-squares problems. We demonstrate that SPRING outperforms both MinSR and the popular Kronecker-Factored Approximate Curvature method (KFAC) across a number of small atoms and molecules, given that the learning rates of all methods are optimally tuned. For example, on the oxygen atom, SPRING attains chemical accuracy after forty thousand training iterations, whereas both MinSR and KFAC fail to do so even after one hundred thousand iterations.
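For readers unfamiliar with the classical randomized Kaczmarz method named in the abstract, the following is a minimal sketch of that textbook method only. It is not SPRING and not the authors' code. The function name, the small test system, and the iteration count are assumptions chosen for illustration.

```python
import random


def randomized_kaczmarz(A, b, iters=2000, seed=0):
    """Classical randomized Kaczmarz for a consistent linear system A x = b.

    A is a list of rows (lists of floats) and b is a list of floats.
    Each step samples one row with probability proportional to its
    squared norm and projects the iterate onto that row's hyperplane.
    """
    rng = random.Random(seed)
    x = [0.0] * len(A[0])
    sq_norms = [sum(a * a for a in row) for row in A]
    for _ in range(iters):
        # Sample row i with probability ||a_i||^2 / ||A||_F^2.
        i = rng.choices(range(len(A)), weights=sq_norms, k=1)[0]
        row = A[i]
        # Project x onto the hyperplane {y : <a_i, y> = b_i}.
        residual = b[i] - sum(a * xj for a, xj in zip(row, x))
        step = residual / sq_norms[i]
        x = [xj + step * a for xj, a in zip(x, row)]
    return x


# Hypothetical overdetermined but consistent 3x2 system.
A = [[2.0, 1.0], [1.0, 3.0], [1.0, -1.0]]
x_true = [1.0, -2.0]
b = [sum(a * t for a, t in zip(row, x_true)) for row in A]
x_hat = randomized_kaczmarz(A, b)
```

Each iteration touches a single row of the system, which is the property that makes the method attractive when only a subsample of the equations is available at a time.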

Inventing painting styles through natural inspiration

May 19, 2023

Nilin Abrahamsen, Jiahao Yao

Abstract: We propose two procedures to create painting styles using models trained only on natural images, providing objective proof that the model is not plagiarizing human art styles. In the first procedure we use the inductive bias from the artistic medium to achieve creative expression. Abstraction is achieved by using a reconstruction loss. The second procedure uses an additional natural image as inspiration to create a new style. These two procedures make it possible to invent new painting styles with no artistic training data. We believe that our approach can help pave the way for the ethical employment of generative AI in art, without infringing upon the originality of human creators.

Convergence of stochastic gradient descent on parameterized sphere with applications to variational Monte Carlo simulation

Mar 24, 2023

Nilin Abrahamsen, Zhiyan Ding, Gil Goldshlager, Lin Lin

Abstract: We analyze stochastic gradient descent (SGD) type algorithms on a high-dimensional sphere which is parameterized by a neural network up to a normalization constant. We provide a new algorithm for the setting of supervised learning and show its convergence both theoretically and numerically. We also provide the first proof of convergence for the unsupervised setting, which corresponds to the widely used variational Monte Carlo (VMC) method in quantum physics.

* Added reference to recent (concurrent) proof of VMC convergence [Li et al. arXiv:2303.10599]

Anti-symmetric Barron functions and their approximation with sums of determinants

Mar 22, 2023

Nilin Abrahamsen, Lin Lin

Abstract: A fundamental problem in quantum physics is to encode functions that are completely anti-symmetric under permutations of identical particles. The Barron space consists of high-dimensional functions that can be parameterized by infinite neural networks with one hidden layer. By explicitly encoding the anti-symmetric structure, we prove that the anti-symmetric functions which belong to the Barron space can be efficiently approximated with sums of determinants. This yields a factorial improvement in complexity compared to the standard representation in the Barron space and provides a theoretical explanation for the effectiveness of determinant-based architectures in ab-initio quantum chemistry.

Taming the sign problem of explicitly antisymmetrized neural networks via rough activation functions

May 24, 2022

Nilin Abrahamsen, Lin Lin

Abstract: Explicit antisymmetrization of a two-layer neural network is a potential candidate for a universal function approximator for generic antisymmetric functions, which are ubiquitous in quantum physics. However, this strategy suffers from a sign problem, namely, due to near exact cancellation of positive and negative contributions, the magnitude of the antisymmetrized function may be significantly smaller than that before antisymmetrization. We prove that the severity of the sign problem is directly related to the smoothness of the activation function. For smooth activation functions (e.g., $\tanh$), the sign problem of the explicitly antisymmetrized two-layer neural network deteriorates super-polynomially with respect to the system size. On the other hand, for rough activation functions (e.g., ReLU), the deterioration rate of the sign problem can be tamed to be at most polynomial with respect to the system size. Finally, the cost of a direct implementation of an antisymmetrized two-layer neural network scales factorially with respect to the system size. We describe an efficient algorithm for approximate evaluation of such a network, whose cost scales polynomially with respect to the system size and inverse precision.
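To make the factorial cost of direct antisymmetrization concrete, here is a minimal sketch of the naive construction: a signed sum of the function over all permutations of its inputs. It is not the efficient approximate algorithm described in the abstract. The single-neuron network `two_layer`, its weights, the scalar particle coordinates, and the 1/n! normalization are all assumptions made for illustration.

```python
import itertools
import math


def permutation_sign(perm):
    """Sign of a permutation (tuple of indices), computed by counting inversions."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def antisymmetrize(f):
    """Return the explicit antisymmetrization of f over its n inputs.

    The loop visits all n! permutations, which is the factorial cost
    of a direct implementation.
    """
    def antisymmetrized(xs):
        n = len(xs)
        total = 0.0
        for perm in itertools.permutations(range(n)):
            total += permutation_sign(perm) * f([xs[p] for p in perm])
        return total / math.factorial(n)  # normalization is a convention assumed here
    return antisymmetrized


def two_layer(xs, weights=(0.7, -1.3, 0.4), activation=math.tanh):
    # Hypothetical one-neuron, two-layer network on scalar coordinates.
    return activation(sum(w * x for w, x in zip(weights, xs)))


af = antisymmetrize(two_layer)
xs = [0.2, -0.5, 1.1]
swapped = [-0.5, 0.2, 1.1]          # exchange the first two particles
value, value_swapped = af(xs), af(swapped)
value_coincident = af([0.3, 0.3, 1.0])  # two coinciding coordinates
```

Swapping two inputs flips the sign of the result, and the result vanishes when two inputs coincide. Comparing `abs(value)` with `abs(two_layer(xs))` for a given input gives a rough feel for the cancellation the abstract calls the sign problem.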

Sparse Gaussian ICA

Apr 03, 2018

Nilin Abrahamsen, Philippe Rigollet

Abstract: Independent component analysis (ICA) is a cornerstone of modern data analysis. Its goal is to recover a latent random vector S with independent components from samples of X=AS where A is an unknown mixing matrix. Critically, all existing methods for ICA rely on and exploit strongly the assumption that S is not Gaussian as otherwise A becomes unidentifiable. In this paper, we show that in fact one can handle the case of Gaussian components by imposing structure on the matrix A. Specifically, we assume that A is sparse and generic in the sense that it is generated from a sparse Bernoulli-Gaussian ensemble. Under this condition, we give an efficient algorithm to recover the columns of A given only the covariance matrix of X as input even when S has several Gaussian components.
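The unidentifiability mentioned in the abstract can be illustrated with a small sketch. If S is standard Gaussian, the distribution of X=AS depends on A only through the covariance A Aᵀ, and that covariance is unchanged when A is replaced by AQ for any orthogonal Q. The 2x2 matrices and the rotation angle below are hypothetical, and this is not the paper's recovery algorithm. It only shows why sparsity of A is a plausible tie-breaker.

```python
import math


def matmul(P, Q):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(P[i][k] * Q[k][j] for k in range(len(Q))) for j in range(len(Q[0]))]
        for i in range(len(P))
    ]


def transpose(P):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*P)]


# Hypothetical sparse mixing matrix (one zero entry).
A = [[1.0, 0.0], [2.0, 3.0]]

# An arbitrary rotation Q; any orthogonal matrix would do.
theta = 0.6
Q = [[math.cos(theta), -math.sin(theta)], [math.sin(theta), math.cos(theta)]]
A_rot = matmul(A, Q)

# Both mixing matrices give the same covariance of X when S is standard Gaussian.
cov_A = matmul(A, transpose(A))
cov_rot = matmul(A_rot, transpose(A_rot))
max_gap = max(abs(cov_A[i][j] - cov_rot[i][j]) for i in range(2) for j in range(2))

# The rotated matrix is dense, while the original had a zero entry.
zeros_A = sum(1 for row in A for v in row if v == 0.0)
zeros_rot = sum(1 for row in A_rot for v in row if abs(v) < 1e-12)
```

In this example the two covariances agree up to rounding, but only the original A is sparse, which matches the intuition that a structural assumption on A can single out one factorization of the covariance.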

* Corrected typos
