Abstract: The Neural Tangent Kernel (NTK) viewpoint provides a valuable approach for examining the training dynamics of Physics-Informed Neural Networks (PINNs) in the infinite-width limit. We leverage this perspective and focus on the case of nonlinear Partial Differential Equations (PDEs) solved by PINNs. We provide theoretical results on the different behaviors of the NTK depending on the linearity of the differential operator. Moreover, inspired by these theoretical results, we emphasize the advantage of employing second-order methods for training PINNs. Additionally, we explore the convergence capabilities of second-order methods and address the challenges of spectral bias and slow convergence. Every theoretical result is supported by numerical examples with both linear and nonlinear PDEs, and we validate our training method on benchmark test cases.
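For context, the central object in this kind of analysis is the tangent kernel of the PDE residual; a standard form (the notation below is generic and is not reproduced from the paper itself) is

\[
K_{rr}(x_i, x_j) \;=\; \Big\langle \nabla_\theta\, \mathcal{N}[u_\theta](x_i),\; \nabla_\theta\, \mathcal{N}[u_\theta](x_j) \Big\rangle,
\]

where $u_\theta$ denotes the network, $\mathcal{N}$ the differential operator, and $x_i, x_j$ collocation points. The behavior of this kernel during training in the infinite-width limit is what, according to the abstract, depends on the linearity of $\mathcal{N}$.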
Abstract: In this paper, we address the adversarial training of neural ODEs from a robust control perspective. Adversarial training is an alternative to classical training via empirical risk minimization, and it is widely used to ensure reliable outcomes under input perturbations. Neural ODEs allow deep neural networks to be interpreted as discretizations of control systems, unlocking powerful tools from control theory for the development and understanding of machine learning. In this setting, we formulate adversarial training with perturbed data as a minimax optimal control problem, for which we derive first-order optimality conditions in the form of Pontryagin's Maximum Principle. We provide a novel interpretation of robust training that leads to an alternative, weighted technique, which we test on a low-dimensional classification task.
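As a schematic illustration (the symbols below are generic placeholders, not the paper's exact formulation), a minimax optimal control problem of the kind described above can be written as

\[
\min_{\theta}\ \max_{\|\delta\|\le\varepsilon}\ \ell\big(x_\delta(T)\big),
\qquad
\dot{x}_\delta(t) = f\big(x_\delta(t),\theta(t)\big),\quad x_\delta(0) = x_0 + \delta,
\]

where $\theta$ plays the role of the network weights (the control), $\delta$ is the adversarial perturbation of the input $x_0$ bounded by $\varepsilon$, and $\ell$ is a terminal loss. Pontryagin's Maximum Principle then provides first-order optimality conditions through an adjoint (costate) equation coupled to the forward dynamics.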
Abstract: In our work, we build upon the established connection between Residual Neural Networks (ResNets) and continuous-time control systems known as NeurODEs. By construction, NeurODEs have been limited to constant-width layers, making them unsuitable for modeling deep learning architectures with width-varying layers. In this paper, we propose a continuous-time Autoencoder, which we call AutoencODE, and we extend to this case the mean-field control framework already developed for standard NeurODEs. In this setting, we tackle the case of low Tikhonov regularization, which results in potentially non-convex cost landscapes. While the results obtained under high Tikhonov regularization may no longer hold globally, we show that many of them can be recovered in regions where the loss function is locally convex. Inspired by our theoretical findings, we develop a training method tailored to this type of autoencoder with residual connections, and we validate our approach through numerical experiments conducted on various examples.
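For reference, the ResNet–NeurODE connection the abstract builds on is the standard identification of a residual block with a forward-Euler step of a controlled ODE (notation generic, summarized here only for context):

\[
x_{k+1} = x_k + h\,\mathcal{F}(x_k,\theta_k)
\quad\longleftrightarrow\quad
\dot{x}(t) = \mathcal{F}\big(x(t),\theta(t)\big),
\]

where $h$ is the step size and $\theta_k$ the parameters of layer $k$. The constant-width restriction arises because the state $x(t)$ keeps a fixed dimension along the flow; this is the limitation that, according to the abstract, the AutoencODE construction is designed to relax.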