Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mattia G. Bergomi

Persistence-based operators in machine learning

Dec 28, 2022

Mattia G. Bergomi, Massimo Ferri, Alessandro Mella, Pietro Vertechi

Abstract:Artificial neural networks can learn complex, salient data features to achieve a given task. On the opposite end of the spectrum, mathematically grounded methods such as topological data analysis allow users to design analysis pipelines fully aware of data constraints and symmetries. We introduce a class of persistence-based neural network layers. Persistence-based layers allow the users to easily inject knowledge about symmetries (equivariance) respected by the data, are equipped with learnable weights, and can be composed with state-of-the-art neural architectures.

Via

Access Paper or Ask Questions

Neural network layers as parametric spans

Aug 01, 2022

Mattia G. Bergomi, Pietro Vertechi

Abstract:Properties such as composability and automatic differentiation made artificial neural networks a pervasive tool in applications. Tackling more challenging problems caused neural networks to progressively become more complex and thus difficult to define from a mathematical perspective. We present a general definition of linear layer arising from a categorical framework based on the notions of integration theory and parametric spans. This definition generalizes and encompasses classical layers (e.g., dense, convolutional), while guaranteeing existence and computability of the layer's derivatives for backpropagation.

* 10 pages, submitted to SYCO 9

Via

Access Paper or Ask Questions

Machines of finite depth: towards a formalization of neural networks

Apr 27, 2022

Pietro Vertechi, Mattia G. Bergomi

Figure 1 for Machines of finite depth: towards a formalization of neural networks

Figure 2 for Machines of finite depth: towards a formalization of neural networks

Figure 3 for Machines of finite depth: towards a formalization of neural networks

Figure 4 for Machines of finite depth: towards a formalization of neural networks

Abstract:We provide a unifying framework where artificial neural networks and their architectures can be formally described as particular cases of a general mathematical construction--machines of finite depth. Unlike neural networks, machines have a precise definition, from which several properties follow naturally. Machines of finite depth are modular (they can be combined), efficiently computable and differentiable. The backward pass of a machine is again a machine and can be computed without overhead using the same procedure as the forward pass. We prove this statement theoretically and practically, via a unified implementation that generalizes several classical architectures--dense, convolutional, and recurrent neural networks with a rich shortcut structure--and their respective backpropagation rules.

* 30 pages, 3 figures

Via

Access Paper or Ask Questions

Parametric machines: a fresh approach to architecture search

Jul 08, 2020

Pietro Vertechi, Patrizio Frosini, Mattia G. Bergomi

Figure 1 for Parametric machines: a fresh approach to architecture search

Figure 2 for Parametric machines: a fresh approach to architecture search

Figure 3 for Parametric machines: a fresh approach to architecture search

Figure 4 for Parametric machines: a fresh approach to architecture search

Abstract:Using tools from category theory, we provide a framework where artificial neural networks, and their architectures, can be formally described. We first define the notion of machine in a general categorical context, and show how simple machines can be combined into more complex ones. We explore finite- and infinite-depth machines, which generalize neural networks and neural ordinary differential equations. Borrowing ideas from functional analysis and kernel methods, we build complete, normed, infinite-dimensional spaces of machines, and discuss how to find optimal architectures and parameters -- within those spaces -- to solve a given computational problem. In our numerical experiments, these kernel-inspired networks can outperform classical neural networks when the training dataset is small.

* 31 pages, 4 figures

Via

Access Paper or Ask Questions

Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Dec 31, 2018

Mattia G. Bergomi, Patrizio Frosini, Daniela Giorgi, Nicola Quercioli

Figure 1 for Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Figure 2 for Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Figure 3 for Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Figure 4 for Towards a topological-geometrical theory of group equivariant non-expansive operators for data analysis and machine learning

Abstract:The aim of this paper is to provide a general mathematical framework for group equivariance in the machine learning context. The framework builds on a synergy between persistent homology and the theory of group actions. We define group-equivariant non-expansive operators (GENEOs), which are maps between function spaces associated with groups of transformations. We study the topological and metric properties of the space of GENEOs to evaluate their approximating power and set the basis for general strategies to initialise and compose operators. We begin by defining suitable pseudo-metrics for the function spaces, the equivariance groups, and the set of non-expansive operators. Basing on these pseudo-metrics, we prove that the space of GENEOs is compact and convex, under the assumption that the function spaces are compact and convex. These results provide fundamental guarantees in a machine learning perspective. We show examples on the MNIST and fashion-MNIST datasets. By considering isometry-equivariant non-expansive operators, we describe a simple strategy to select and sample operators, and show how the selected and sampled operators can be used to perform both classical metric learning and an effective initialisation of the kernels of a convolutional neural network.

* 34 pages, 4 figures

Via

Access Paper or Ask Questions

idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Mar 12, 2018

Francisco Romero-Ferrero, Mattia G. Bergomi, Robert Hinz, Francisco J. H. Heras, Gonzalo G. de Polavieja

Figure 1 for idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Figure 2 for idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Figure 3 for idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Figure 4 for idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Abstract:Our understanding of collective animal behavior is limited by our ability to track each of the individuals. We describe an algorithm and software, idtracker.ai, that extracts from video all trajectories with correct identities at a high accuracy for collectives of up to 100 individuals. It uses two deep networks, one detecting when animals touch or cross and another one for animal identification, trained adaptively to conditions and difficulty of the video.

* 44 pages, 1 main figure, 13 supplementary figures, 6 tables

Via

Access Paper or Ask Questions