Kathlén Kohn

Learning on a Razor's Edge: the Singularity Bias of Polynomial Neural Networks

May 17, 2025

Vahid Shahverdi, Giovanni Luca Marchetti, Kathlén Kohn

Abstract: Deep neural networks often infer sparse representations, converging to a subnetwork during the learning process. In this work, we theoretically analyze subnetworks and their bias through the lens of algebraic geometry. We consider fully-connected networks with polynomial activation functions, and focus on the geometry of the function space they parametrize, often referred to as the neuromanifold. First, we compute the dimension of the subspace of the neuromanifold parametrized by subnetworks. Second, we show that this subspace is singular. Third, we argue that such singularities often correspond to critical points of the training dynamics. Lastly, we discuss convolutional networks, for which subnetworks and singularities are similarly related, but the bias does not arise.

An Algebraic Geometry Approach to Viewing Graph Solvability

Apr 04, 2025

Federica Arrigoni, Kathlén Kohn, Andrea Fusiello, Tomas Pajdla

Abstract: The concept of viewing graph solvability has gained significant interest in the context of structure-from-motion. A viewing graph is a mathematical structure where nodes are associated with cameras and edges represent the epipolar geometry connecting overlapping views. Solvability studies under which conditions the cameras are uniquely determined by the graph. In this paper we propose a novel framework for analyzing solvability problems based on algebraic geometry, demonstrating its potential in understanding structure-from-motion graphs and proving a conjecture that was previously proposed.

A Framework for Reducing the Complexity of Geometric Vision Problems and its Application to Two-View Triangulation with Approximation Bounds

Mar 11, 2025

Felix Rydell, Georg Bökman, Fredrik Kahl, Kathlén Kohn

Abstract: In this paper, we present a new framework for reducing the computational complexity of geometric vision problems through targeted reweighting of the cost functions used to minimize reprojection errors. Triangulation - the task of estimating a 3D point from noisy 2D projections across multiple images - is a fundamental problem in multiview geometry and Structure-from-Motion (SfM) pipelines. We apply our framework to the two-view case and demonstrate that optimal triangulation, which requires solving a univariate polynomial of degree six, can be simplified through cost function reweighting, reducing the polynomial degree to two. This reweighting yields a closed-form solution while preserving strong geometric accuracy. We derive optimal weighting strategies, establish theoretical bounds on the approximation error, and provide experimental results on real data demonstrating the effectiveness of the proposed approach compared to standard methods. Although this work focuses on two-view triangulation, the framework generalizes to other geometric vision problems.

PLMP -- Point-Line Minimal Problems for Projective SfM

Mar 06, 2025

Kim Kiehn, Albin Ahlbäck, Kathlén Kohn

Abstract: We completely classify all minimal problems for Structure-from-Motion (SfM) where arrangements of points and lines are fully observed by multiple uncalibrated pinhole cameras. We find 291 minimal problems, 73 of which have unique solutions and can thus be solved linearly. Two of the linear problems allow an arbitrary number of views, while all other minimal problems have at most 9 cameras. All minimal problems have at most 7 points and at most 12 lines. We compute the number of solutions of each minimal problem, as this gives a measurement of the problem's intrinsic difficulty, and find that these numbers are relatively low (e.g., when comparing with minimal problems for calibrated cameras). Finally, by exploring stabilizer subgroups of subarrangements, we develop a geometric and systematic way to 1) factorize minimal problems into smaller problems, 2) identify minimal problems in underconstrained problems, and 3) formally prove non-minimality.

An Invitation to Neuroalgebraic Geometry

Jan 31, 2025

Giovanni Luca Marchetti, Vahid Shahverdi, Stefano Mereta, Matthew Trager, Kathlén Kohn

Abstract: In this expository work, we promote the study of function spaces parameterized by machine learning models through the lens of algebraic geometry. To this end, we focus on algebraic models, such as neural networks with polynomial activations, whose associated function spaces are semi-algebraic varieties. We outline a dictionary between algebro-geometric invariants of these varieties, such as dimension, degree, and singularities, and fundamental aspects of machine learning, such as sample complexity, expressivity, training dynamics, and implicit bias. Along the way, we review the literature and discuss ideas beyond the algebraic domain. This work lays the foundations of a research direction bridging algebraic geometry and deep learning, which we refer to as neuroalgebraic geometry.

On the Geometry and Optimization of Polynomial Convolutional Networks

Oct 01, 2024

Vahid Shahverdi, Giovanni Luca Marchetti, Kathlén Kohn

Abstract: We study convolutional neural networks with monomial activation functions. Specifically, we prove that their parameterization map is regular and is an isomorphism almost everywhere, up to rescaling the filters. By leveraging tools from algebraic geometry, we explore the geometric properties of the image in function space of this map -- typically referred to as the neuromanifold. In particular, we compute the dimension and the degree of the neuromanifold, which measure the expressivity of the model, and describe its singularities. Moreover, for a generic large dataset, we derive an explicit formula that quantifies the number of critical points arising in the optimization of a regression loss.
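
The phrase "up to rescaling the filters" can be illustrated with a toy computation. The sketch below is not taken from the paper: it assumes a two-layer, stride-one, one-dimensional network with activation z -> z**r applied only after the first layer, and the helper names `conv_layer` and `network` are hypothetical. Under these assumptions, scaling the first filter by c and the second by c**(-r) leaves the computed function unchanged.

```python
from fractions import Fraction


def conv_layer(w, x):
    """Stride-one 1D layer (assumed form): output[i] = sum_j w[j] * x[i + j]."""
    n = len(x) - len(w) + 1
    return [sum(wj * x[i + j] for j, wj in enumerate(w)) for i in range(n)]


def network(w1, w2, x, r=2):
    """Two layers with the monomial activation z -> z**r in between (toy model)."""
    hidden = [z ** r for z in conv_layer(w1, x)]
    return conv_layer(w2, hidden)


x = [1, -2, 3, 0, 5]
w1 = [Fraction(1), Fraction(2)]
w2 = [Fraction(3), Fraction(-1)]

c, r = Fraction(2), 2
w1_scaled = [c * w for w in w1]        # scale the first filter by c
w2_scaled = [w / c ** r for w in w2]   # compensate in the second filter by c**(-r)

original = network(w1, w2, x, r)
rescaled = network(w1_scaled, w2_scaled, x, r)
print(original == rescaled)  # True
```

Exact rational arithmetic (`fractions.Fraction`) is used so that the equality check is not affected by floating-point rounding.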

Geometry of Lightning Self-Attention: Identifiability and Dimension

Aug 30, 2024

Nathan W. Henry, Giovanni Luca Marchetti, Kathlén Kohn

Abstract: We consider function spaces defined by self-attention networks without normalization, and theoretically analyze their geometry. Since these networks are polynomial, we rely on tools from algebraic geometry. In particular, we study the identifiability of deep attention by providing a description of the generic fibers of the parametrization for an arbitrary number of layers and, as a consequence, compute the dimension of the function space. Additionally, for a single-layer model, we characterize the singular and boundary points. Finally, we formulate a conjectural extension of our results to normalized self-attention networks, prove it for a single layer, and numerically verify it in the deep case.
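
To see why attention without normalization is polynomial and why its parameters are not uniquely determined, here is a minimal sketch. It assumes a single layer of the form (X Q)(X K)^T (X V), which may differ in conventions from the paper; the names `attention`, `q`, `k`, `v`, and the matrix `c` are illustrative. The output is cubic in the input X, and it depends on Q and K only through the product Q K^T, so different parameters can give the same function.

```python
def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(row) for row in zip(*a)]


def attention(x, q, k, v):
    """Un-normalized self-attention (assumed form): (X Q)(X K)^T (X V)."""
    scores = matmul(matmul(x, q), transpose(matmul(x, k)))
    return matmul(scores, matmul(x, v))


x = [[1, 2], [0, -1], [3, 1]]   # 3 tokens, embedding dimension 2
q = [[1, 0], [2, 1]]
k = [[0, 1], [1, 1]]
v = [[1, -1], [2, 0]]

# Reparametrize: Q -> Q C and K -> K C^{-T} keeps Q K^T fixed.
c = [[1, 1], [0, 1]]
c_inv_t = [[1, 0], [-1, 1]]     # inverse transpose of c

out = attention(x, q, k, v)
out_reparam = attention(x, matmul(q, c), matmul(k, c_inv_t), v)
print(out == out_reparam)       # True: same function, different parameters

# Cubic homogeneity in the input: scaling X by t scales the output by t**3.
t = 2
x_scaled = [[t * e for e in row] for row in x]
out_scaled = attention(x_scaled, q, k, v)
print(out_scaled == [[t ** 3 * e for e in row] for row in out])  # True
```

Integer entries keep every comparison exact. The reparametrization shown is only one family of symmetries under the assumed form, not a full description of the fibers.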

Order-One Rolling Shutter Cameras

Mar 17, 2024

Marvin Anas Hahn, Kathlén Kohn, Orlando Marigliano, Tomas Pajdla

Abstract: Rolling shutter (RS) cameras dominate consumer and smartphone markets. Several methods for computing the absolute pose of RS cameras have appeared in the last 20 years, but the relative pose problem has not been fully solved yet. We provide a unified theory for the important class of order-one rolling shutter (RS$_1$) cameras. These cameras generalize the perspective projection to RS cameras, projecting a generic space point to exactly one image point via a rational map. We introduce a new back-projection RS camera model, characterize RS$_1$ cameras, construct explicit parameterizations of such cameras, and determine the image of a space line. We classify all minimal problems for solving the relative camera pose problem with linear RS$_1$ cameras and discover new practical cases. Finally, we show how the theory can be used to explain RS models previously used for absolute pose computation.

* 36 pages, 6 figures, 3 ancillary files

Geometry of Linear Neural Networks: Equivariance and Invariance under Permutation Groups

Sep 24, 2023

Kathlén Kohn, Anna-Laura Sattelberger, Vahid Shahverdi

Abstract: The set of functions parameterized by a linear fully-connected neural network is a determinantal variety. We investigate the subvariety of functions that are equivariant or invariant under the action of a permutation group. Examples of such group actions are translations or $90^\circ$ rotations on images. For such equivariant or invariant subvarieties, we provide an explicit description of their dimension, their degree as well as their Euclidean distance degree, and their singularities. We fully characterize invariance for arbitrary permutation groups, and equivariance for cyclic groups. We draw conclusions for the parameterization and the design of equivariant and invariant linear networks, such as a weight sharing property, and we prove that all invariant linear functions can be learned by linear autoencoders.

* 24 pages, 2 figures, comments welcome!
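
The first sentence of the abstract can be checked by hand on a small case. The following sketch assumes a two-layer linear network with widths 3 -> 2 -> 3 and illustrative integer weights `w1`, `w2`. The end-to-end map is the product W2 W1, whose rank is at most the bottleneck width 2, so the determinant of the 3 x 3 product vanishes. Matrices of bounded rank are the standard example of a determinantal variety.

```python
def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def det3(m):
    """Determinant of a 3 x 3 matrix by cofactor expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


w1 = [[1, 2, 0], [-1, 3, 4]]      # 2 x 3: input dimension 3 -> hidden width 2
w2 = [[2, 1], [0, -1], [5, 3]]    # 3 x 2: hidden width 2 -> output dimension 3

end_to_end = matmul(w2, w1)       # the 3 x 3 linear map the network computes
print(det3(end_to_end))           # 0, since rank(W2 W1) <= 2
```

Any other choice of integer weights with the same shapes gives determinant 0 as well, because the rank bound does not depend on the particular entries.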

Function Space and Critical Points of Linear Convolutional Networks

Apr 12, 2023

Kathlén Kohn, Guido Montúfar, Vahid Shahverdi, Matthew Trager

Abstract: We study the geometry of linear networks with one-dimensional convolutional layers. The function spaces of these networks can be identified with semi-algebraic families of polynomials admitting sparse factorizations. We analyze the impact of the network's architecture on the function space's dimension, boundary, and singular points. We also describe the critical points of the network's parameterization map. Furthermore, we study the optimization problem of training a network with the squared error loss. We prove that for architectures where all strides are larger than one and generic data, the non-zero critical points of that optimization problem are smooth interior points of the function space. This property is known to be false for dense linear networks and linear convolutional networks with stride one.

* 33 pages, 1 figure, 1 table
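
The link between linear convolutional networks and polynomial factorizations is easiest to see in the stride-one case. The sketch below covers only that case and uses hypothetical helper names: each filter is read as the coefficient list of a polynomial, and stacking two layers is the same as applying a single layer whose filter is the product of the two polynomials. Strides larger than one, which the abstract also covers, are not modeled here.

```python
def conv_layer(w, x):
    """Stride-one 1D linear layer: output[i] = sum_j w[j] * x[i + j]."""
    n = len(x) - len(w) + 1
    return [sum(wj * x[i + j] for j, wj in enumerate(w)) for i in range(n)]


def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


x = [1, 2, 3, 4, 5, 6]
w1 = [1, 2]          # 1 + 2t
w2 = [3, -1, 2]      # 3 - t + 2t^2

two_layers = conv_layer(w2, conv_layer(w1, x))
end_to_end_filter = poly_mul(w1, w2)
one_layer = conv_layer(end_to_end_filter, x)

print(end_to_end_filter)   # [3, 5, 0, 4]
print(two_layers)          # [29, 41, 53]
print(one_layer)           # [29, 41, 53]
```

Read in reverse, an end-to-end filter can be realized by this two-layer architecture only if its polynomial factors into pieces of the right degrees, which is the kind of constraint that cuts out the function space.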
