Abstract: This work characterizes equivariant polynomial functions from tuples of tensor inputs to tensor outputs. Loosely motivated by physics, we focus on functions that are equivariant with respect to the diagonal action of the orthogonal group on tensors. We show how to extend this characterization to other linear algebraic groups, including the Lorentz and symplectic groups. The goal of these characterizations is to define equivariant machine learning models. In particular, we focus on the sparse vector estimation problem. This problem has been broadly studied in the theoretical computer science literature, and explicit spectral methods, derived by techniques from sum-of-squares, can be shown to recover sparse vectors under certain assumptions. Our numerical results show that the proposed equivariant machine learning models can learn spectral methods that outperform the best theoretically known spectral methods in some regimes. The experiments also suggest that learned spectral methods can solve the problem in settings that have not yet been theoretically analyzed. This is an example of a promising direction in which theory can inform machine learning models and machine learning models can inform theory.
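To make the first-order case concrete, here is a minimal sketch (not the paper's construction) of an $O(d)$-equivariant map for a tuple of vector (order-1 tensor) inputs: the invariant features are the pairwise inner products, and an equivariant output is a linear combination of the inputs whose coefficients depend only on those invariants. The coefficient function `coeff_fn` is an illustrative placeholder, not a model from the paper.

```python
# A minimal sketch of an O(d)-equivariant map for vector (order-1 tensor) inputs:
# invariants are the pairwise inner products, and the output is a linear
# combination of the inputs with coefficients that depend only on the invariants.
import numpy as np

def equivariant_vector_output(V, coeff_fn):
    """V: (n, d) array of n input vectors in R^d.
    coeff_fn: maps the n x n Gram matrix to n scalar coefficients."""
    G = V @ V.T                 # O(d)-invariant features (inner products)
    c = coeff_fn(G)             # any function of the invariants
    return c @ V                # equivariant: rotating the inputs rotates the output

# Check equivariance with a random orthogonal matrix Q.
rng = np.random.default_rng(0)
n, d = 4, 3
V = rng.standard_normal((n, d))
coeff_fn = lambda G: np.tanh(G).sum(axis=1)   # illustrative choice
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))
out1 = equivariant_vector_output(V @ Q.T, coeff_fn)
out2 = equivariant_vector_output(V, coeff_fn) @ Q.T
assert np.allclose(out1, out2)
```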
Abstract: This paper studies the geometry of binary hyperdimensional computing (HDC), a computational scheme in which data are encoded using high-dimensional binary vectors. We establish a result about the similarity structure induced by the HDC binding operator and show that the Laplace kernel naturally arises in this setting, motivating our new encoding method, Laplace-HDC, which improves upon previous methods. We describe how our results indicate limitations of binary HDC in encoding spatial information from images and discuss potential solutions, including the use of Haar convolutional features and a translation-equivariant HDC encoding. Several numerical experiments highlighting the improved accuracy of Laplace-HDC in contrast to alternative methods are presented. We also numerically study other aspects of the proposed framework, such as robustness and the underlying translation-equivariant encoding.
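As a point of reference for the binding and similarity structure discussed above, the following is a minimal sketch of standard binary HDC primitives (XOR binding, majority-vote bundling, Hamming similarity); it illustrates the kind of similarity structure the paper analyzes, not the Laplace-HDC encoder itself, and the dimension and example vectors are arbitrary choices.

```python
# A minimal sketch of standard binary HDC primitives (not the Laplace-HDC
# encoder): XOR binding, majority-vote bundling, and Hamming similarity.
import numpy as np

D = 10_000                              # hypervector dimension (illustrative)
rng = np.random.default_rng(0)
rand_hv = lambda: rng.integers(0, 2, D, dtype=np.uint8)

def bind(a, b):                         # XOR binding (self-inverse)
    return np.bitwise_xor(a, b)

def bundle(hvs):                        # bitwise majority vote
    return (np.mean(hvs, axis=0) > 0.5).astype(np.uint8)

def similarity(a, b):                   # 1 - normalized Hamming distance
    return 1.0 - np.mean(a != b)

key, val1, val2 = rand_hv(), rand_hv(), rand_hv()
memory = bundle([bind(key, val1), rand_hv(), rand_hv()])
# Unbinding with the key recovers a vector closer to val1 than to an unrelated one.
print(similarity(bind(memory, key), val1), similarity(bind(memory, key), val2))
```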
Abstract: Single particle cryogenic electron microscopy (cryo-EM) is an imaging technique capable of recovering the high-resolution 3-D structure of biological macromolecules from many noisy and randomly oriented projection images. One notable approach to 3-D reconstruction, known as Kam's method, relies on the moments of the 2-D images. Inspired by Kam's method, we introduce a rotationally invariant metric between two molecular structures, which does not require 3-D alignment. Further, we introduce a metric between a stack of projection images and a molecular structure, which is invariant to rotations and reflections and does not require performing 3-D reconstruction. Additionally, the latter metric does not assume a uniform distribution of viewing angles. We demonstrate uses of the new metrics on synthetic and experimental datasets, highlighting their ability to measure structural similarity.
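As a toy 2-D illustration of the alignment-free idea (not the metric introduced in the paper, and in 2-D rather than 3-D), images can be compared through rotation-invariant statistics such as the rotationally averaged power spectrum, so no rotational alignment is performed before comparison. The bandwidths, mask radius, and test images below are arbitrary illustrative choices.

```python
# A toy 2-D illustration of alignment-free, rotation-invariant comparison:
# each image is summarized by its rotationally averaged power spectrum,
# which is unchanged by in-plane rotation about the image center.
import numpy as np
from scipy.ndimage import gaussian_filter, rotate

def radial_power_spectrum(img, n_bins=24):
    F = np.fft.fftshift(np.fft.fft2(img))
    L = img.shape[0]
    y, x = np.indices(img.shape) - (L - 1) / 2.0
    r = np.hypot(x, y).ravel()
    power = np.abs(F.ravel()) ** 2
    bins = np.linspace(0, L / 2, n_bins + 1)
    idx = np.digitize(r, bins)
    return np.array([power[idx == k].mean() for k in range(1, n_bins + 1)])

rng = np.random.default_rng(0)
L = 64
y, x = np.indices((L, L)) - (L - 1) / 2.0
disk = (np.hypot(x, y) < L / 2 - 4).astype(float)   # support inside a disk

img_a = gaussian_filter(rng.standard_normal((L, L)), sigma=2) * disk
img_a_rot = rotate(img_a, angle=37.0, reshape=False) * disk
img_b = gaussian_filter(rng.standard_normal((L, L)), sigma=5) * disk  # different structure

d = lambda u, v: np.linalg.norm(radial_power_spectrum(u) - radial_power_spectrum(v))
# Small for a rotated copy (up to interpolation error), larger for a different image.
print(d(img_a, img_a_rot), d(img_a, img_b))
```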
Abstract: This paper studies the effect of adding geometrically smoothed momentum to the randomized Kaczmarz algorithm, which is an instance of stochastic gradient descent on a linear least squares loss function. We prove a result about the expected error in the direction of singular vectors of the matrix defining the least squares loss. We present several numerical examples illustrating the utility of our result and pose several questions.
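For reference, the following is a minimal sketch of the randomized Kaczmarz iteration with a momentum term formed as a geometrically weighted sum of past steps; the smoothing parameter `beta` and the exact form of the update are illustrative assumptions, not necessarily the scheme analyzed in the paper.

```python
# A minimal sketch of randomized Kaczmarz with a geometrically weighted
# (heavy-ball-style) momentum term on a consistent system Ax = b.
import numpy as np

def kaczmarz_momentum(A, b, beta=0.5, iters=5000, seed=0):
    rng = np.random.default_rng(seed)
    m, n = A.shape
    row_norms2 = np.einsum('ij,ij->i', A, A)
    probs = row_norms2 / row_norms2.sum()       # standard RK row sampling
    x, v = np.zeros(n), np.zeros(n)
    for _ in range(iters):
        i = rng.choice(m, p=probs)
        step = (b[i] - A[i] @ x) / row_norms2[i] * A[i]
        v = beta * v + step                     # geometrically weighted sum of past steps
        x = x + v
    return x

A = np.random.default_rng(1).standard_normal((200, 20))
x_true = np.ones(20)
x_hat = kaczmarz_momentum(A, A @ x_true)
print(np.linalg.norm(x_hat - x_true))           # small for this consistent system
```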
Abstract: We present a fast and numerically accurate method for expanding digitized $L \times L$ images representing functions on $[-1,1]^2$ supported on the disk $\{x \in \mathbb{R}^2 : |x|<1\}$ in the harmonics (Dirichlet Laplacian eigenfunctions) on the disk. Our method runs in $\mathcal{O}(L^2 \log L)$ operations. This basis is also known as the Fourier-Bessel basis, and it has several computational advantages: it is orthogonal, ordered by frequency, and steerable in the sense that images expanded in the basis can be rotated by applying a diagonal transform to the coefficients. Moreover, we show that convolution with radial functions can also be efficiently computed by applying a diagonal transform to the coefficients.
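The steerability property can be illustrated directly with naive quadrature (not the paper's fast $\mathcal{O}(L^2 \log L)$ algorithm): each basis function has the separable form $J_n(\lambda_{n,k} r) e^{i n \theta}$, so rotating an image by an angle $\alpha$ multiplies the $(n,k)$ coefficient by $e^{-i n \alpha}$, a diagonal transform. The truncation parameters and test image below are arbitrary illustrative choices.

```python
# A naive-quadrature sketch (not the paper's fast algorithm) of expanding a
# disk-supported image in Fourier-Bessel functions J_n(lambda_{n,k} r) e^{i n theta}
# and rotating it by a diagonal transform on the coefficients.
import numpy as np
from scipy.special import jv, jn_zeros

L = 128
grid = np.linspace(-1, 1, L)
X, Y = np.meshgrid(grid, grid)
R, T = np.hypot(X, Y), np.arctan2(Y, X)
mask = R < 1

def basis(n, k):
    lam = jn_zeros(abs(n), k)[-1]                   # k-th positive zero of J_|n|
    psi = jv(n, lam * R) * np.exp(1j * n * T) * mask
    return psi / np.linalg.norm(psi)                # discrete normalization

def expand(img, nmax=6, kmax=6):
    """Naive O(L^2) inner product per coefficient."""
    return {(n, k): np.vdot(basis(n, k), img)
            for n in range(-nmax, nmax + 1) for k in range(1, kmax + 1)}

def rotate_coeffs(coeffs, alpha):
    """Steerability: rotation acts diagonally on the coefficients."""
    return {(n, k): c * np.exp(-1j * n * alpha) for (n, k), c in coeffs.items()}

def synthesize(coeffs):
    return sum(c * basis(n, k) for (n, k), c in coeffs.items()).real

img = mask * np.exp(-((X - 0.2) ** 2 + Y ** 2) / 0.05)
img_rot = synthesize(rotate_coeffs(expand(img), alpha=np.pi / 2))   # coarse rotated approximation
```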
Abstract: We study how the learning rate affects the performance of a relaxed randomized Kaczmarz algorithm for solving $A x \approx b + \varepsilon$, where $A x = b$ is a consistent linear system and $\varepsilon$ has independent mean zero random entries. We derive a learning rate schedule which optimizes a bound on the expected error that is sharp in certain cases; in contrast to the exponential convergence of the standard randomized Kaczmarz algorithm, our optimized bound involves the reciprocal of the Lambert-$W$ function of an exponential.
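For concreteness, the relaxed iteration simply rescales the usual Kaczmarz step by a learning rate $\alpha_k$. The schedule used below is a simple decaying placeholder for illustration only, not the optimized Lambert-$W$-based schedule derived in the paper.

```python
# A minimal sketch of relaxed randomized Kaczmarz for a noisy system Ax ~ b + eps.
# The step-size schedule here is an illustrative placeholder, not the paper's schedule.
import numpy as np

def relaxed_kaczmarz(A, b_noisy, iters=20000, seed=0, c=1e-3):
    rng = np.random.default_rng(seed)
    m, n = A.shape
    row_norms2 = np.einsum('ij,ij->i', A, A)
    probs = row_norms2 / row_norms2.sum()
    x = np.zeros(n)
    for k in range(iters):
        alpha = 1.0 / (1.0 + c * k)             # placeholder learning-rate schedule
        i = rng.choice(m, p=probs)
        x += alpha * (b_noisy[i] - A[i] @ x) / row_norms2[i] * A[i]
    return x

rng = np.random.default_rng(1)
A = rng.standard_normal((500, 50))
x_true = rng.standard_normal(50)
b_noisy = A @ x_true + 0.1 * rng.standard_normal(500)
print(np.linalg.norm(relaxed_kaczmarz(A, b_noisy) - x_true))
```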
Abstract: Let $\mathcal{G} = \{G_1 = (V, E_1), \dots, G_m = (V, E_m)\}$ be a collection of $m$ graphs defined on a common set of vertices $V$ but with different edge sets $E_1, \dots, E_m$. Informally, a function $f : V \rightarrow \mathbb{R}$ is smooth with respect to $G_k = (V,E_k)$ if $f(u) \sim f(v)$ whenever $(u, v) \in E_k$. We study the problem of understanding whether there exists a nonconstant function that is smooth with respect to all graphs in $\mathcal{G}$ simultaneously, and how to find it if it exists.
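One natural way to make this concrete (a relaxation offered only for illustration, not necessarily the formulation studied here) is to measure smoothness on each graph by the Laplacian quadratic form $f^\top L_k f = \sum_{(u,v) \in E_k} (f(u) - f(v))^2$ and to minimize the sum of these forms over nonconstant, normalized $f$:

```python
# A minimal sketch of one natural relaxation: minimize sum_k f^T L_k f over
# unit-norm f orthogonal to the constant vector, i.e. take the smallest
# eigenvector of the summed Laplacian restricted to that complement.
import numpy as np

def jointly_smooth_vector(adjacencies):
    """adjacencies: list of symmetric (n, n) adjacency matrices on the same vertex set."""
    n = adjacencies[0].shape[0]
    L_sum = sum(np.diag(A.sum(axis=1)) - A for A in adjacencies)   # sum of graph Laplacians
    # Orthonormal basis B of the complement of the constant vector.
    Q, _ = np.linalg.qr(np.column_stack([np.ones(n), np.eye(n)[:, :-1]]))
    B = Q[:, 1:]
    evals, evecs = np.linalg.eigh(B.T @ L_sum @ B)
    f = B @ evecs[:, 0]
    return evals[0], f      # small evals[0] indicates a jointly smooth nonconstant f

# Example: a path graph and a graph of two triangles joined by the edge (2, 3).
def adj(edges, n=6):
    A = np.zeros((n, n))
    for u, v in edges:
        A[u, v] = A[v, u] = 1
    return A

G1 = adj([(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)])
G2 = adj([(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)])
lam, f = jointly_smooth_vector([G1, G2])
print(lam, np.round(f, 2))   # f varies slowly along the edges of both graphs
```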
Abstract: We consider the multi-target detection problem of estimating a two-dimensional target image from a large noisy measurement image that contains many randomly rotated and translated copies of the target image. Motivated by single-particle cryo-electron microscopy, we focus on the low signal-to-noise regime, where it is difficult to estimate the locations and orientations of the target images in the measurement. Our approach uses autocorrelation analysis to estimate rotationally and translationally invariant features of the target image. We demonstrate that, regardless of the level of noise, our technique can be used to recover the target image when the measurement is sufficiently large.
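To illustrate the translation-only part of this idea (the rotational component is omitted, and the planted-copy layout and parameters below are arbitrary), the autocorrelation of the measurement can be computed without knowing any copy locations; away from lag zero, it concentrates around a multiple of the target's autocorrelation as the measurement grows at a fixed density of copies.

```python
# A minimal sketch of autocorrelation analysis for multi-target detection with
# translations only: the measurement autocorrelation estimates K times the
# target autocorrelation at nonzero lags, without locating the K copies.
import numpy as np

rng = np.random.default_rng(0)
N, L, K, sigma = 2048, 16, 400, 0.5
target = rng.standard_normal((L, L))

# Plant K non-overlapping copies of the target at random grid positions.
measurement = np.zeros((N, N))
slots = rng.permutation((N // (2 * L)) ** 2)[:K]
for s in slots:
    i, j = divmod(s, N // (2 * L))
    measurement[2 * L * i:2 * L * i + L, 2 * L * j:2 * L * j + L] = target
measurement += sigma * rng.standard_normal((N, N))

def autocorrelation(img):
    F = np.fft.fft2(img)
    return np.real(np.fft.ifft2(np.abs(F) ** 2))    # circular autocorrelation

ac_meas = autocorrelation(measurement)
ac_target = autocorrelation(np.pad(target, (0, N - L)))
# At a nonzero lag the noise contribution has mean zero, so ac_meas / K
# estimates ac_target there, and the estimate improves as N grows.
print(ac_meas[1, 0] / K, ac_target[1, 0])
```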
Abstract: We introduce a framework for recovering an image from its rotationally and translationally invariant features based on autocorrelation analysis. This setting is an instance of the multi-target detection statistical model, which is mainly used to study the mathematical and computational properties of single-particle reconstruction using cryo-electron microscopy (cryo-EM) at low signal-to-noise ratios. We demonstrate with synthetic numerical experiments that an image can be reconstructed from its rotationally and translationally invariant features and show that the reconstruction is robust to noise. These results constitute an important step towards the goal of structure determination of small biomolecules using cryo-EM.
Abstract: In this paper we answer the following question: what is the infinitesimal generator of the diffusion process defined by a kernel that is normalized such that it is bi-stochastic with respect to a specified measure? More precisely, under the assumption that data is sampled from a Riemannian manifold, we determine how the resulting infinitesimal generator depends on the potentially nonuniform distribution of the sample points and on the specified measure for the bi-stochastic normalization. In a special case, we demonstrate a connection to the heat kernel. We consider both the case where only a single data set is given and the case where a data set and a reference set are given. The spectral theory of the constructed operators is studied, and Nystr\"om extension formulas for the gradients of the eigenfunctions are computed. Applications to discrete point sets and manifold learning are discussed.
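As a concrete illustration of the normalization step only, a kernel matrix can be made bi-stochastic with respect to the uniform (counting) measure by a symmetric Sinkhorn scaling. The bandwidth, the damped update, and the circle example below are illustrative assumptions rather than details taken from the paper.

```python
# A minimal sketch of bi-stochastic kernel normalization via a damped
# symmetric Sinkhorn scaling, with respect to the uniform (counting) measure.
import numpy as np

def bistochastic_kernel(X, eps, n_iter=500):
    """X: (n, d) points sampled from a manifold; eps: Gaussian kernel bandwidth."""
    D2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-D2 / eps)
    d = np.ones(len(X))
    for _ in range(n_iter):
        d = np.sqrt(d / (K @ d))            # damped symmetric Sinkhorn update
    return d[:, None] * K * d[None, :]      # symmetric, rows and columns sum to ~1

# Example: points on the unit circle with a nonuniform sampling density.
rng = np.random.default_rng(0)
theta = rng.beta(2, 1, size=400) * 2 * np.pi
X = np.column_stack([np.cos(theta), np.sin(theta)])
W = bistochastic_kernel(X, eps=0.05)
print(W.sum(axis=1)[:5])                    # approximately 1 after normalization
# A graph-Laplacian-type operator such as (I - W) / eps then discretizes a
# generator of the kind analyzed in the paper.
```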