Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Christoph Schnörr

Generative Modeling of Discrete Data Using Geometric Latent Subspaces

Jan 29, 2026

Daniel Gonzalez-Alvarado, Jonas Cassel, Stefania Petra, Christoph Schnörr

Abstract:We introduce the use of latent subspaces in the exponential parameter space of product manifolds of categorial distributions, as a tool for learning generative models of discrete data. The low-dimensional latent space encodes statistical dependencies and removes redundant degrees of freedom among the categorial variables. We equip the parameter domain with a Riemannian geometry such that the spaces and distances are related by isometries which enables consistent flow matching. In particular, geodesics become straight lines which makes model training by flow matching effective. Empirical results demonstrate that reduced latent dimensions suffice to represent data for generative modeling.

Via

Access Paper or Ask Questions

Riemannian Patch Assignment Gradient Flows

Apr 17, 2025

Daniel Gonzalez-Alvarado, Fabio Schlindwein, Jonas Cassel, Laura Steingruber, Stefania Petra, Christoph Schnörr

Abstract:This paper introduces patch assignment flows for metric data labeling on graphs. Labelings are determined by regularizing initial local labelings through the dynamic interaction of both labels and label assignments across the graph, entirely encoded by a dictionary of competing labeled patches and mediated by patch assignment variables. Maximal consistency of patch assignments is achieved by geometric numerical integration of a Riemannian ascent flow, as critical point of a Lagrangian action functional. Experiments illustrate properties of the approach, including uncertainty quantification of label assignments.

Via

Access Paper or Ask Questions

On Moving Object Segmentation from Monocular Video with Transformers

Nov 28, 2024

Christian Homeyer, Christoph Schnörr

Abstract:Moving object detection and segmentation from a single moving camera is a challenging task, requiring an understanding of recognition, motion and 3D geometry. Combining both recognition and reconstruction boils down to a fusion problem, where appearance and motion features need to be combined for classification and segmentation. In this paper, we present a novel fusion architecture for monocular motion segmentation - M3Former, which leverages the strong performance of transformers for segmentation and multi-modal fusion. As reconstructing motion from monocular video is ill-posed, we systematically analyze different 2D and 3D motion representations for this problem and their importance for segmentation performance. Finally, we analyze the effect of training data and show that diverse datasets are required to achieve SotA performance on Kitti and Davis.

* Proceedings of the IEEE/CVF International Conference on Computer Vision 2023 (880--891)
* WICCV2023

Via

Access Paper or Ask Questions

DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Nov 26, 2024

Christian Homeyer, Leon Begiristain, Christoph Schnörr

Figure 1 for DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Figure 2 for DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Figure 3 for DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Figure 4 for DROID-Splat: Combining end-to-end SLAM with 3D Gaussian Splatting

Abstract:Recent progress in scene synthesis makes standalone SLAM systems purely based on optimizing hyperprimitives with a Rendering objective possible \cite{monogs}. However, the tracking performance still lacks behind traditional \cite{orbslam} and end-to-end SLAM systems \cite{droid}. An optimal trade-off between robustness, speed and accuracy has not yet been reached, especially for monocular video. In this paper, we introduce a SLAM system based on an end-to-end Tracker and extend it with a Renderer based on recent 3D Gaussian Splatting techniques. Our framework \textbf{DroidSplat} achieves both SotA tracking and rendering results on common SLAM benchmarks. We implemented multiple building blocks of modern SLAM systems to run in parallel, allowing for fast inference on common consumer GPU's. Recent progress in monocular depth prediction and camera calibration allows our system to achieve strong results even on in-the-wild data without known camera intrinsics. Code will be available at \url{https://github.com/ChenHoy/DROID-Splat}.

Via

Access Paper or Ask Questions

Sigma Flows for Image and Data Labeling and Learning Structured Prediction

Aug 28, 2024

Jonas Cassel, Bastian Boll, Stefania Petra, Peter Albers, Christoph Schnörr

Abstract:This paper introduces the sigma flow model for the prediction of structured labelings of data observed on Riemannian manifolds, including Euclidean image domains as special case. The approach combines the Laplace-Beltrami framework for image denoising and enhancement, introduced by Sochen, Kimmel and Malladi about 25 years ago, and the assignment flow approach introduced and studied by the authors. The sigma flow arises as Riemannian gradient flow of generalized harmonic energies and thus is governed by a nonlinear geometric PDE which determines a harmonic map from a closed Riemannian domain manifold to a statistical manifold, equipped with the Fisher-Rao metric from information geometry. A specific ingredient of the sigma flow is the mutual dependency of the Riemannian metric of the domain manifold on the evolving state. This makes the approach amenable to machine learning in a specific way, by realizing this dependency through a mapping with compact time-variant parametrization that can be learned from data. Proof of concept experiments demonstrate the expressivity of the sigma flow model and prediction performance. Structural similarities to transformer network architectures and networks generated by the geometric integration of sigma flows are pointed out, which highlights the connection to deep learning and, conversely, may stimulate the use of geometric design principles for structured prediction in other areas of scientific machine learning.

* 51 pages

Via

Access Paper or Ask Questions

Learning Distances from Data with Normalizing Flows and Score Matching

Jul 12, 2024

Peter Sorrenson, Daniel Behrend-Uriarte, Christoph Schnörr, Ullrich Köthe

Figure 1 for Learning Distances from Data with Normalizing Flows and Score Matching

Figure 2 for Learning Distances from Data with Normalizing Flows and Score Matching

Figure 3 for Learning Distances from Data with Normalizing Flows and Score Matching

Figure 4 for Learning Distances from Data with Normalizing Flows and Score Matching

Abstract:Density-based distances (DBDs) offer an elegant solution to the problem of metric learning. By defining a Riemannian metric which increases with decreasing probability density, shortest paths naturally follow the data manifold and points are clustered according to the modes of the data. We show that existing methods to estimate Fermat distances, a particular choice of DBD, suffer from poor convergence in both low and high dimensions due to i) inaccurate density estimates and ii) reliance on graph-based paths which are increasingly rough in high dimensions. To address these issues, we propose learning the densities using a normalizing flow, a generative model with tractable density estimation, and employing a smooth relaxation method using a score model initialized from a graph-based proposal. Additionally, we introduce a dimension-adapted Fermat distance that exhibits more intuitive behavior when scaled to high dimensions and offers better numerical properties. Our work paves the way for practical use of density-based distances, especially in high-dimensional spaces.

Via

Access Paper or Ask Questions

Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Jun 06, 2024

Bastian Boll, Daniel Gonzalez-Alvarado, Stefania Petra, Christoph Schnörr

Figure 1 for Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Figure 2 for Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Figure 3 for Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Figure 4 for Generative Assignment Flows for Representing and Learning Joint Distributions of Discrete Data

Abstract:We introduce a novel generative model for the representation of joint probability distributions of a possibly large number of discrete random variables. The approach uses measure transport by randomized assignment flows on the statistical submanifold of factorizing distributions, which also enables to sample efficiently from the target distribution and to assess the likelihood of unseen data points. The embedding of the flow via the Segre map in the meta-simplex of all discrete joint distributions ensures that any target distribution can be represented in principle, whose complexity in practice only depends on the parametrization of the affinity function of the dynamical assignment flow system. Our model can be trained in a simulation-free manner without integration by conditional Riemannian flow matching, using the training data encoded as geodesics in closed-form with respect to the e-connection of information geometry. By projecting high-dimensional flow matching in the meta-simplex of joint distributions to the submanifold of factorizing distributions, our approach has strong motivation from first principles of modeling coupled discrete variables. Numerical experiments devoted to distributions of structured image labelings demonstrate the applicability to large-scale problems, which may include discrete distributions in other application areas. Performance measures show that our approach scales better with the increasing number of classes than recent related work.

Via

Access Paper or Ask Questions

The Central Spanning Tree Problem

Apr 09, 2024

Enrique Fita Sanmartín, Christoph Schnörr, Fred A. Hamprecht

Abstract:Spanning trees are an important primitive in many data analysis tasks, when a data set needs to be summarized in terms of its "skeleton", or when a tree-shaped graph over all observations is required for downstream processing. Popular definitions of spanning trees include the minimum spanning tree and the optimum distance spanning tree, a.k.a. the minimum routing cost tree. When searching for the shortest spanning tree but admitting additional branching points, even shorter spanning trees can be realized: Steiner trees. Unfortunately, both minimum spanning and Steiner trees are not robust with respect to noise in the observations; that is, small perturbations of the original data set often lead to drastic changes in the associated spanning trees. In response, we make two contributions when the data lies in a Euclidean space: on the theoretical side, we introduce a new optimization problem, the "(branched) central spanning tree", which subsumes all previously mentioned definitions as special cases. On the practical side, we show empirically that the (branched) central spanning tree is more robust to noise in the data, and as such is better suited to summarize a data set in terms of its skeleton. We also propose a heuristic to address the NP-hard optimization problem, and illustrate its use on single cell RNA expression data from biology and 3D point clouds of plants.

Via

Access Paper or Ask Questions

Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds

Feb 12, 2024

Bastian Boll, Daniel Gonzalez-Alvarado, Christoph Schnörr

Figure 1 for Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds

Figure 2 for Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds

Figure 3 for Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds

Figure 4 for Generative Modeling of Discrete Joint Distributions by E-Geodesic Flow Matching on Assignment Manifolds

Abstract:This paper introduces a novel generative model for discrete distributions based on continuous normalizing flows on the submanifold of factorizing discrete measures. Integration of the flow gradually assigns categories and avoids issues of discretizing the latent continuous model like rounding, sample truncation etc. General non-factorizing discrete distributions capable of representing complex statistical dependencies of structured discrete data, can be approximated by embedding the submanifold into a the meta-simplex of all joint discrete distributions and data-driven averaging. Efficient training of the generative model is demonstrated by matching the flow of geodesics of factorizing discrete distributions. Various experiments underline the approach's broad applicability.

Via

Access Paper or Ask Questions

On the Universality of Coupling-based Normalizing Flows

Feb 09, 2024

Felix Draxler, Stefan Wahl, Christoph Schnörr, Ullrich Köthe

Figure 1 for On the Universality of Coupling-based Normalizing Flows

Figure 2 for On the Universality of Coupling-based Normalizing Flows

Figure 3 for On the Universality of Coupling-based Normalizing Flows

Figure 4 for On the Universality of Coupling-based Normalizing Flows

Abstract:We present a novel theoretical framework for understanding the expressive power of coupling-based normalizing flows such as RealNVP. Despite their prevalence in scientific applications, a comprehensive understanding of coupling flows remains elusive due to their restricted architectures. Existing theorems fall short as they require the use of arbitrarily ill-conditioned neural networks, limiting practical applicability. Additionally, we demonstrate that these constructions inherently lead to volume-preserving flows, a property which we show to be a fundamental constraint for expressivity. We propose a new distributional universality theorem for coupling-based normalizing flows, which overcomes several limitations of prior work. Our results support the general wisdom that the coupling architecture is expressive and provide a nuanced view for choosing the expressivity of coupling functions, bridging a gap between empirical results and theoretical understanding.

* under review

Via

Access Paper or Ask Questions