Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Frederik Hoppe

Comparing Task-Agnostic Embedding Models for Tabular Data

Nov 18, 2025

Frederik Hoppe, Lars Kleinemeier, Astrid Franz, Udo Göbel

Abstract:Recent foundation models for tabular data achieve strong task-specific performance via in-context learning. Nevertheless, they focus on direct prediction by encapsulating both representation learning and task-specific inference inside a single, resource-intensive network. This work specifically focuses on representation learning, i.e., on transferable, task-agnostic embeddings. We systematically evaluate task-agnostic representations from tabular foundation models (TabPFN and TabICL) alongside with classical feature engineering (TableVectorizer) across a variety of application tasks as outlier detection (ADBench) and supervised learning (TabArena Lite). We find that simple TableVectorizer features achieve comparable or superior performance while being up to three orders of magnitude faster than tabular foundation models. The code is available at https://github.com/ContactSoftwareAI/TabEmbedBench.

* Accepted at AI for Tabular Data (EurIPS 2025 Workshop)

Via

Access Paper or Ask Questions

High-Dimensional Confidence Regions in Sparse MRI

Jul 18, 2024

Frederik Hoppe, Felix Krahmer, Claudio Mayrink Verdun, Marion Menzel, Holger Rauhut

Figure 1 for High-Dimensional Confidence Regions in Sparse MRI

Figure 2 for High-Dimensional Confidence Regions in Sparse MRI

Figure 3 for High-Dimensional Confidence Regions in Sparse MRI

Abstract:One of the most promising solutions for uncertainty quantification in high-dimensional statistics is the debiased LASSO that relies on unconstrained $\ell_1$-minimization. The initial works focused on real Gaussian designs as a toy model for this problem. However, in medical imaging applications, such as compressive sensing for MRI, the measurement system is represented by a (subsampled) complex Fourier matrix. The purpose of this work is to extend the method to the MRI case in order to construct confidence intervals for each pixel of an MR image. We show that a sufficient amount of data is $n \gtrsim \max\{ s_0\log^2 s_0\log p, s_0 \log^2 p \}$.

* Recognized with Best Student Paper Award at ICASSP 2023. arXiv admin note: substantial text overlap with arXiv:2212.14864

Via

Access Paper or Ask Questions

With or Without Replacement? Improving Confidence in Fourier Imaging

Jul 18, 2024

Frederik Hoppe, Claudio Mayrink Verdun, Felix Krahmer, Marion I. Menzel, Holger Rauhut

Figure 1 for With or Without Replacement? Improving Confidence in Fourier Imaging

Figure 2 for With or Without Replacement? Improving Confidence in Fourier Imaging

Figure 3 for With or Without Replacement? Improving Confidence in Fourier Imaging

Figure 4 for With or Without Replacement? Improving Confidence in Fourier Imaging

Abstract:Over the last few years, debiased estimators have been proposed in order to establish rigorous confidence intervals for high-dimensional problems in machine learning and data science. The core argument is that the error of these estimators with respect to the ground truth can be expressed as a Gaussian variable plus a remainder term that vanishes as long as the dimension of the problem is sufficiently high. Thus, uncertainty quantification (UQ) can be performed exploiting the Gaussian model. Empirically, however, the remainder term cannot be neglected in many realistic situations of moderately-sized dimensions, in particular in certain structured measurement scenarios such as Magnetic Resonance Imaging (MRI). This, in turn, can downgrade the advantage of the UQ methods as compared to non-UQ approaches such as the standard LASSO. In this paper, we present a method to improve the debiased estimator by sampling without replacement. Our approach leverages recent results of ours on the structure of the random nature of certain sampling schemes showing how a transition between sampling with and without replacement can lead to a weighted reconstruction scheme with improved performance for the standard LASSO. In this paper, we illustrate how this reweighted sampling idea can also improve the debiased estimator and, consequently, provide a better method for UQ in Fourier imaging.

* Accepted at Cosera 2024

Via

Access Paper or Ask Questions

Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Jul 18, 2024

Frederik Hoppe, Claudio Mayrink Verdun, Hannah Laus, Felix Krahmer, Holger Rauhut

Figure 1 for Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Figure 2 for Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Figure 3 for Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Figure 4 for Non-Asymptotic Uncertainty Quantification in High-Dimensional Learning

Abstract:Uncertainty quantification (UQ) is a crucial but challenging task in many high-dimensional regression or learning problems to increase the confidence of a given predictor. We develop a new data-driven approach for UQ in regression that applies both to classical regression approaches such as the LASSO as well as to neural networks. One of the most notable UQ techniques is the debiased LASSO, which modifies the LASSO to allow for the construction of asymptotic confidence intervals by decomposing the estimation error into a Gaussian and an asymptotically vanishing bias component. However, in real-world problems with finite-dimensional data, the bias term is often too significant to be neglected, resulting in overly narrow confidence intervals. Our work rigorously addresses this issue and derives a data-driven adjustment that corrects the confidence intervals for a large class of predictors by estimating the means and variances of the bias terms from training data, exploiting high-dimensional concentration phenomena. This gives rise to non-asymptotic confidence intervals, which can help avoid overestimating uncertainty in critical applications such as MRI diagnosis. Importantly, our analysis extends beyond sparse regression to data-driven predictors like neural networks, enhancing the reliability of model-based deep learning. Our findings bridge the gap between established theory and the practical applicability of such debiased methods.

Via

Access Paper or Ask Questions

Uncertainty quantification for learned ISTA

Sep 14, 2023

Frederik Hoppe, Claudio Mayrink Verdun, Felix Krahmer, Hannah Laus, Holger Rauhut

Figure 1 for Uncertainty quantification for learned ISTA

Figure 2 for Uncertainty quantification for learned ISTA

Figure 3 for Uncertainty quantification for learned ISTA

Abstract:Model-based deep learning solutions to inverse problems have attracted increasing attention in recent years as they bridge state-of-the-art numerical performance with interpretability. In addition, the incorporated prior domain knowledge can make the training more efficient as the smaller number of parameters allows the training step to be executed with smaller datasets. Algorithm unrolling schemes stand out among these model-based learning techniques. Despite their rapid advancement and their close connection to traditional high-dimensional statistical methods, they lack certainty estimates and a theory for uncertainty quantification is still elusive. This work provides a step towards closing this gap proposing a rigorous way to obtain confidence intervals for the LISTA estimator.

* to appear at the 33rd IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2023)

Via

Access Paper or Ask Questions