Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nikita Puchkin

Dimension-free bounds in high-dimensional linear regression via error-in-operator approach

Feb 21, 2025

Fedor Noskov, Nikita Puchkin, Vladimir Spokoiny

Abstract:We consider a problem of high-dimensional linear regression with random design. We suggest a novel approach referred to as error-in-operator which does not estimate the design covariance $\Sigma$ directly but incorporates it into empirical risk minimization. We provide an expansion of the excess prediction risk and derive non-asymptotic dimension-free bounds on the leading term and the remainder. This helps us to show that auxiliary variables do not increase the effective dimension of the problem, provided that parameters of the procedure are tuned properly. We also discuss computational aspects of our method and illustrate its performance with numerical experiments.

* 100 pages

Via

Access Paper or Ask Questions

Generalization error bound for denoising score matching under relaxed manifold assumption

Feb 19, 2025

Konstantin Yakovlev, Nikita Puchkin

Abstract:We examine theoretical properties of the denoising score matching estimate. We model the density of observations with a nonparametric Gaussian mixture. We significantly relax the standard manifold assumption allowing the samples step away from the manifold. At the same time, we are still able to leverage a nice distribution structure. We derive non-asymptotic bounds on the approximation and generalization errors of the denoising score matching estimate. The rates of convergence are determined by the intrinsic dimension. Furthermore, our bounds remain valid even if we allow the ambient dimension grow polynomially with the sample size.

* 59 pages

Via

Access Paper or Ask Questions

Score-based change point detection via tracking the best of infinitely many experts

Aug 26, 2024

Anna Markovich, Nikita Puchkin

Figure 1 for Score-based change point detection via tracking the best of infinitely many experts

Figure 2 for Score-based change point detection via tracking the best of infinitely many experts

Figure 3 for Score-based change point detection via tracking the best of infinitely many experts

Figure 4 for Score-based change point detection via tracking the best of infinitely many experts

Abstract:We suggest a novel algorithm for online change point detection based on sequential score function estimation and tracking the best expert approach. The core of the procedure is a version of the fixed share forecaster for the case of infinite number of experts and quadratic loss functions. The algorithm shows a promising performance in numerical experiments on artificial and real-world data sets. We also derive new upper bounds on the dynamic regret of the fixed share forecaster with varying parameter, which are of independent interest.

* 43 pages, 4 figures

Via

Access Paper or Ask Questions

Dimension-free Structured Covariance Estimation

Feb 15, 2024

Nikita Puchkin, Maxim Rakhuba

Abstract:Given a sample of i.i.d. high-dimensional centered random vectors, we consider a problem of estimation of their covariance matrix $\Sigma$ with an additional assumption that $\Sigma$ can be represented as a sum of a few Kronecker products of smaller matrices. Under mild conditions, we derive the first non-asymptotic dimension-free high-probability bound on the Frobenius distance between $\Sigma$ and a widely used penalized permuted least squares estimate. Because of the hidden structure, the established rate of convergence is faster than in the standard covariance estimation problem.

* 30 pages

Via

Access Paper or Ask Questions

Breaking the Heavy-Tailed Noise Barrier in Stochastic Optimization Problems

Nov 07, 2023

Nikita Puchkin, Eduard Gorbunov, Nikolay Kutuzov, Alexander Gasnikov

Abstract:We consider stochastic optimization problems with heavy-tailed noise with structured density. For such problems, we show that it is possible to get faster rates of convergence than $\mathcal{O}(K^{-2(\alpha - 1)/\alpha})$, when the stochastic gradients have finite moments of order $\alpha \in (1, 2]$. In particular, our analysis allows the noise norm to have an unbounded expectation. To achieve these results, we stabilize stochastic gradients, using smoothed medians of means. We prove that the resulting estimates have negligible bias and controllable variance. This allows us to carefully incorporate them into clipped-SGD and clipped-SSTM and derive new high-probability complexity bounds in the considered setup.

* 62 pages, 2 figures

Via

Access Paper or Ask Questions

Exploring Local Norms in Exp-concave Statistical Learning

Feb 21, 2023

Nikita Puchkin, Nikita Zhivotovskiy

Abstract:We consider the problem of stochastic convex optimization with exp-concave losses using Empirical Risk Minimization in a convex class. Answering a question raised in several prior works, we provide a $O( d / n + \log( 1 / \delta) / n )$ excess risk bound valid for a wide class of bounded exp-concave losses, where $d$ is the dimension of the convex reference set, $n$ is the sample size, and $\delta$ is the confidence level. Our result is based on a unified geometric assumption on the gradient of losses and the notion of local norms.

* 20 pages

Via

Access Paper or Ask Questions

A Contrastive Approach to Online Change Point Detection

Jun 21, 2022

Nikita Puchkin, Valeriia Shcherbakova

Figure 1 for A Contrastive Approach to Online Change Point Detection

Figure 2 for A Contrastive Approach to Online Change Point Detection

Figure 3 for A Contrastive Approach to Online Change Point Detection

Figure 4 for A Contrastive Approach to Online Change Point Detection

Abstract:We suggest a novel procedure for online change point detection. Our approach expands an idea of maximizing a discrepancy measure between points from pre-change and post-change distributions. This leads to a flexible procedure suitable for both parametric and nonparametric scenarios. We prove non-asymptotic bounds on the average running length of the procedure and its expected detection delay. The efficiency of the algorithm is illustrated with numerical experiments on synthetic and real-world data sets.

* 34 pages, 3 figures

Via

Access Paper or Ask Questions

Exponential Savings in Agnostic Active Learning through Abstention

Jan 31, 2021

Nikita Puchkin, Nikita Zhivotovskiy

Abstract:We show that in pool-based active classification without assumptions on the underlying distribution, if the learner is given the power to abstain from some predictions by paying the price marginally smaller than the average loss $1/2$ of a random guess, exponential savings in the number of label requests are possible whenever they are possible in the corresponding realizable problem. We extend this result to provide a necessary and sufficient condition for exponential savings in pool-based active classification under the model misspecification.

* 25 pages

Via

Access Paper or Ask Questions

Rates of convergence for density estimation with GANs

Jan 30, 2021

Denis Belomestny, Eric Moulines, Alexey Naumov, Nikita Puchkin, Sergey Samsonov

Abstract:We undertake a precise study of the non-asymptotic properties of vanilla generative adversarial networks (GANs) and derive theoretical guarantees in the problem of estimating an unknown $d$-dimensional density $p^*$ under a proper choice of the class of generators and discriminators. We prove that the resulting density estimate converges to $p^*$ in terms of Jensen-Shannon (JS) divergence at the rate $(\log n/n)^{2\beta/(2\beta+d)}$ where $n$ is the sample size and $\beta$ determines the smoothness of $p^*.$ This is the first result in the literature on density estimation using vanilla GANs with JS rates faster than $n^{-1/2}$ in the regime $\beta>d/2.$

* 27 pages

Via

Access Paper or Ask Questions

Structure-adaptive manifold estimation

Jun 19, 2019

Nikita Puchkin, Vladimir Spokoiny

Figure 1 for Structure-adaptive manifold estimation

Figure 2 for Structure-adaptive manifold estimation

Figure 3 for Structure-adaptive manifold estimation

Figure 4 for Structure-adaptive manifold estimation

Abstract:We consider a problem of manifold estimation from noisy observations. We suggest a novel adaptive procedure, which simultaneously reconstructs a smooth manifold from the observations and estimates projectors onto the tangent spaces. Many manifold learning procedures locally approximate a manifold by a weighted average over a small neighborhood. However, in the presence of large noise, the assigned weights become so corrupted that the averaged estimate shows very poor performance. We adjust the weights so they capture the manifold structure better. We propose a computationally efficient procedure, which iteratively refines the weights on each step, such that, after several iterations, we obtain the "oracle" weights, so the quality of the final estimates does not suffer even in the presence of relatively large noise. We also provide a theoretical study of the procedure and prove its optimality deriving both new upper and lower bounds for manifold estimation under the Hausdorff loss.

* 40 pages, 4 figures

Via

Access Paper or Ask Questions