Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Cynthia Rush

A theoretical framework for M-posteriors: frequentist guarantees and robustness properties

Oct 01, 2025

Juraj Marusic, Marco Avella Medina, Cynthia Rush

Abstract:We provide a theoretical framework for a wide class of generalized posteriors that can be viewed as the natural Bayesian posterior counterpart of the class of M-estimators in the frequentist world. We call the members of this class M-posteriors and show that they are asymptotically normally distributed under mild conditions on the M-estimation loss and the prior. In particular, an M-posterior contracts in probability around a normal distribution centered at an M-estimator, showing frequentist consistency and suggesting some degree of robustness depending on the reference M-estimator. We formalize the robustness properties of the M-posteriors by a new characterization of the posterior influence function and a novel definition of breakdown point adapted for posterior distributions. We illustrate the wide applicability of our theory in various popular models and illustrate their empirical relevance in some numerical examples.

Via

Access Paper or Ask Questions

Generalized Linear Models with 1-Bit Measurements: Asymptotics of the Maximum Likelihood Estimator

Jan 09, 2025

Jaimin Shah, Martina Cardone, Cynthia Rush, Alex Dytso

Figure 1 for Generalized Linear Models with 1-Bit Measurements: Asymptotics of the Maximum Likelihood Estimator

Abstract:This work establishes regularity conditions for consistency and asymptotic normality of the multiple parameter maximum likelihood estimator(MLE) from censored data, where the censoring mechanism is in the form of $1$-bit measurements. The underlying distribution of the uncensored data is assumed to belong to the exponential family, with natural parameters expressed as a linear combination of the predictors, known as generalized linear model (GLM). As part of the analysis, the Fisher information matrix is also derived for both censored and uncensored data, which helps to quantify the impact of censoring and assess the performance of the MLE. The choice of GLM allows one to consider a variety of practical examples where 1-bit estimation is of interest. In particular, it is shown how the derived results can be used to analyze two practically relevant scenarios: the Gaussian model with both unknown mean and variance, and the Poisson model with an unknown mean.

* ICASSP 2025

Via

Access Paper or Ask Questions

Max-sliced Wasserstein concentration and uniform ratio bounds of empirical measures on RKHS

May 21, 2024

Ruiyu Han, Cynthia Rush, Johannes Wiesel

Abstract:Optimal transport and the Wasserstein distance $\mathcal{W}_p$ have recently seen a number of applications in the fields of statistics, machine learning, data science, and the physical sciences. These applications are however severely restricted by the curse of dimensionality, meaning that the number of data points needed to estimate these problems accurately increases exponentially in the dimension. To alleviate this problem, a number of variants of $\mathcal{W}_p$ have been introduced. We focus here on one of these variants, namely the max-sliced Wasserstein metric $\overline{\mathcal{W}}_p$. This metric reduces the high-dimensional minimization problem given by $\mathcal{W}_p$ to a maximum of one-dimensional measurements in an effort to overcome the curse of dimensionality. In this note we derive concentration results and upper bounds on the expectation of $\overline{\mathcal{W}}_p$ between the true and empirical measure on unbounded reproducing kernel Hilbert spaces. We show that, under quite generic assumptions, probability measures concentrate uniformly fast in one-dimensional subspaces, at (nearly) parametric rates. Our results rely on an improvement of currently known bounds for $\overline{\mathcal{W}}_p$ in the finite-dimensional case.

Via

Access Paper or Ask Questions

Is it easier to count communities than find them?

Dec 21, 2022

Cynthia Rush, Fiona Skerman, Alexander S. Wein, Dana Yang

Figure 1 for Is it easier to count communities than find them?

Abstract:Random graph models with community structure have been studied extensively in the literature. For both the problems of detecting and recovering community structure, an interesting landscape of statistical and computational phase transitions has emerged. A natural unanswered question is: might it be possible to infer properties of the community structure (for instance, the number and sizes of communities) even in situations where actually finding those communities is believed to be computationally hard? We show the answer is no. In particular, we consider certain hypothesis testing problems between models with different community structures, and we show (in the low-degree polynomial framework) that testing between two options is as hard as finding the communities. In addition, our methods give the first computational lower bounds for testing between two different `planted' distributions, whereas previous results have considered testing between a planted distribution and an i.i.d. `null' distribution.

* Accepted to Innovations in Theoretical Computer Science (ITCS) 2023

Via

Access Paper or Ask Questions

Entropic CLT for Order Statistics

May 10, 2022

Martina Cardone, Alex Dytso, Cynthia Rush

Abstract:It is well known that central order statistics exhibit a central limit behavior and converge to a Gaussian distribution as the sample size grows. This paper strengthens this known result by establishing an entropic version of the CLT that ensures a stronger mode of convergence using the relative entropy. In particular, an order $O(1/\sqrt{n})$ rate of convergence is established under mild conditions on the parent distribution of the sample generating the order statistics. To prove this result, ancillary results on order statistics are derived, which might be of independent interest.

* Accepted to the 2022 IEEE International Symposium on Information Theory (ISIT)

Via

Access Paper or Ask Questions

Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

May 27, 2021

Zhiqi Bu, Jason Klusowski, Cynthia Rush, Weijie J. Su

Figure 1 for Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

Figure 2 for Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

Figure 3 for Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

Figure 4 for Characterizing the SLOPE Trade-off: A Variational Perspective and the Donoho-Tanner Limit

Abstract:Sorted l1 regularization has been incorporated into many methods for solving high-dimensional statistical estimation problems, including the SLOPE estimator in linear regression. In this paper, we study how this relatively new regularization technique improves variable selection by characterizing the optimal SLOPE trade-off between the false discovery proportion (FDP) and true positive proportion (TPP) or, equivalently, between measures of type I error and power. Assuming a regime of linear sparsity and working under Gaussian random designs, we obtain an upper bound on the optimal trade-off for SLOPE, showing its capability of breaking the Donoho-Tanner power limit. To put it into perspective, this limit is the highest possible power that the Lasso, which is perhaps the most popular l1-based method, can achieve even with arbitrarily strong effect sizes. Next, we derive a tight lower bound that delineates the fundamental limit of sorted l1 regularization in optimally trading the FDP off for the TPP. Finally, we show that on any problem instance, SLOPE with a certain regularization sequence outperforms the Lasso, in the sense of having a smaller FDP, larger TPP and smaller l2 estimation risk simultaneously. Our proofs are based on a novel technique that reduces a variational calculus problem to a class of infinite-dimensional convex optimization problems and a very recent result from approximate message passing theory.

Via

Access Paper or Ask Questions

A unifying tutorial on Approximate Message Passing

May 05, 2021

Oliver Y. Feng, Ramji Venkataramanan, Cynthia Rush, Richard J. Samworth

Figure 1 for A unifying tutorial on Approximate Message Passing

Figure 2 for A unifying tutorial on Approximate Message Passing

Abstract:Over the last decade or so, Approximate Message Passing (AMP) algorithms have become extremely popular in various structured high-dimensional statistical problems. The fact that the origins of these techniques can be traced back to notions of belief propagation in the statistical physics literature lends a certain mystique to the area for many statisticians. Our goal in this work is to present the main ideas of AMP from a statistical perspective, to illustrate the power and flexibility of the AMP framework. Along the way, we strengthen and unify many of the results in the existing literature.

* 99 pages, 2 figures

Via

Access Paper or Ask Questions

On the Robustness to Misspecification of $α$-Posteriors and Their Variational Approximations

Apr 16, 2021

Marco Avella Medina, José Luis Montiel Olea, Cynthia Rush, Amilcar Velez

Abstract:$\alpha$-posteriors and their variational approximations distort standard posterior inference by downweighting the likelihood and introducing variational approximation errors. We show that such distortions, if tuned appropriately, reduce the Kullback-Leibler (KL) divergence from the true, but perhaps infeasible, posterior distribution when there is potential parametric model misspecification. To make this point, we derive a Bernstein-von Mises theorem showing convergence in total variation distance of $\alpha$-posteriors and their variational approximations to limiting Gaussian distributions. We use these distributions to evaluate the KL divergence between true and reported posteriors. We show this divergence is minimized by choosing $\alpha$ strictly smaller than one, assuming there is a vanishingly small probability of model misspecification. The optimized value becomes smaller as the the misspecification becomes more severe. The optimized KL divergence increases logarithmically in the degree of misspecification and not linearly as with the usual posterior.

Via

Access Paper or Ask Questions

All-or-nothing statistical and computational phase transitions in sparse spiked matrix estimation

Jun 14, 2020

Jean Barbier, Nicolas Macris, Cynthia Rush

Figure 1 for All-or-nothing statistical and computational phase transitions in sparse spiked matrix estimation

Figure 2 for All-or-nothing statistical and computational phase transitions in sparse spiked matrix estimation

Abstract:We determine statistical and computational limits for estimation of a rank-one matrix (the spike) corrupted by an additive gaussian noise matrix, in a sparse limit, where the underlying hidden vector (that constructs the rank-one matrix) has a number of non-zero components that scales sub-linearly with the total dimension of the vector, and the signal-to-noise ratio tends to infinity at an appropriate speed. We prove explicit low-dimensional variational formulas for the asymptotic mutual information between the spike and the observed noisy matrix and analyze the approximate message passing algorithm in the sparse regime. For Bernoulli and Bernoulli-Rademacher distributed vectors, and when the sparsity and signal strength satisfy an appropriate scaling relation, we find all-or-nothing phase transitions for the asymptotic minimum and algorithmic mean-square errors. These jump from their maximum possible value to zero, at well defined signal-to-noise thresholds whose asymptotic values we determine exactly. In the asymptotic regime the statistical-to-algorithmic gap diverges indicating that sparse recovery is hard for approximate message passing.

* Part of this work (in particular the proof of Theorem 1) is already present in reference arXiv:1911.05030

Via

Access Paper or Ask Questions

Rigorous State Evolution Analysis for Approximate Message Passing with Side Information

Mar 25, 2020

Hangjin Liu, Cynthia Rush, Dror Baron

Figure 1 for Rigorous State Evolution Analysis for Approximate Message Passing with Side Information

Figure 2 for Rigorous State Evolution Analysis for Approximate Message Passing with Side Information

Figure 3 for Rigorous State Evolution Analysis for Approximate Message Passing with Side Information

Abstract:A common goal in many research areas is to reconstruct an unknown signal x from noisy linear measurements. Approximate message passing (AMP) is a class of low-complexity algorithms that can be used for efficiently solving such high-dimensional regression tasks. Often, it is the case that side information (SI) is available during reconstruction. For this reason, a novel algorithmic framework that incorporates SI into AMP, referred to as approximate message passing with side information (AMP-SI), has been recently introduced. In this work, we provide rigorous performance guarantees for AMP-SI when there are statistical dependencies between the signal and SI pairs and the entries of the measurement matrix are independent and identically distributed Gaussian. The AMP-SI performance is shown to be provably tracked by a scalar iteration referred to as state evolution. Moreover, we provide numerical examples that demonstrate empirically that the SE can predict the AMP-SI mean square error accurately.

* arXiv admin note: text overlap with arXiv:1902.00150

Via

Access Paper or Ask Questions