Title: High-Dimensional Smoothed Entropy Estimation via Dimensionality Reduction

May 11, 2023

Kristjan Greenewald, Brian Kingsbury, Yuancheng Yu

Abstract: We study the problem of overcoming exponential sample complexity in differential entropy estimation under Gaussian convolutions. Specifically, we consider the estimation of the differential entropy $h(X+Z)$ via $n$ independently and identically distributed samples of $X$, where $X$ and $Z$ are independent $D$-dimensional random variables, $X$ is sub-Gaussian with bounded second moment, and $Z\sim\mathcal{N}(0,\sigma^2I_D)$. Under the absolute-error loss, the above problem has a parametric estimation rate of $\frac{c^D}{\sqrt{n}}$, which is exponential in the data dimension $D$ and often problematic for applications. We overcome this exponential sample complexity by projecting $X$ to a low-dimensional space via principal component analysis (PCA) before the entropy estimation, and show that the asymptotic error overhead vanishes as the unexplained variance of the PCA vanishes. This implies near-optimal performance for inherently low-dimensional structures embedded in high-dimensional spaces, including hidden-layer outputs of deep neural networks (DNNs), which can be used to estimate mutual information (MI) in DNNs. We provide numerical results verifying the performance of our PCA approach on Gaussian and spiral data. We also apply our method to the analysis of information flow through neural network layers (cf. information bottleneck), with results measuring mutual information in a noisy fully connected network and a noisy convolutional neural network (CNN) for MNIST classification.
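
The two-stage idea in the abstract (PCA projection, then smoothed entropy estimation in the reduced space) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not say which entropy estimator is applied after the projection, so the sketch assumes a plug-in estimator that computes the entropy of the Gaussian mixture centered at the projected samples by Monte Carlo integration. The function names `pca_project` and `smoothed_entropy_plugin` and the parameters `n_mc` and `seed` are hypothetical.

```python
import numpy as np


def pca_project(x, d):
    """Project the rows of x (shape (n, D)) onto the top-d principal components.

    Returns the projected samples (n, d) and the fraction of variance explained.
    """
    xc = x - x.mean(axis=0)
    # Rows of vt are the principal directions, ordered by singular value.
    _, s, vt = np.linalg.svd(xc, full_matrices=False)
    explained = float((s[:d] ** 2).sum() / (s ** 2).sum())
    return xc @ vt[:d].T, explained


def smoothed_entropy_plugin(y, sigma, n_mc=2000, seed=0):
    """Monte Carlo estimate (in nats) of the differential entropy of the
    Gaussian mixture (1/n) * sum_i N(y_i, sigma^2 I).

    Assumption: this plug-in estimator stands in for whatever estimator
    the paper applies after the PCA step.
    """
    rng = np.random.default_rng(seed)
    n, d = y.shape
    # Draw points from the mixture: pick a center, then add Gaussian noise.
    idx = rng.integers(0, n, size=n_mc)
    pts = y[idx] + sigma * rng.standard_normal((n_mc, d))
    # Squared distances between every drawn point and every center: (n_mc, n).
    sq = ((pts[:, None, :] - y[None, :, :]) ** 2).sum(axis=2)
    a = -sq / (2.0 * sigma ** 2)
    # Numerically stable log-sum-exp over the mixture components.
    a_max = a.max(axis=1, keepdims=True)
    lse = a_max[:, 0] + np.log(np.exp(a - a_max).sum(axis=1))
    log_q = lse - np.log(n) - 0.5 * d * np.log(2.0 * np.pi * sigma ** 2)
    # Entropy is the negative expected log-density under the mixture.
    return float(-log_q.mean())


# Illustrative usage on synthetic data: a 2-dimensional Gaussian embedded in D = 20.
rng = np.random.default_rng(0)
basis, _ = np.linalg.qr(rng.standard_normal((20, 2)))  # orthonormal columns
x = rng.standard_normal((500, 2)) @ basis.T
y, explained = pca_project(x, d=2)
h_low = smoothed_entropy_plugin(y, sigma=1.0, n_mc=1000)
print(f"explained variance: {explained:.4f}, low-dim entropy estimate: {h_low:.3f} nats")
```

In the idealized case where $X$ lies exactly in a $d$-dimensional subspace, the noise in the $D-d$ orthogonal directions is independent of the projected part, so $h(X+Z)$ equals the low-dimensional smoothed entropy plus $\frac{D-d}{2}\log(2\pi e\sigma^2)$. How the paper treats the discarded directions when the unexplained variance is nonzero is not stated in the abstract.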

* To appear in ISIT 2023
