Title: Scalable DP-SGD: Shuffling vs. Poisson Subsampling

Nov 06, 2024

Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

Abstract: We provide new lower bounds on the privacy guarantee of the multi-epoch Adaptive Batch Linear Queries (ABLQ) mechanism with shuffled batch sampling, demonstrating substantial gaps when compared to Poisson subsampling; prior analysis was limited to a single epoch. Since the privacy analysis of Differentially Private Stochastic Gradient Descent (DP-SGD) is obtained by analyzing the ABLQ mechanism, this calls into serious question the common practice of implementing shuffling-based DP-SGD but reporting privacy parameters as if Poisson subsampling were used. To understand the impact of this gap on the utility of trained machine learning models, we introduce a practical approach to implementing Poisson subsampling at scale using massively parallel computation, and use it to train models efficiently. We compare the utility of models trained with Poisson-subsampling-based DP-SGD against optimistic estimates of the utility obtained with shuffling, derived from our new lower bounds on the privacy guarantee of ABLQ with shuffling.
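To make the two batch samplers named in the abstract concrete, the following is a minimal sketch of how they differ, not the paper's massively parallel implementation. The function names `shuffled_batches` and `poisson_batches`, and the toy sizes, are assumptions chosen for illustration only.

```python
import random


def shuffled_batches(n, batch_size, rng):
    """Shuffle all n indices once, then cut them into consecutive batches.

    Every example appears exactly once per epoch and batch sizes are fixed
    (the last batch may be smaller if batch_size does not divide n).
    """
    order = list(range(n))
    rng.shuffle(order)
    return [order[i:i + batch_size] for i in range(0, n, batch_size)]


def poisson_batches(n, expected_batch_size, num_steps, rng):
    """At each step, include every example independently with probability q.

    Batch sizes are random with mean expected_batch_size, and an example may
    appear in several batches or in none.
    """
    q = expected_batch_size / n  # per-example inclusion probability
    return [[i for i in range(n) if rng.random() < q] for _ in range(num_steps)]


# Hypothetical toy setting: 12 examples, (expected) batch size 4, one epoch.
rng = random.Random(0)
n, b = 12, 4
shuffled = shuffled_batches(n, b, rng)
poisson = poisson_batches(n, b, n // b, rng)
```

Under shuffling the batches partition the dataset, whereas under Poisson subsampling each batch is drawn independently of the others. That structural difference is why privacy parameters computed for one sampler do not automatically carry over to the other.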

* To appear at NeurIPS 2024
