Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:TAN without a burn: Scaling Laws of DP-SGD

Oct 07, 2022

Tom Sander, Pierre Stock, Alexandre Sablayrolles

Figure 1 for TAN without a burn: Scaling Laws of DP-SGD

Figure 2 for TAN without a burn: Scaling Laws of DP-SGD

Figure 3 for TAN without a burn: Scaling Laws of DP-SGD

Figure 4 for TAN without a burn: Scaling Laws of DP-SGD

Share this with someone who'll enjoy it:

Abstract:Differentially Private methods for training Deep Neural Networks (DNNs) have progressed recently, in particular with the use of massive batches and aggregated data augmentations for a large number of steps. These techniques require much more compute than their non-private counterparts, shifting the traditional privacy-accuracy trade-off to a privacy-accuracy-compute trade-off and making hyper-parameter search virtually impossible for realistic scenarios. In this work, we decouple privacy analysis and experimental behavior of noisy training to explore the trade-off with minimal computational requirements. We first use the tools of R\'enyi Differential Privacy (RDP) to show that the privacy budget, when not overcharged, only depends on the total amount of noise (TAN) injected throughout training. We then derive scaling laws for training models with DP-SGD to optimize hyper-parameters with more than a 100 reduction in computational budget. We apply the proposed method on CIFAR-10 and ImageNet and, in particular, strongly improve the state-of-the-art on ImageNet with a +9 points gain in accuracy for a privacy budget epsilon=8.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:TAN without a burn: Scaling Laws of DP-SGD

Paper and Code