Title: Calibrated Chaos: Variance Between Runs of Neural Network Training is Harmless and Inevitable

Apr 04, 2023

Keller Jordan

Abstract: Typical neural network trainings have substantial variance in test-set performance between repeated runs, impeding hyperparameter comparison and training reproducibility. We present the following results towards understanding this variation. (1) Despite having significant variance on their test-sets, we demonstrate that standard CIFAR-10 and ImageNet trainings have very little variance in their performance on the test-distributions from which those test-sets are sampled, suggesting that variance is less of a practical issue than previously thought. (2) We present a simplifying statistical assumption which closely approximates the structure of the test-set accuracy distribution. (3) We argue that test-set variance is inevitable in the following two senses. First, we show that variance is largely caused by high sensitivity of the training process to initial conditions, rather than by specific sources of randomness like the data order and augmentations. Second, we prove that variance is unavoidable given the observation that ensembles of trained networks are well-calibrated. (4) We conduct preliminary studies of distribution-shift, fine-tuning, data augmentation and learning rate through the lens of variance between runs.
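
To make the idea of run-to-run test-set variance concrete, here is a minimal simulation sketch. It is not taken from the paper. It assumes, purely for illustration, that each test example is classified correctly with a fixed per-example probability, independently across examples and runs; the abstract does not state which simplifying assumption the paper actually uses. The names `simulate_run_accuracies` and `predicted_std`, and the synthetic probabilities, are hypothetical.

```python
import math
import random
import statistics


def simulate_run_accuracies(probs, num_runs, seed=0):
    """Simulate test-set accuracy for several hypothetical training runs.

    Illustrative assumption (not confirmed by the abstract): in each run,
    example i is correct with probability probs[i], independently of
    every other example.
    """
    rng = random.Random(seed)
    accuracies = []
    for _ in range(num_runs):
        num_correct = sum(rng.random() < p for p in probs)
        accuracies.append(num_correct / len(probs))
    return accuracies


def predicted_std(probs):
    """Standard deviation of accuracy implied by the independence assumption.

    A sum of independent Bernoulli(p_i) variables has variance
    sum(p_i * (1 - p_i)); dividing the count by n divides the std by n.
    """
    n = len(probs)
    return math.sqrt(sum(p * (1.0 - p) for p in probs)) / n


# Synthetic per-example probabilities: most examples are always correct,
# the rest are uncertain. These numbers are made up for the demonstration.
_rng = random.Random(123)
probs = [1.0 if _rng.random() < 0.85 else _rng.uniform(0.2, 1.0)
         for _ in range(2000)]

accs = simulate_run_accuracies(probs, num_runs=300)
empirical_std = statistics.pstdev(accs)
theory_std = predicted_std(probs)

print(f"mean accuracy over runs: {statistics.mean(accs):.4f}")
print(f"empirical std over runs: {empirical_std:.5f}")
print(f"std predicted by model:  {theory_std:.5f}")
```

In this toy model every run has the same expected accuracy, so all of the spread between runs comes from per-example randomness on a finite test set. Whether real trainings behave this way is an empirical question that the paper addresses, not something this sketch establishes.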
