Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Apr 05, 2024

Mohammed Ghaith Altarabichi, Sławomir Nowaczyk, Sepideh Pashami, Peyman Sheikholharam Mashhadi, Julia Handl

Figure 1 for Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Figure 2 for Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Figure 3 for Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Figure 4 for Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Share this with someone who'll enjoy it:

Abstract:This paper investigates how various randomization techniques impact Deep Neural Networks (DNNs). Randomization, like weight noise and dropout, aids in reducing overfitting and enhancing generalization, but their interactions are poorly understood. The study categorizes randomness techniques into four types and proposes new methods: adding noise to the loss function and random masking of gradient updates. Using Particle Swarm Optimizer (PSO) for hyperparameter optimization, it explores optimal configurations across MNIST, FASHION-MNIST, CIFAR10, and CIFAR100 datasets. Over 30,000 configurations are evaluated, revealing data augmentation and weight initialization randomness as main performance contributors. Correlation analysis shows different optimizers prefer distinct randomization types. The complete implementation and dataset are available on GitHub.

* Information Sciences, p.120500 (2024)

View paper on

Share this with someone who'll enjoy it:

Title:Rolling the dice for better deep learning performance: A study of randomness techniques in deep neural networks

Paper and Code