
Eduardo Lavin

Dynamical loss functions shape landscape topography and improve learning in artificial neural networks

Oct 14, 2024

Eduardo Lavin, Miguel Ruiz-Garcia


Abstract: Dynamical loss functions are derived from standard loss functions used in supervised classification tasks, but they are modified such that the contribution from each class periodically increases and decreases. These oscillations globally alter the loss landscape without affecting the global minima. In this paper, we demonstrate how to transform cross-entropy and mean squared error into dynamical loss functions. We begin by discussing the impact of increasing the size of the neural network or the learning rate on the learning process. Building on this intuition, we propose several versions of dynamical loss functions and show how they significantly improve validation accuracy for networks of varying sizes. Finally, we explore how the landscape of these dynamical loss functions evolves during training, highlighting the emergence of instabilities that may be linked to edge-of-instability minimization.
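The idea of a per-class contribution that periodically rises and falls can be illustrated with a minimal sketch. This is not the authors' formulation: the cosine schedule, the normalization, and the names `class_weights`, `dynamical_cross_entropy`, `period`, and `amplitude` are all assumptions made here for illustration, since the abstract does not specify the functional form of the oscillation.

```python
import math


def class_weights(step, num_classes, period=100, amplitude=2.0):
    """Hypothetical oscillating class weights (an assumed schedule, not the paper's).

    Each class takes its turn being emphasized once per `period` steps.
    """
    raw = []
    for c in range(num_classes):
        # Phase-shift each class so their peaks are spread evenly over one period.
        phase = 2.0 * math.pi * (step / period - c / num_classes)
        # Raw weight oscillates between 1 and `amplitude`.
        raw.append(1.0 + (amplitude - 1.0) * 0.5 * (1.0 + math.cos(phase)))
    total = sum(raw)
    # Normalize so the weights always sum to num_classes.
    return [num_classes * w / total for w in raw]


def dynamical_cross_entropy(logits, label, step, period=100, amplitude=2.0):
    """Cross-entropy for one sample, scaled by the current weight of its true class."""
    # Numerically stable log-sum-exp.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))
    nll = log_z - logits[label]
    weights = class_weights(step, len(logits), period, amplitude)
    return weights[label] * nll
```

In this sketch, setting `amplitude=1.0` makes every weight equal to 1 and recovers the standard cross-entropy. Because the weights stay strictly positive, any parameter setting that drives every per-sample loss to zero does so at every step, which is one simple way the oscillation can reshape the landscape while leaving such minima in place.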
