Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sui Yang Khoo

POGD: Gradient Descent with New Stochastic Rules

Oct 15, 2022

Feihu Han, Sida Xing, Sui Yang Khoo

Figure 1 for POGD: Gradient Descent with New Stochastic Rules

Figure 2 for POGD: Gradient Descent with New Stochastic Rules

Figure 3 for POGD: Gradient Descent with New Stochastic Rules

Figure 4 for POGD: Gradient Descent with New Stochastic Rules

Abstract:There introduce Particle Optimized Gradient Descent (POGD), an algorithm based on the gradient descent but integrates the particle swarm optimization (PSO) principle to achieve the iteration. From the experiments, this algorithm has adaptive learning ability. The experiments in this paper mainly focus on the training speed to reach the target value and the ability to prevent the local minimum. The experiments in this paper are achieved by the convolutional neural network (CNN) image classification on the MNIST and cifar-10 datasets.

Via

Access Paper or Ask Questions

Piecewise Linear Units Improve Deep Neural Networks

Aug 22, 2021

Jordan Inturrisi, Sui Yang Khoo, Abbas Kouzani, Riccardo Pagliarella

Figure 1 for Piecewise Linear Units Improve Deep Neural Networks

Figure 2 for Piecewise Linear Units Improve Deep Neural Networks

Figure 3 for Piecewise Linear Units Improve Deep Neural Networks

Figure 4 for Piecewise Linear Units Improve Deep Neural Networks

Abstract:The activation function is at the heart of a deep neural networks nonlinearity; the choice of the function has great impact on the success of training. Currently, many practitioners prefer the Rectified Linear Unit (ReLU) due to its simplicity and reliability, despite its few drawbacks. While most previous functions proposed to supplant ReLU have been hand-designed, recent work on learning the function during training has shown promising results. In this paper we propose an adaptive piecewise linear activation function, the Piecewise Linear Unit (PiLU), which can be learned independently for each dimension of the neural network. We demonstrate how PiLU is a generalised rectifier unit and note its similarities with the Adaptive Piecewise Linear Units, namely adaptive and piecewise linear. Across a distribution of 30 experiments, we show that for the same model architecture, hyperparameters, and pre-processing, PiLU significantly outperforms ReLU: reducing classification error by 18.53% on CIFAR-10 and 13.13% on CIFAR-100, for a minor increase in the number of neurons. Further work should be dedicated to exploring generalised piecewise linear units, as well as verifying these results across other challenging domains and larger problems.

* 13 pages, 6 figures, 5 tables, replaced some figures and wording

Via

Access Paper or Ask Questions