Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Feihu Han

POGD: Gradient Descent with New Stochastic Rules

Oct 15, 2022

Feihu Han, Sida Xing, Sui Yang Khoo

Figure 1 for POGD: Gradient Descent with New Stochastic Rules

Figure 2 for POGD: Gradient Descent with New Stochastic Rules

Figure 3 for POGD: Gradient Descent with New Stochastic Rules

Figure 4 for POGD: Gradient Descent with New Stochastic Rules

Abstract:There introduce Particle Optimized Gradient Descent (POGD), an algorithm based on the gradient descent but integrates the particle swarm optimization (PSO) principle to achieve the iteration. From the experiments, this algorithm has adaptive learning ability. The experiments in this paper mainly focus on the training speed to reach the target value and the ability to prevent the local minimum. The experiments in this paper are achieved by the convolutional neural network (CNN) image classification on the MNIST and cifar-10 datasets.

Via

Access Paper or Ask Questions

Extreme-Long-short Term Memory for Time-series Prediction

Oct 15, 2022

Sida Xing, Feihu Han, Suiyang Khoo

Figure 1 for Extreme-Long-short Term Memory for Time-series Prediction

Figure 2 for Extreme-Long-short Term Memory for Time-series Prediction

Figure 3 for Extreme-Long-short Term Memory for Time-series Prediction

Figure 4 for Extreme-Long-short Term Memory for Time-series Prediction

Abstract:The emergence of Long Short-Term Memory (LSTM) solves the problems of vanishing gradient and exploding gradient in traditional Recurrent Neural Networks (RNN). LSTM, as a new type of RNN, has been widely used in various fields, such as text prediction, Wind Speed Forecast, depression prediction by EEG signals, etc. The results show that improving the efficiency of LSTM can help to improve the efficiency in other application areas. In this paper, we proposed an advanced LSTM algorithm, the Extreme Long Short-Term Memory (E-LSTM), which adds the inverse matrix part of Extreme Learning Machine (ELM) as a new "gate" into the structure of LSTM. This "gate" preprocess a portion of the data and involves the processed data in the cell update of the LSTM to obtain more accurate data with fewer training rounds, thus reducing the overall training time. In this research, the E-LSTM model is used for the text prediction task. Experimental results showed that the E-LSTM sometimes takes longer to perform a single training round, but when tested on a small data set, the new E-LSTM requires only 2 epochs to obtain the results of the 7th epoch traditional LSTM. Therefore, the E-LSTM retains the high accuracy of the traditional LSTM, whilst also improving the training speed and the overall efficiency of the LSTM.

Via

Access Paper or Ask Questions