Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Moein Hasani

An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks

Dec 22, 2019

Moein Hasani, Amin Nasim Saravi, Hassan Khotanlou

Figure 1 for An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks

Figure 2 for An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks

Figure 3 for An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks

Figure 4 for An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks

Abstract:Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in "Sabour et al" and in a more recent paper "Matrix Capsules with EM Routing" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-theart results on the smallNORB dataset, which includes samples with various view points. Also, this architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can't perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.

Via

Access Paper or Ask Questions

An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks

Dec 12, 2019

Moein Hasani, Hassan Khotanlou

Figure 1 for An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks

Figure 2 for An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks

Figure 3 for An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks

Figure 4 for An Empirical Study on Position of the Batch Normalization Layer in Convolutional Neural Networks

Abstract:In this paper, we have studied how the training of the convolutional neural networks (CNNs) can be affected by changing the position of the batch normalization (BN) layer. Three different convolutional neural networks have been chosen for our experiments. These networks are AlexNet, VGG-16, and ResNet- 20. We show that the speed up in training provided by the BN algorithm can be improved by using other positions for the BN layer than the one suggested by its original paper. Also, we discuss how the BN layer in a certain position can aid the training of one network but not the other. Three different positions for the BN layer have been studied in this research. These positions are: the BN layer between the convolution layer and the non-linear activation function, the BN layer after the non-linear activation function and finally, the BN layer before each of the convolutional layers.

Via

Access Paper or Ask Questions