Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haibing Wu

Max-Pooling Dropout for Regularization of Convolutional Neural Networks

Dec 04, 2015

Haibing Wu, Xiaodong Gu

Figure 1 for Max-Pooling Dropout for Regularization of Convolutional Neural Networks

Figure 2 for Max-Pooling Dropout for Regularization of Convolutional Neural Networks

Figure 3 for Max-Pooling Dropout for Regularization of Convolutional Neural Networks

Figure 4 for Max-Pooling Dropout for Regularization of Convolutional Neural Networks

* The journal version of this paper [arXiv:1512.00242] has been published in Neural Networks, http://www.sciencedirect.com/science/article/pii/S0893608015001446

Via

Access Paper or Ask Questions

Towards Dropout Training for Convolutional Neural Networks

Dec 01, 2015

Haibing Wu, Xiaodong Gu

Figure 1 for Towards Dropout Training for Convolutional Neural Networks

Figure 2 for Towards Dropout Training for Convolutional Neural Networks

Figure 3 for Towards Dropout Training for Convolutional Neural Networks

Figure 4 for Towards Dropout Training for Convolutional Neural Networks

Abstract:Recently, dropout has seen increasing use in deep learning. For deep convolutional neural networks, dropout is known to work well in fully-connected layers. However, its effect in convolutional and pooling layers is still not clear. This paper demonstrates that max-pooling dropout is equivalent to randomly picking activation based on a multinomial distribution at training time. In light of this insight, we advocate employing our proposed probabilistic weighted pooling, instead of commonly used max-pooling, to act as model averaging at test time. Empirical evidence validates the superiority of probabilistic weighted pooling. We also empirically show that the effect of convolutional dropout is not trivial, despite the dramatically reduced possibility of over-fitting due to the convolutional architecture. Elaborately designing dropout training simultaneously in max-pooling and fully-connected layers, we achieve state-of-the-art performance on MNIST, and very competitive results on CIFAR-10 and CIFAR-100, relative to other approaches without data augmentation. Finally, we compare max-pooling dropout and stochastic pooling, both of which introduce stochasticity based on multinomial distributions at pooling stage.

* Neural Networks 71: 1-10 (2015)
* This paper has been published in Neural Networks, http://www.sciencedirect.com/science/article/pii/S0893608015001446

Via

Access Paper or Ask Questions

Aspect-based Opinion Summarization with Convolutional Neural Networks

Nov 30, 2015

Haibing Wu, Yiwei Gu, Shangdi Sun, Xiaodong Gu

Figure 1 for Aspect-based Opinion Summarization with Convolutional Neural Networks

Figure 2 for Aspect-based Opinion Summarization with Convolutional Neural Networks

Figure 3 for Aspect-based Opinion Summarization with Convolutional Neural Networks

Figure 4 for Aspect-based Opinion Summarization with Convolutional Neural Networks

Abstract:This paper considers Aspect-based Opinion Summarization (AOS) of reviews on particular products. To enable real applications, an AOS system needs to address two core subtasks, aspect extraction and sentiment classification. Most existing approaches to aspect extraction, which use linguistic analysis or topic modeling, are general across different products but not precise enough or suitable for particular products. Instead we take a less general but more precise scheme, directly mapping each review sentence into pre-defined aspects. To tackle aspect mapping and sentiment classification, we propose two Convolutional Neural Network (CNN) based methods, cascaded CNN and multitask CNN. Cascaded CNN contains two levels of convolutional networks. Multiple CNNs at level 1 deal with aspect mapping task, and a single CNN at level 2 deals with sentiment classification. Multitask CNN also contains multiple aspect CNNs and a sentiment CNN, but different networks share the same word embeddings. Experimental results indicate that both cascaded and multitask CNNs outperform SVM-based methods by large margins. Multitask CNN generally performs better than cascaded CNN.

Via

Access Paper or Ask Questions