Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Gabriel Fricout

A Fast Learning Algorithm for Image Segmentation with Max-Pooling Convolutional Networks

Feb 07, 2013

Jonathan Masci, Alessandro Giusti, Dan Cireşan, Gabriel Fricout, Jürgen Schmidhuber

Figure 1 for A Fast Learning Algorithm for Image Segmentation with Max-Pooling Convolutional Networks

Figure 2 for A Fast Learning Algorithm for Image Segmentation with Max-Pooling Convolutional Networks

Figure 3 for A Fast Learning Algorithm for Image Segmentation with Max-Pooling Convolutional Networks

Figure 4 for A Fast Learning Algorithm for Image Segmentation with Max-Pooling Convolutional Networks

Abstract:We present a fast algorithm for training MaxPooling Convolutional Networks to segment images. This type of network yields record-breaking performance in a variety of tasks, but is normally trained on a computationally expensive patch-by-patch basis. Our new method processes each training image in a single pass, which is vastly more efficient. We validate the approach in different scenarios and report a 1500-fold speed-up. In an application to automated steel defect detection and segmentation, we obtain excellent performance with short training times.

Via

Access Paper or Ask Questions

Object Recognition with Multi-Scale Pyramidal Pooling Networks

Jul 07, 2012

Jonathan Masci, Ueli Meier, Gabriel Fricout, Jürgen Schmidhuber

Figure 1 for Object Recognition with Multi-Scale Pyramidal Pooling Networks

Figure 2 for Object Recognition with Multi-Scale Pyramidal Pooling Networks

Figure 3 for Object Recognition with Multi-Scale Pyramidal Pooling Networks

Figure 4 for Object Recognition with Multi-Scale Pyramidal Pooling Networks

Abstract:We present a Multi-Scale Pyramidal Pooling Network, featuring a novel pyramidal pooling layer at multiple scales and a novel encoding layer. Thanks to the former the network does not require all images of a given classification task to be of equal size. The encoding layer improves generalisation performance in comparison to similar neural network architectures, especially when training data is scarce. We evaluate and compare our system to convolutional neural networks and state-of-the-art computer vision methods on various benchmark datasets. We also present results on industrial steel defect classification, where existing architectures are not applicable because of the constraint on equally sized input images. The proposed architecture can be seen as a fully supervised hierarchical bag-of-features extension that is trained online and can be fine-tuned for any given task.

Via

Access Paper or Ask Questions