Gregor Ehrensperger

Evaluating CNNs on the Gestalt Principle of Closure

Mar 30, 2019

Gregor Ehrensperger, Sebastian Stabinger, Antonio Rodríguez Sánchez

Abstract: Deep convolutional neural networks (CNNs) are widely known for their outstanding performance in classification and regression tasks over high-dimensional data. This has made them a popular and powerful tool for a large variety of applications in industry and academia. Recent publications show that classification tasks that seem easy to humans can be very challenging for state-of-the-art CNNs. The Gestalt principles are an attempt to describe how humans perceive visual elements. In this paper we evaluate how well AlexNet and GoogLeNet classify the correctness of the well-known Kanizsa triangles, which rely heavily on the Gestalt principle of closure. To this end, we created various datasets containing valid as well as invalid variants of the Kanizsa triangle. Our findings suggest that perceiving objects by utilizing the principle of closure is very challenging for the applied network architectures, but that they appear to adapt to the effect of closure.

Fast algorithms for morphological operations using run-length encoded binary images

Apr 04, 2015

Gregor Ehrensperger, Alexander Ostermann, Felix Schwitzer

Abstract: This paper presents innovative algorithms to efficiently compute erosions and dilations of run-length encoded (RLE) binary images with arbitrarily shaped structuring elements. An RLE image is given by a set of runs, where a run is a horizontal concatenation of foreground pixels. The proposed algorithms extract the skeleton of the structuring element and build distance tables of the input image, which store the distance to the next background pixel on the left and right hand sides. This information is then used to speed up the calculation of the erosion and dilation operators by enabling techniques that skip the analysis of certain pixels whenever a hit or miss occurs. Additionally, the input image is trimmed during the preprocessing steps on the basis of two primitive criteria. Experimental results show the advantages over other algorithms. The source code of our algorithms is available in C++.

* 17 pages, 2 figures. Submitted to Elsevier (Pattern Recognition). For the associated source code, see https://numerical-analysis.uibk.ac.at/g.ehrensperger
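
To make the terms in the abstract above concrete, here is a minimal Python sketch of a run-length encoded binary image, the left/right distance tables, and an erosion. It is an illustration under stated assumptions, not the authors' algorithm: the names `rle_encode`, `rle_decode`, `distance_tables` and `erode_horizontal` are hypothetical, pixels outside the image are assumed to be background, and the erosion handles only a centered horizontal line of length 2k+1 rather than an arbitrarily shaped structuring element.

```python
from typing import List, Tuple

# A run is (row, first_col, last_col) with both columns inclusive.
# This layout is an assumption made for the sketch.
Run = Tuple[int, int, int]


def rle_encode(image: List[List[int]]) -> List[Run]:
    """Collect the horizontal runs of foreground (non-zero) pixels."""
    runs: List[Run] = []
    for r, row in enumerate(image):
        start = None
        for c, value in enumerate(row):
            if value and start is None:
                start = c                      # a run begins
            elif not value and start is not None:
                runs.append((r, start, c - 1))  # a run ends before c
                start = None
        if start is not None:                   # run touches the right border
            runs.append((r, start, len(row) - 1))
    return runs


def rle_decode(runs: List[Run], height: int, width: int) -> List[List[int]]:
    """Expand runs back into a dense 0/1 image."""
    image = [[0] * width for _ in range(height)]
    for r, first, last in runs:
        for c in range(first, last + 1):
            image[r][c] = 1
    return image


def distance_tables(runs: List[Run], height: int, width: int):
    """For each foreground pixel, the distance to the next background
    pixel on its left and on its right (0 for background pixels)."""
    left = [[0] * width for _ in range(height)]
    right = [[0] * width for _ in range(height)]
    for r, first, last in runs:
        for c in range(first, last + 1):
            left[r][c] = c - first + 1
            right[r][c] = last - c + 1
    return left, right


def erode_horizontal(runs: List[Run], k: int) -> List[Run]:
    """Erode by a centered horizontal line of length 2k+1.
    Each run shrinks by k on both sides; runs that become empty vanish."""
    return [(r, first + k, last - k)
            for r, first, last in runs
            if first + k <= last - k]
```

In this sketch a pixel survives the erosion exactly when both of its distance-table entries exceed k, which is why the operation can be carried out on the runs alone without visiting individual pixels.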
