Maliha Arif

Background Invariant Classification on Infrared Imagery by Data Efficient Training and Reducing Bias in CNNs

Feb 09, 2022

Maliha Arif, Calvin Yong, Abhijit Mahalanobis

Abstract: Although convolutional neural networks can classify objects in images very accurately, it is well known that the network's attention is not always on the semantically important regions of the scene. Networks often learn background textures that are irrelevant to the object of interest, which makes them susceptible to background variations that degrade their performance. We propose a new two-step training procedure, called split training, to reduce this bias in CNNs on both infrared imagery and RGB data. First, using MSE loss, we train the layers of the network on images with background to match the activations of the same network when it is trained on images without background. Then, with these layers frozen, we train the rest of the network with cross-entropy loss to classify the objects. Our training method outperforms the traditional training procedure on both a simple CNN architecture and deep CNNs such as VGG and DenseNet, which use substantial hardware resources. It achieves higher accuracy and learns to mimic human vision, which focuses more on shape and structure than on background.
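The two steps described in the abstract can be illustrated with a deliberately tiny toy model. This is a minimal sketch, not the authors' implementation: the "network" is a single linear feature unit followed by a logistic head, each "image" is a hypothetical pair `[object_signal, background]`, and all names (`train_reference`, `match_activations`, `train_head`), the data, and the hyperparameters are assumptions made for illustration only.

```python
import math
import random

random.seed(0)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def features(w, x):
    # Stand-in for the early layers: one linear unit over [object_signal, background].
    return w[0] * x[0] + w[1] * x[1]

# Hypothetical paired data: the same object with and without background clutter.
clean, with_bg, labels = [], [], []
for i in range(40):
    y = i % 2
    signal = (1.0 if y else -1.0) + random.uniform(-0.1, 0.1)
    clean.append([signal, 0.0])                          # background removed
    with_bg.append([signal, random.uniform(-2.0, 2.0)])  # background present
    labels.append(y)

def train_reference(xs, ys, epochs=200, lr=0.1):
    # Reference network: features and head trained jointly on background-free data.
    w, a, b = [0.1, 0.0], 0.1, 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            h = features(w, x)
            g = sigmoid(a * h + b) - y  # gradient of cross-entropy w.r.t. the logit
            w = [w[0] - lr * g * a * x[0], w[1] - lr * g * a * x[1]]
            a, b = a - lr * g * h, b - lr * g
    return w

def match_activations(w_ref, xs_bg, xs_clean, epochs=200, lr=0.05):
    # Step 1: MSE loss pulls activations on background images toward the
    # reference activations on the matching background-free images.
    w = [0.0, 0.0]
    for _ in range(epochs):
        for xb, xc in zip(xs_bg, xs_clean):
            err = features(w, xb) - features(w_ref, xc)
            w = [w[0] - lr * 2 * err * xb[0], w[1] - lr * 2 * err * xb[1]]
    return w

def train_head(w, xs, ys, epochs=200, lr=0.1):
    # Step 2: w is frozen; only the classifier head is trained with cross-entropy.
    a, b = 0.0, 0.0
    for _ in range(epochs):
        for x, y in zip(xs, ys):
            h = features(w, x)
            g = sigmoid(a * h + b) - y
            a, b = a - lr * g * h, b - lr * g
    return a, b

w_ref = train_reference(clean, labels)
w_split = match_activations(w_ref, with_bg, clean)
a, b = train_head(w_split, with_bg, labels)
preds = [1 if sigmoid(a * features(w_split, x) + b) > 0.5 else 0 for x in with_bg]
accuracy = sum(p == y for p, y in zip(preds, labels)) / len(labels)
print("background weight after step 1:", w_split[1])
print("accuracy on images with background:", accuracy)
```

In this toy setting, step 1 drives the weight on the background input toward zero because the reference activations do not depend on the background. In a real CNN the same idea would apply to intermediate feature maps rather than a single scalar unit.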

* Accepted at AAAI-22 Workshop

Multiple View Generation and Classification of Mid-wave Infrared Images using Deep Learning

Aug 18, 2020

Maliha Arif, Abhijit Mahalanobis

Abstract: We propose a novel study of generating unseen arbitrary viewpoints for infrared imagery in the non-linear feature subspace. Current methods use synthetic images and often produce blurry and distorted outputs. Our approach, by contrast, understands the semantic information in natural images and encapsulates it so that our predicted unseen views possess good 3D representations. We further explore the non-linear feature subspace and conclude that our network operates not in the Euclidean subspace but in the Riemannian subspace. It does not learn the geometric transformation for predicting the position of each pixel in the new image; rather, it learns the manifold. To this end, we use t-SNE visualisations to conduct a detailed analysis of our network and perform classification of the generated images as a low-shot learning task.

* 5 pages, 5 figures, to be submitted to a journal
