Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marcel Ackermann

No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection

Mar 19, 2022

Mohamed Yousef, Marcel Ackermann, Unmesh Kurup, Tom Bishop

Figure 1 for No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection

Figure 2 for No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection

Figure 3 for No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection

Figure 4 for No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection

Abstract:Unsupervised Anomaly detection (AD) requires building a notion of normalcy, distinguishing in-distribution (ID) and out-of-distribution (OOD) data, using only available ID samples. Recently, large gains were made on this task for the domain of natural images using self-supervised contrastive feature learning as a first step followed by kNN or traditional one-class classifiers for feature scoring. Learned representations that are non-uniformly distributed on the unit hypersphere have been shown to be beneficial for this task. We go a step further and investigate how the \emph {geometrical compactness} of the ID feature distribution makes isolating and detecting outliers easier, especially in the realistic situation when ID training data is polluted (i.e. ID data contains some OOD data that is used for learning the feature extractor parameters). We propose novel architectural modifications to the self-supervised feature learning step, that enable such compact distributions for ID data to be learned. We show that the proposed modifications can be effectively applied to most existing self-supervised objectives, with large gains in performance. Furthermore, this improved OOD performance is obtained without resorting to tricks such as using strongly augmented ID images (e.g. by 90 degree rotations) as proxies for the unseen OOD data, as these impose overly prescriptive assumptions about ID data and its invariances. We perform extensive studies on benchmark datasets for one-class OOD detection and show state-of-the-art performance in the presence of pollution in the ID data, and comparable performance otherwise. We also propose and extensively evaluate a novel feature scoring technique based on the angular Mahalanobis distance, and propose a simple and novel technique for feature ensembling during evaluation that enables a big boost in performance at nearly zero run-time cost.

Via

Access Paper or Ask Questions

Explanations can be manipulated and geometry is to blame

Jun 19, 2019

Ann-Kathrin Dombrowski, Maximilian Alber, Christopher J. Anders, Marcel Ackermann, Klaus-Robert Müller, Pan Kessel

Figure 1 for Explanations can be manipulated and geometry is to blame

Figure 2 for Explanations can be manipulated and geometry is to blame

Figure 3 for Explanations can be manipulated and geometry is to blame

Figure 4 for Explanations can be manipulated and geometry is to blame

Abstract:Explanation methods aim to make neural networks more trustworthy and interpretable. In this paper, we demonstrate a property of explanation methods which is disconcerting for both of these purposes. Namely, we show that explanations can be manipulated arbitrarily by applying visually hardly perceptible perturbations to the input that keep the network's output approximately constant. We establish theoretically that this phenomenon can be related to certain geometrical properties of neural networks. This allows us to derive an upper bound on the susceptibility of explanations to manipulations. Based on this result, we propose effective mechanisms to enhance the robustness of explanations.

Via

Access Paper or Ask Questions

Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Jul 09, 2018

Sören Becker, Marcel Ackermann, Sebastian Lapuschkin, Klaus-Robert Müller, Wojciech Samek

Figure 1 for Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Figure 2 for Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Figure 3 for Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Figure 4 for Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Abstract:Interpretability of deep neural networks is a recently emerging area of machine learning research targeting a better understanding of how models perform feature selection and derive their classification decisions. In this paper, two neural network architectures are trained on spectrogram and raw waveform data for audio classification tasks on a newly created audio dataset and layer-wise relevance propagation (LRP), a previously proposed interpretability method, is applied to investigate the models' feature selection and decision making. It is demonstrated that the networks are highly reliant on feature marked as relevant by LRP through systematic manipulation of the input data. Our results show that by making deep audio classifiers interpretable, one can analyze and compare the properties and strategies of different models beyond classification accuracy, which potentially opens up new ways for model improvements.

* 6 pages, 4 figures

Via

Access Paper or Ask Questions