Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Joao P. C. Bertoldo

AUPIMO: Redefining Visual Anomaly Detection Benchmarks with High Speed and Low Tolerance

Jan 03, 2024

Joao P. C. Bertoldo, Dick Ameln, Ashwin Vaidya, Samet Akçay

Abstract:Recent advances in visual anomaly detection research have seen AUROC and AUPRO scores on public benchmark datasets such as MVTec and VisA converge towards perfect recall, giving the impression that these benchmarks are near-solved. However, high AUROC and AUPRO scores do not always reflect qualitative performance, which limits the validity of these metrics in real-world applications. We argue that the artificial ceiling imposed by the lack of an adequate evaluation metric restrains progression of the field, and it is crucial that we revisit the evaluation metrics used to rate our algorithms. In response, we introduce Per-IMage Overlap (PIMO), a novel metric that addresses the shortcomings of AUROC and AUPRO. PIMO retains the recall-based nature of the existing metrics but introduces two distinctions: the assignment of curves (and respective area under the curve) is per-image, and its X-axis relies solely on normal images. Measuring recall per image simplifies instance score indexing and is more robust to noisy annotations. As we show, it also accelerates computation and enables the usage of statistical tests to compare models. By imposing low tolerance for false positives on normal images, PIMO provides an enhanced model validation procedure and highlights performance variations across datasets. Our experiments demonstrate that PIMO offers practical advantages and nuanced performance insights that redefine anomaly detection benchmarks -- notably challenging the perception that MVTec AD and VisA datasets have been solved by contemporary models. Available on GitHub: https://github.com/jpcbertoldo/aupimo.

Via

Access Paper or Ask Questions

Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Jan 23, 2023

Joao P. C. Bertoldo, Santiago Velasco-Forero, Jesus Angulo, Etienne Decencière

Figure 1 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Figure 2 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Figure 3 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Abstract:We propose an incremental improvement to Fully Convolutional Data Description (FCDD), an adaptation of the one-class classification approach from anomaly detection to image anomaly segmentation (a.k.a. anomaly localization). We analyze its original loss function and propose a substitute that better resembles its predecessor, the Hypersphere Classifier (HSC). Both are compared on the MVTec Anomaly Detection Dataset (MVTec-AD) -- training images are flawless objects/textures and the goal is to segment unseen defects -- showing that consistent improvement is achieved by better designing the pixel-wise supervision.

* Submitted to the 2023 IEEE International Conference on Image Processing (ICIP 2023)

Via

Access Paper or Ask Questions

[Reproducibility Report] Explainable Deep One-Class Classification

Jun 06, 2022

Joao P. C. Bertoldo, Etienne Decencière

Figure 1 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 2 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 3 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 4 for [Reproducibility Report] Explainable Deep One-Class Classification

Abstract:Fully Convolutional Data Description (FCDD), an explainable version of the Hypersphere Classifier (HSC), directly addresses image anomaly detection (AD) and pixel-wise AD without any post-hoc explainer methods. The authors claim that FCDD achieves results comparable with the state-of-the-art in sample-wise AD on Fashion-MNIST and CIFAR-10 and exceeds the state-of-the-art on the pixel-wise task on MVTec-AD. We reproduced the main results of the paper using the author's code with minor changes and provide runtime requirements to achieve if (CPU memory, GPU memory, and training time). We propose another analysis methodology using a critical difference diagram, and further investigate the test performance of the model during the training phase.

* Submitted to the ML Reproducibility Challenge 2021 Fall

Via

Access Paper or Ask Questions