Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Etienne Decencière

CMM

Euclidean Distance to Convex Polyhedra and Application to Class Representation in Spectral Images

Mar 26, 2025

Antoine Bottenmuller, Florent Magaud, Arnaud Demortière, Etienne Decencière, Petr Dokladal

Abstract:With the aim of estimating the abundance map from observations only, linear unmixing approaches are not always suitable to spectral images, especially when the number of bands is too small or when the spectra of the observed data are too correlated. To address this issue in the general case, we present a novel approach which provides an adapted spatial density function based on any arbitrary linear classifier. A robust mathematical formulation for computing the Euclidean distance to polyhedral sets is presented, along with an efficient algorithm that provides the exact minimum-norm point in a polyhedron. An empirical evaluation on the widely-used Samson hyperspectral dataset demonstrates that the proposed method surpasses state-of-the-art approaches in reconstructing abundance maps. Furthermore, its application to spectral images of a Lithium-ion battery, incompatible with linear unmixing models, validates the method's generality and effectiveness.

* 14th International Conference on Pattern Recognition Applications and Methods, Feb 2025, Porto, France. pp.192-203

Via

Access Paper or Ask Questions

Coupled Laplacian Eigenmaps for Locally-Aware 3D Rigid Point Cloud Matching

Feb 27, 2024

Matteo Bastico, Etienne Decencière, Laurent Corté, Yannick Tillier, David Ryckelynck

Abstract:Point cloud matching, a crucial technique in computer vision, medical and robotics fields, is primarily concerned with finding correspondences between pairs of point clouds or voxels. In some practical scenarios, emphasizing local differences is crucial for accurately identifying a correct match, thereby enhancing the overall robustness and reliability of the matching process. Commonly used shape descriptors have several limitations and often fail to provide meaningful local insights on the paired geometries. In this work, we propose a new technique, based on graph Laplacian eigenmaps, to match point clouds by taking into account fine local structures. To deal with the order and sign ambiguity of Laplacian eigenmaps, we introduce a new operator, called Coupled Laplacian, that allows to easily generate aligned eigenspaces for multiple rigidly-registered geometries. We show that the similarity between those aligned high-dimensional spaces provides a locally meaningful score to match shapes. We initially evaluate the performance of the proposed technique in a point-wise manner, specifically focusing on the task of object anomaly localization using the MVTec 3D-AD dataset. Additionally, we define a new medical task, called automatic Bone Side Estimation (BSE), which we address through a global similarity score derived from coupled eigenspaces. In order to test it, we propose a benchmark collecting bone surface structures from various public datasets. Our matching technique, based on Coupled Laplacian, outperforms other methods by reaching an impressive accuracy on both tasks. The code to reproduce our experiments is publicly available at https://github.com/matteo-bastico/CoupledLaplacian and in the Supplementary Code.

* This paper has been accepted at Computer Vision and Patter Recognition (CVPR) 2024

Via

Access Paper or Ask Questions

A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers

Oct 09, 2023

Matteo Bastico, David Ryckelynck, Laurent Corté, Yannick Tillier, Etienne Decencière

Abstract:When it comes to clinical images, automatic segmentation has a wide variety of applications and a considerable diversity of input domains, such as different types of Magnetic Resonance Images (MRIs) and Computerized Tomography (CT) scans. This heterogeneity is a challenge for cross-modality algorithms that should equally perform independently of the input image type fed to them. Often, segmentation models are trained using a single modality, preventing generalization to other types of input data without resorting to transfer learning techniques. Furthermore, the multi-modal or cross-modality architectures proposed in the literature frequently require registered images, which are not easy to collect in clinical environments, or need additional processing steps, such as synthetic image generation. In this work, we propose a simple framework to achieve fair image segmentation of multiple modalities using a single conditional model that adapts its normalization layers based on the input type, trained with non-registered interleaved mixed data. We show that our framework outperforms other cross-modality segmentation methods, when applied to the same 3D UNet baseline model, on the Multi-Modality Whole Heart Segmentation Challenge. Furthermore, we define the Conditional Vision Transformer (C-ViT) encoder, based on the proposed cross-modality framework, and we show that it brings significant improvements to the resulting segmentation, up to 6.87\% of Dice accuracy, with respect to its baseline reference. The code to reproduce our experiments and the trained model weights are available at https://github.com/matteo-bastico/MI-Seg.

* This paper has been accepted in International Conference on Computer Vision Workshops (ICCVW) 2023

Via

Access Paper or Ask Questions

Heuristic Hyperparameter Choice for Image Anomaly Detection

Jul 20, 2023

Zeyu Jiang, João P. C. Bertoldo, Etienne Decencière

Abstract:Anomaly detection (AD) in images is a fundamental computer vision problem by deep learning neural network to identify images deviating significantly from normality. The deep features extracted from pretrained models have been proved to be essential for AD based on multivariate Gaussian distribution analysis. However, since models are usually pretrained on a large dataset for classification tasks such as ImageNet, they might produce lots of redundant features for AD, which increases computational cost and degrades the performance. We aim to do the dimension reduction of Negated Principal Component Analysis (NPCA) for these features. So we proposed some heuristic to choose hyperparameter of NPCA algorithm for getting as fewer components of features as possible while ensuring a good performance.

Via

Access Paper or Ask Questions

Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Jan 23, 2023

Joao P. C. Bertoldo, Santiago Velasco-Forero, Jesus Angulo, Etienne Decencière

Figure 1 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Figure 2 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Figure 3 for Adapting the Hypersphere Loss Function from Anomaly Detection to Anomaly Segmentation

Abstract:We propose an incremental improvement to Fully Convolutional Data Description (FCDD), an adaptation of the one-class classification approach from anomaly detection to image anomaly segmentation (a.k.a. anomaly localization). We analyze its original loss function and propose a substitute that better resembles its predecessor, the Hypersphere Classifier (HSC). Both are compared on the MVTec Anomaly Detection Dataset (MVTec-AD) -- training images are flawless objects/textures and the goal is to segment unseen defects -- showing that consistent improvement is achieved by better designing the pixel-wise supervision.

* Submitted to the 2023 IEEE International Conference on Image Processing (ICIP 2023)

Via

Access Paper or Ask Questions

Giga-SSL: Self-Supervised Learning for Gigapixel Images

Dec 06, 2022

Tristan Lazard, Marvin Lerousseau, Etienne Decencière, Thomas Walter

Abstract:Whole slide images (WSI) are microscopy images of stained tissue slides routinely prepared for diagnosis and treatment selection in medical practice. WSI are very large (gigapixel size) and complex (made of up to millions of cells). The current state-of-the-art (SoTA) approach to classify WSI subdivides them into tiles, encodes them by pre-trained networks and applies Multiple Instance Learning (MIL) to train for specific downstream tasks. However, annotated datasets are often small, typically a few hundred to a few thousand WSI, which may cause overfitting and underperforming models. Conversely, the number of unannotated WSI is ever increasing, with datasets of tens of thousands (soon to be millions) of images available. While it has been previously proposed to use these unannotated data to identify suitable tile representations by self-supervised learning (SSL), downstream classification tasks still require full supervision because parts of the MIL architecture is not trained during tile level SSL pre-training. Here, we propose a strategy of slide level SSL to leverage the large number of WSI without annotations to infer powerful slide representations. Applying our method to The Cancer-Genome Atlas, one of the most widely used data resources in cancer research (16 TB image data), we are able to downsize the dataset to 23 MB without any loss in predictive power: we show that a linear classifier trained on top of these embeddings maintains or improves previous SoTA performances on various benchmark WSI classification tasks. Finally, we observe that training a classifier on these representations with tiny datasets (e.g. 50 slides) improved performances over SoTA by an average of +6.3 AUC points over all downstream tasks.

Via

Access Paper or Ask Questions

[Reproducibility Report] Explainable Deep One-Class Classification

Jun 06, 2022

Joao P. C. Bertoldo, Etienne Decencière

Figure 1 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 2 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 3 for [Reproducibility Report] Explainable Deep One-Class Classification

Figure 4 for [Reproducibility Report] Explainable Deep One-Class Classification

Abstract:Fully Convolutional Data Description (FCDD), an explainable version of the Hypersphere Classifier (HSC), directly addresses image anomaly detection (AD) and pixel-wise AD without any post-hoc explainer methods. The authors claim that FCDD achieves results comparable with the state-of-the-art in sample-wise AD on Fashion-MNIST and CIFAR-10 and exceeds the state-of-the-art on the pixel-wise task on MVTec-AD. We reproduced the main results of the paper using the author's code with minor changes and provide runtime requirements to achieve if (CPU memory, GPU memory, and training time). We propose another analysis methodology using a critical difference diagram, and further investigate the test performance of the model during the training phase.

* Submitted to the ML Reproducibility Challenge 2021 Fall

Via

Access Paper or Ask Questions

A modular U-Net for automated segmentation of X-ray tomography images in composite materials

Jul 15, 2021

João P C Bertoldo, Etienne Decencière, David Ryckelynck, Henry Proudhon

Figure 1 for A modular U-Net for automated segmentation of X-ray tomography images in composite materials

Figure 2 for A modular U-Net for automated segmentation of X-ray tomography images in composite materials

Figure 3 for A modular U-Net for automated segmentation of X-ray tomography images in composite materials

Figure 4 for A modular U-Net for automated segmentation of X-ray tomography images in composite materials

Abstract:X-ray Computed Tomography (XCT) techniques have evolved to a point that high-resolution data can be acquired so fast that classic segmentation methods are prohibitively cumbersome, demanding automated data pipelines capable of dealing with non-trivial 3D images. Deep learning has demonstrated success in many image processing tasks, including material science applications, showing a promising alternative for a humanfree segmentation pipeline. In this paper a modular interpretation of UNet (Modular U-Net) is proposed and trained to segment 3D tomography images of a three-phased glass fiber-reinforced Polyamide 66. We compare 2D and 3D versions of our model, finding that the former is slightly better than the latter. We observe that human-comparable results can be achievied even with only 10 annotated layers and using a shallow U-Net yields better results than a deeper one. As a consequence, Neural Network (NN) show indeed a promising venue to automate XCT data processing pipelines needing no human, adhoc intervention.

* Submitted to Nature Machine Intelligence

Via

Access Paper or Ask Questions

Dealing with Topological Information within a Fully Convolutional Neural Network

Jun 27, 2019

Etienne Decencière, Santiago Velasco-Forero, Fu Min, Juanjuan Chen, Hélène Burdin, Gervais Gauthier, Bruno Laÿ, Thomas Bornschloegl, Thérèse Baldeweck

Figure 1 for Dealing with Topological Information within a Fully Convolutional Neural Network

Figure 2 for Dealing with Topological Information within a Fully Convolutional Neural Network

Figure 3 for Dealing with Topological Information within a Fully Convolutional Neural Network

Figure 4 for Dealing with Topological Information within a Fully Convolutional Neural Network

Abstract:A fully convolutional neural network has a receptive field of limited size and therefore cannot exploit global information, such as topological information. A solution is proposed in this paper to solve this problem, based on pre-processing with a geodesic operator. It is applied to the segmentation of histological images of pigmented reconstructed epidermis acquired via Whole Slide Imaging.

* Advanced Concepts for Intelligent Vision Systems. ACIVS 2018. Lecture Notes in Computer Science, vol 11182. Springer, Cham
* International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS 2018)

Via

Access Paper or Ask Questions