Abstract: Training supervised deep neural networks that perform defect detection and segmentation requires large-scale, fully-annotated datasets, which can be hard or even impossible to obtain in industrial environments. Generative AI offers the opportunity to enlarge small industrial datasets artificially, thus enabling the use of state-of-the-art supervised approaches in industry. Unfortunately, good generative models also need a lot of data to train, while industrial datasets are often tiny. Here, we propose a new approach for reusing general-purpose pre-trained generative models on industrial data, ultimately allowing the generation of self-labelled defective images. First, we let the model learn the new concept, i.e., the novel data distribution. Then, we teach it to condition the generative process, producing industrial images that satisfy well-defined topological characteristics and show defects with a given geometry and location. To highlight the advantage of our approach, we use the synthetic dataset to optimise a crack segmentor for a real industrial use case. When the available data is scarce, we observe a considerable performance increase under several metrics, showing the method's potential in production environments.
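A minimal sketch of the conditioning idea described above, not the actual implementation: a toy generator that takes a defect-geometry mask as an extra input channel, so that the mask used for conditioning doubles as the label of the synthetic image. All module names and sizes are illustrative assumptions.

import torch
import torch.nn as nn

class ConditionalGenerator(nn.Module):
    """Toy generator: maps a latent code plus a defect-geometry mask to an image."""
    def __init__(self, z_dim=64, img_ch=1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(z_dim + 1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, img_ch, 3, padding=1), nn.Tanh())

    def forward(self, z, defect_mask):
        # Broadcast the latent code spatially and concatenate the mask, so the
        # generated defect follows the requested geometry and location.
        z_map = z[:, :, None, None].expand(-1, -1, *defect_mask.shape[-2:])
        return self.net(torch.cat([z_map, defect_mask], dim=1))

gen = ConditionalGenerator()
opt = torch.optim.Adam(gen.parameters(), lr=1e-4)

z = torch.randn(4, 64)
mask = (torch.rand(4, 1, 64, 64) > 0.95).float()  # toy defect geometry: also the label
fake = gen(z, mask)
loss = fake.mean()                                # placeholder for the actual generative loss
loss.backward(); opt.step()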
Abstract: Thanks to their ability to learn flexible data-driven losses, Generative Adversarial Networks (GANs) are an integral part of many semi- and weakly-supervised methods for medical image segmentation. GANs jointly optimise a generator and an adversarial discriminator on a set of training data. After training has completed, the discriminator is usually discarded and only the generator is used for inference. But should we discard discriminators? In this work, we argue that training stable discriminators produces expressive loss functions that we can re-use at inference to detect and correct segmentation mistakes. First, we identify key challenges and suggest possible solutions to make discriminators re-usable at inference. Then, we show that we can combine discriminators with image reconstruction costs (via decoders) to further improve the model. Our method is simple and improves the test-time performance of pre-trained GANs. Moreover, we show that it is compatible with standard post-processing techniques and has the potential to be used for Online Continual Learning. With our work, we open new research avenues for re-using adversarial discriminators at inference.
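An illustrative sketch of the inference-time correction objective, under the assumption that a pre-trained segmentor, frozen discriminator and decoder are available; the exact optimisation schedule and loss weights in the paper may differ.

import torch

def correct_at_inference(image, segmentor, discriminator, decoder, steps=20, lr=1e-4):
    """Refine the segmentor on one test image using the frozen discriminator and decoder."""
    segmentor.train()                                  # only the segmentor is updated
    opt = torch.optim.Adam(segmentor.parameters(), lr=lr)
    for _ in range(steps):
        mask = segmentor(image)
        realism = -discriminator(mask).mean()          # push the mask towards the learned prior
        recon = torch.nn.functional.l1_loss(decoder(mask), image)  # mask must still explain the image
        loss = realism + recon
        opt.zero_grad(); loss.backward(); opt.step()
    return segmentor(image).detach()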
Abstract: Thanks to their ability to learn data distributions without requiring paired data, Generative Adversarial Networks (GANs) have become an integral part of many computer vision methods, including those developed for medical image segmentation. These methods jointly train a segmentor and an adversarial mask discriminator, which provides a data-driven shape prior. At inference, the discriminator is discarded, and only the segmentor is used to predict label maps on test images. But should we discard the discriminator? Here, we argue that the life cycle of adversarial discriminators should not end after training. On the contrary, training stable GANs produces powerful shape priors that we can use to correct segmentor mistakes at inference. To achieve this, we develop stable mask discriminators that do not overfit or catastrophically forget. At test time, we fine-tune the segmentor on each individual test instance until it satisfies the learned shape prior. Our method is simple to implement and increases model performance. Moreover, it opens new directions for re-using mask discriminators at inference. We release the code used for the experiments at https://vios-s.github.io/adversarial-test-time-training.
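A minimal sketch of the per-instance test-time adaptation loop, assuming pre-trained `segmentor` and mask `discriminator` modules; the stopping criterion and hyper-parameters here are assumptions, not the released settings.

import copy
import torch

def test_time_adapt(image, segmentor, discriminator, max_steps=50, tol=0.5, lr=1e-5):
    """Fine-tune a copy of the segmentor on one test image until the shape prior is satisfied."""
    model = copy.deepcopy(segmentor)                  # keep the original weights untouched
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(max_steps):
        mask = model(image)
        score = discriminator(mask).mean()            # higher = more plausible anatomy
        if score > tol:                               # shape prior satisfied: stop adapting
            break
        loss = -score
        opt.zero_grad(); loss.backward(); opt.step()
    return model(image).detach()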
Abstract: Collecting large-scale medical datasets with fine-grained annotations is time-consuming and requires experts. For this reason, weakly supervised learning aims at optimising machine learning models using weaker forms of annotation, such as scribbles, which are easier and faster to collect. Unfortunately, training with weak labels is challenging and needs regularisation. Herein, we introduce a novel self-supervised multi-scale consistency loss which, coupled with an attention mechanism, encourages the segmentor to learn multi-scale relationships between objects and improves performance. We show state-of-the-art performance on several medical and non-medical datasets. The code used for the experiments is available at https://vios-s.github.io/multiscale-pyag.
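An illustrative sketch of one possible multi-scale consistency term (an assumed form, not necessarily the exact published loss): predictions obtained from deep, low-resolution features are encouraged to agree with the downsampled full-resolution prediction.

import torch
import torch.nn.functional as F

def multiscale_consistency(full_res_pred, lowres_preds):
    """full_res_pred: (B, C, H, W) softmax output; lowres_preds: list of coarser (B, C, h, w) maps."""
    loss = 0.0
    for p in lowres_preds:
        target = F.interpolate(full_res_pred.detach(), size=p.shape[-2:],
                               mode="bilinear", align_corners=False)
        loss = loss + F.mse_loss(p, target)   # each scale is pulled towards the stop-gradient target
    return loss / len(lowres_preds)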
Abstract: Recent state-of-the-art semi- and unsupervised solutions for challenging computer vision tasks have used the idea of encoding image content into a spatial tensor and image appearance or "style" into a vector. These decomposed representations take advantage of equivariant properties of network design and improve performance in equivariant tasks, such as image-to-image translation. Most of these methods use the term "disentangled" for their representations and employ model design, learning objectives, and data biases to achieve good model performance. While considerable effort has been made to measure disentanglement in vector representations, currently, metrics that can characterize the degree of disentanglement between content (spatial) and style (vector) representations and the relation to task performance are lacking. In this paper, we propose metrics to measure how (un)correlated, biased, and informative the content and style representations are. In particular, we first identify key design choices and learning constraints on three popular models that employ content-style disentanglement and derive ablated versions. Then, we use our metrics to ascertain the role of each bias. Our experiments reveal a "sweet-spot" between disentanglement, task performance and latent space interpretability. The proposed metrics enable the design of better models and the selection of models that achieve the desired performance and disentanglement. Our metrics library is available at https://github.com/TsaftarisCollaboratory/CSDisentanglement_Metrics_Library.
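As an illustration of what "measuring how (un)correlated the content and style representations are" can look like, here is a sketch of empirical distance correlation between flattened content tensors and style vectors; this is an assumed instantiation for exposition, not necessarily the metric shipped in the library linked above.

import numpy as np

def _centered_dist(x):
    """Double-centred pairwise Euclidean distance matrix."""
    d = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)
    return d - d.mean(0, keepdims=True) - d.mean(1, keepdims=True) + d.mean()

def distance_correlation(content, style):
    """content: (N, ...) spatial codes; style: (N, d) vectors. 0 = independent, 1 = fully dependent."""
    a = _centered_dist(content.reshape(len(content), -1))
    b = _centered_dist(style.reshape(len(style), -1))
    dcov = np.sqrt((a * b).mean())
    return dcov / np.sqrt(np.sqrt((a * a).mean() * (b * b).mean()))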
Abstract: Large, fine-grained image segmentation datasets, annotated at pixel level, are difficult to obtain, particularly in medical imaging, where annotations also require expert knowledge. Weakly-supervised learning can train models by relying on weaker forms of annotation, such as scribbles. Here, we learn to segment using scribble annotations in an adversarial game. With unpaired segmentation masks, we train a multi-scale GAN to generate realistic segmentation masks at multiple resolutions, while we use scribbles to learn the correct position in the image. Central to the model's success is a novel attention gating mechanism, which we condition with adversarial signals to act as a shape prior, resulting in better object localization at multiple scales. We evaluated our model on several medical (ACDC, LVSC, CHAOS) and non-medical (PPSS) datasets, and we report performance levels matching those achieved by models trained with fully annotated segmentation masks. We also demonstrate extensions in a variety of settings: semi-supervised learning; combining multiple scribble sources (a crowdsourcing scenario); and multi-task learning (combining scribble and mask supervision). We will release expert-made scribble annotations for the ACDC dataset, and the code used for the experiments, at https://gvalvano.github.io/wss-multiscale-adversarial-attention-gates.
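An illustrative sketch of the scribble-based training signal (an assumed formulation): scribbles supervise only the annotated pixels through a partial cross-entropy, while an adversarial mask discriminator supplies the shape prior on the full prediction.

import torch
import torch.nn.functional as F

def scribble_loss(logits, scribbles, discriminator, ignore_index=255, adv_weight=0.1):
    """logits: (B, C, H, W); scribbles: (B, H, W) with `ignore_index` on unlabelled pixels."""
    pce = F.cross_entropy(logits, scribbles, ignore_index=ignore_index)   # supervision only where scribbled
    adv = -discriminator(torch.softmax(logits, dim=1)).mean()             # realism of the whole mask
    return pce + adv_weight * adv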
Abstract: There has been an increasing focus on learning interpretable feature representations, particularly in applications such as medical image analysis that require explainability, whilst relying less on annotated data (since annotations can be tedious and costly). Here we build on recent innovations in style-content representations to learn anatomy, imaging characteristics (appearance) and temporal correlations. By introducing a self-supervised objective of predicting future cardiac phases, we improve disentanglement. We propose a temporal transformer architecture that, given an image and conditioned on the phase difference, predicts a future frame. This forces the anatomical decomposition to be consistent with the temporal cardiac contraction in cine MRI and to carry semantic meaning with less need for annotations. We demonstrate that, using this regularisation, we achieve competitive results and improve semi-supervised segmentation, especially when very few labelled data are available. Specifically, we show a Dice increase of up to 19% and 7% compared to supervised and semi-supervised approaches, respectively, on the ACDC dataset. Code is available at: https://github.com/gvalvano/sdtnet.
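A simplified stand-in for the temporal transformer idea, assuming the anatomy is encoded as a spatial tensor: the current-frame anatomy, conditioned on the requested phase difference, is mapped towards the anatomy of a future frame. Module names and sizes are illustrative assumptions.

import torch
import torch.nn as nn

class TemporalTransformer(nn.Module):
    def __init__(self, channels=8):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(channels + 1, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, channels, 3, padding=1))

    def forward(self, anatomy, phase_diff):
        # Tile the scalar phase difference into an extra channel before convolving.
        dt = phase_diff.view(-1, 1, 1, 1).expand(-1, 1, *anatomy.shape[-2:])
        return self.net(torch.cat([anatomy, dt], dim=1))

# Self-supervised objective (sketch): the predicted future anatomy is matched to the
# encoding of the actual future frame, e.g. with an L1 loss, encouraging disentanglement.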
Abstract: Recent research has put great effort into the development of deep learning architectures and optimisers, obtaining impressive results in areas ranging from vision to language processing. However, little attention has been paid to the need for a methodological process of data collection. In this work, we show that high-quality data for supervised learning can be selected in an unsupervised manner, and that by doing so one can obtain models that generalise better than with random training set construction.
Abstract: Skull-stripping methods aim to remove non-brain tissue from brain scans acquired with magnetic resonance (MR) imaging. Although several methods sharing this common purpose have been presented in the literature, they all suffer from the great variability of MR images. In this work, we propose a novel approach based on Convolutional Neural Networks to automatically perform brain extraction, obtaining cutting-edge performance on the public NFBS database. Additionally, we focus on the efficient training of the neural network by designing an effective data augmentation pipeline. Results are evaluated in terms of the Dice metric, reaching a value of 96.5%, and processing time, at 4.5 s per volume.
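For reference, a sketch of the standard Dice score used for the evaluation above, computed on binary brain masks.

import numpy as np

def dice(pred, target, eps=1e-7):
    """pred, target: binary brain masks as NumPy arrays of the same shape."""
    pred, target = pred.astype(bool), target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return 2.0 * intersection / (pred.sum() + target.sum() + eps)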
Abstract: Clusters of microcalcifications can be an early sign of breast cancer. In this paper, we propose a novel approach based on convolutional neural networks for the detection and segmentation of microcalcification clusters. We used 283 mammograms to train and validate our model, obtaining an accuracy of 98.22% in the detection of preliminary suspect regions and of 97.47% in the segmentation task. Our results show that deep learning can be an effective tool to support radiologists during mammogram examination.