Abstract:Large annotated datasets are required for training deep learning models, but in medical imaging data sharing is often complicated by ethics, anonymization and data protection legislation (e.g. the general data protection regulation (GDPR)). Generative AI models, such as generative adversarial networks (GANs) and diffusion models, can today produce very realistic synthetic images, and can potentially facilitate data sharing, as GDPR should not apply to medical images that do not belong to a specific person. However, before synthetic images can be shared, it must first be demonstrated that they can be used for training different networks with acceptable performance. Here, we therefore comprehensively evaluate four GANs (progressive GAN, StyleGAN 1-3) and a diffusion model for the task of brain tumor segmentation. Our results show that segmentation networks trained on synthetic images reach Dice scores that are 80\% - 90\% of those obtained when training with real images, but that memorization of the training images can be a problem for diffusion models if the original dataset is too small. Furthermore, we demonstrate that common metrics for evaluating synthetic images, Fr\'echet inception distance (FID) and inception score (IS), do not correlate well with the performance obtained when using the synthetic images to train segmentation networks.
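For reference, FID and IS can be computed with off-the-shelf tooling. The minimal sketch below uses torchmetrics (assumed installed with its image extras, which require torch-fidelity); the random uint8 tensors are placeholders standing in for real and synthetic MR slices, and a meaningful evaluation would use thousands of images rather than 32.

```python
# Minimal sketch: computing FID and IS with torchmetrics.
# Placeholder tensors stand in for real and synthetic MR slices
# converted to 3-channel uint8 images.
import torch
from torchmetrics.image.fid import FrechetInceptionDistance
from torchmetrics.image.inception import InceptionScore

real = torch.randint(0, 256, (32, 3, 256, 256), dtype=torch.uint8)
fake = torch.randint(0, 256, (32, 3, 256, 256), dtype=torch.uint8)

fid = FrechetInceptionDistance(feature=2048)
fid.update(real, real=True)   # accumulate real-image features
fid.update(fake, real=False)  # accumulate synthetic-image features
print("FID:", fid.compute().item())

inception = InceptionScore()
inception.update(fake)
mean, std = inception.compute()
print("IS:", mean.item(), "+/-", std.item())
```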
Abstract:Diffusion models were initially developed for text-to-image generation and are now being utilized to generate high quality synthetic images. Preceded by GANs, diffusion models have shown impressive results using various evaluation metrics. However, commonly used metrics such as FID and IS are not suitable for determining whether diffusion models are simply reproducing the training images. Here we train StyleGAN and diffusion models, using the BRATS20 and BRATS21 datasets, to synthesize brain tumor images, and measure the correlation between the synthetic images and all training images. Our results show that diffusion models are much more likely to memorize the training images, especially for small datasets. Researchers should be careful when using diffusion models for medical imaging if the final goal is to share the synthetic images.
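A straightforward way to operationalize this memorization check is to compare each synthetic image against every training image and record the highest correlation. The sketch below is a minimal illustration using Pearson correlation on flattened arrays; the random arrays, their sizes and the function name max_training_correlation are illustrative assumptions, not the study's implementation.

```python
# Minimal sketch of a memorization check: for every synthetic image,
# find the training image with the highest Pearson correlation.
# In practice each row would be a flattened, intensity-normalized brain slice.
import numpy as np

def max_training_correlation(synthetic, training):
    """synthetic: (S, P) array, training: (T, P) array of flattened images.
    Returns the max correlation per synthetic image and the index of the
    closest training image."""
    s = synthetic - synthetic.mean(axis=1, keepdims=True)
    s /= np.linalg.norm(s, axis=1, keepdims=True)
    t = training - training.mean(axis=1, keepdims=True)
    t /= np.linalg.norm(t, axis=1, keepdims=True)
    corr = s @ t.T                       # (S, T) matrix of correlations
    return corr.max(axis=1), corr.argmax(axis=1)

rng = np.random.default_rng(0)
synthetic = rng.random((10, 64 * 64))
training = rng.random((100, 64 * 64))
max_corr, closest = max_training_correlation(synthetic, training)
print(max_corr, closest)
```

A synthetic image whose maximum correlation is close to 1 is likely a near-copy of the indicated training image.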
Abstract:Using 3D CNNs on high resolution medical volumes is very computationally demanding, especially for large datasets like the UK Biobank, which aims to scan 100,000 subjects. Here we demonstrate that using 2D CNNs on a few 2D projections (representing the mean and standard deviation across axial, sagittal and coronal slices) of the 3D volumes leads to reasonable test accuracy when predicting age from brain volumes. Using our approach, one training epoch with 20,324 subjects takes 40 - 70 seconds on a single GPU, which is almost 100 times faster than a small 3D CNN. These results are important for researchers who do not have access to expensive GPU hardware for 3D CNNs.
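A minimal sketch of the projection step, using numpy and a random placeholder volume instead of an actual brain scan (the MNI-like volume size is an assumption):

```python
# Minimal sketch: reduce a 3D volume to six 2D images (mean and standard
# deviation across each of the three anatomical directions), which can then
# be stacked as channels for a 2D CNN.
import numpy as np

def project_volume(volume):
    """volume: (X, Y, Z) array. Returns a list of six 2D projections."""
    projections = []
    for axis in range(3):                     # sagittal, coronal, axial
        projections.append(volume.mean(axis=axis))
        projections.append(volume.std(axis=axis))
    return projections

volume = np.random.rand(182, 218, 182)        # MNI-sized volume (assumption)
for p in project_volume(volume):
    print(p.shape)
```

Note that projections along different axes have different shapes, so in practice they would be resized or padded to a common size before being stacked as CNN input channels.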
Abstract:Large annotated datasets are required to train segmentation networks. In medical imaging, it is often difficult, time consuming and expensive to create such datasets, and it may also be difficult to share them with other researchers. Different AI models can today generate very realistic synthetic images, which can potentially be openly shared as they do not belong to specific persons. However, recent work has shown that using synthetic images for training deep networks often leads to worse performance compared to using real images. Here we demonstrate that using synthetic images and annotations from an ensemble of 10 GANs, instead of from a single GAN, increases the Dice score on real test images by 4.7\% to 14.0\% for specific classes.
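A minimal sketch of how an ensemble of generators can contribute equal shares of synthetic image/annotation pairs; the placeholder callable stands in for a trained GAN, and names such as sample_ensemble and n_per_gan are hypothetical:

```python
# Minimal sketch: build a synthetic training set by drawing an equal number
# of image/label pairs from each generator in the ensemble.
import torch

def sample_ensemble(generators, n_per_gan, latent_dim=512):
    images, labels = [], []
    for g in generators:
        z = torch.randn(n_per_gan, latent_dim)
        img, lab = g(z)                 # each generator returns image + label map
        images.append(img)
        labels.append(lab)
    return torch.cat(images), torch.cat(labels)

# Placeholder "generator" returning random 4-channel images and label maps.
fake_gan = lambda z: (torch.rand(len(z), 4, 64, 64),
                      torch.randint(0, 4, (len(z), 64, 64)))
imgs, labs = sample_ensemble([fake_gan] * 10, n_per_gan=100)
print(imgs.shape, labs.shape)
```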
Abstract:Diffusion MRI (dMRI) is the only non-invasive technique sensitive to tissue micro-architecture, which can, in turn, be used to reconstruct tissue microstructure and white matter pathways. The accuracy of such tasks is hampered by the low signal-to-noise ratio in dMRI. Today, the noise is characterized mainly by visual inspection of residual maps and the estimated standard deviation. However, it is hard to estimate the impact of noise on downstream tasks based only on such qualitative assessments. To address this issue, we introduce a novel metric, Noise Uncertainty Quantification (NUQ), for quantitative image quality analysis in the absence of a ground truth reference image. NUQ uses a recent Bayesian formulation of dMRI models to estimate the uncertainty of microstructural measures. Specifically, NUQ uses the maximum mean discrepancy metric to compute a pooled quality score by comparing samples drawn from the posterior distribution of the microstructure measures. We show that NUQ allows a fine-grained analysis of noise, capturing details that are visually imperceptible. We perform qualitative and quantitative comparisons on real datasets, showing that NUQ generates consistent scores across different denoisers and acquisitions. Lastly, by using NUQ on a cohort of schizophrenia patients and controls, we quantify the substantial impact of denoising on group differences.
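To make the maximum mean discrepancy ingredient concrete, the sketch below implements a standard Gaussian-kernel squared-MMD estimator (the biased V-statistic form) on two sets of posterior samples; the kernel bandwidth sigma and the sample arrays are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal sketch: squared maximum mean discrepancy (MMD^2) between two
# sample sets, using a Gaussian kernel (biased V-statistic estimator).
import numpy as np

def mmd2(x, y, sigma=1.0):
    """x: (n, d) and y: (m, d) samples; returns the squared MMD."""
    def k(a, b):
        d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
        return np.exp(-d2 / (2 * sigma ** 2))
    return k(x, x).mean() + k(y, y).mean() - 2 * k(x, y).mean()

rng = np.random.default_rng(0)
post_a = rng.normal(0.0, 1.0, size=(500, 1))   # posterior samples, condition A
post_b = rng.normal(0.2, 1.0, size=(500, 1))   # posterior samples, condition B
print(mmd2(post_a, post_b))
```

A larger MMD indicates a bigger shift between the two posterior distributions, which is the kind of pooled discrepancy NUQ aggregates into a quality score.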
Abstract:In the application of deep learning on optical coherence tomography (OCT) data, it is common to train classification networks using 2D images originating from volumetric data. Given the micrometer resolution of OCT systems, consecutive images are often very similar in both visible structures and noise. Thus, an inappropriate data split can result in overlap between the training and testing sets, with a large portion of the literature overlooking this aspect. In this study, the effect of improper dataset splitting on model evaluation is demonstrated for two classification tasks using two OCT open-access datasets extensively used in the literature, Kermany's ophthalmology dataset and the AIIMS breast tissue dataset. Our results show that the classification accuracy is inflated by 3.9 to 26 percentage points for models tested on a dataset with improper splitting, highlighting the considerable effect of dataset handling on model evaluation. This study intends to raise awareness of the importance of dataset splitting for research on deep learning using OCT data and volumetric data in general.
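The remedy for this pitfall is a subject-wise (grouped) split, so that all 2D images originating from the same volume or subject stay on one side of the train/test boundary. A minimal sketch using sklearn's GroupShuffleSplit, with placeholder arrays standing in for the image indices and subject IDs:

```python
# Minimal sketch: subject-wise train/test split that prevents slices from
# the same volume from appearing in both sets.
import numpy as np
from sklearn.model_selection import GroupShuffleSplit

n_images = 1000
X = np.arange(n_images)                    # image indices (stand-ins)
subjects = np.repeat(np.arange(100), 10)   # 100 subjects, 10 slices each

splitter = GroupShuffleSplit(n_splits=1, test_size=0.2, random_state=0)
train_idx, test_idx = next(splitter.split(X, groups=subjects))

# No subject contributes images to both sets.
assert not set(subjects[train_idx]) & set(subjects[test_idx])
```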
Abstract:Classifying subjects as healthy or diseased using neuroimaging data has gained a lot of attention during the last 10 years. Here we apply deep learning to derivatives from resting state fMRI data, and investigate how different 3D augmentation techniques affect the test accuracy. Specifically, we use resting state derivatives from 1,112 subjects in the ABIDE Preprocessed dataset to train a 3D convolutional neural network (CNN) to perform the classification. Our results show that augmentation provides only minor improvements to the test accuracy.
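As a concrete illustration of such 3D augmentation, here is a minimal sketch of a random flip and a small random rotation applied to a placeholder volume with scipy; the parameter ranges and the volume size are assumptions, not the study's settings.

```python
# Minimal sketch: simple 3D augmentations (random flip, small rotation)
# applied to a placeholder volume.
import numpy as np
from scipy.ndimage import rotate

def augment(volume, rng):
    if rng.random() < 0.5:                       # random left-right flip
        volume = np.flip(volume, axis=0)
    angle = rng.uniform(-10, 10)                 # small random rotation (degrees)
    return rotate(volume, angle, axes=(1, 2), reshape=False, order=1)

rng = np.random.default_rng(0)
vol = rng.random((61, 73, 61))                   # derivative volume size (assumption)
print(augment(vol, rng).shape)
```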
Abstract:Effective, robust and automatic tools for brain tumor segmentation are needed for extraction of information useful in treatment planning. In recent years, convolutional neural networks have shown state-of-the-art performance in the identification of tumor regions in magnetic resonance (MR) images. A large portion of the current research is devoted to the development of new network architectures to improve segmentation accuracy. In this work, it is instead investigated whether the addition of contextual information in the form of white matter (WM), gray matter (GM) and cerebrospinal fluid (CSF) masks improves U-Net based brain tumor segmentation. The BraTS 2020 dataset was used to train and test a standard 3D U-Net model that, in addition to the conventional MR image modalities, used the contextual information as extra channels. For comparison, a baseline model that only used the conventional MR image modalities was also trained. Dice scores of 80.76 and 79.58 were obtained for the baseline and the contextual information models, respectively. Results show that there is no statistically significant difference between the Dice scores of the two models on the test dataset (p > 0.5). In conclusion, there is no improvement in segmentation performance when using contextual information as extra channels.
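The contextual-information model only changes the network input: the tissue masks are concatenated with the conventional modalities as extra channels. A minimal sketch with random placeholder arrays and an illustrative 128^3 patch size:

```python
# Minimal sketch: concatenating WM/GM/CSF masks with the four conventional
# BraTS modalities, giving a 7-channel input for the 3D U-Net.
import numpy as np

modalities = np.random.rand(4, 128, 128, 128)    # T1, T1ce, T2, FLAIR
tissue_masks = np.random.rand(3, 128, 128, 128)  # WM, GM, CSF masks

baseline_input = modalities                                            # 4 channels
contextual_input = np.concatenate([modalities, tissue_masks], axis=0)  # 7 channels
print(baseline_input.shape, contextual_input.shape)
```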
Abstract:Training segmentation networks requires large annotated datasets, which in medical imaging can be hard to obtain. Despite this fact, data augmentation has in our opinion not been fully explored for brain tumor segmentation (a possible explanation is that the number of training subjects (369) is rather large in the BraTS 2020 dataset). Here we apply different types of data augmentation (flipping, rotation, scaling, brightness adjustment, elastic deformation) when training a standard 3D U-Net, and demonstrate that augmentation significantly improves performance on the validation set (125 subjects) in many cases. Our conclusion is that brightness augmentation and elastic deformation work best, and that combining different augmentation techniques does not provide further improvement compared to using a single augmentation technique.
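Of the augmentations listed, elastic deformation is the least standard. Below is a minimal sketch of one common formulation, a smoothed random displacement field warped with scipy's map_coordinates; the parameter values alpha and sigma are illustrative, not the ones used in the study.

```python
# Minimal sketch: elastic deformation of a 3D volume via a smoothed random
# displacement field.
import numpy as np
from scipy.ndimage import gaussian_filter, map_coordinates

def elastic_deform(volume, alpha=30.0, sigma=4.0, rng=None):
    rng = rng or np.random.default_rng()
    shape = volume.shape
    # One smoothed random displacement component per spatial axis.
    displacement = [
        gaussian_filter(rng.uniform(-1, 1, shape), sigma) * alpha
        for _ in range(volume.ndim)
    ]
    coords = np.meshgrid(*[np.arange(s) for s in shape], indexing="ij")
    warped = [c + d for c, d in zip(coords, displacement)]
    return map_coordinates(volume, warped, order=1, mode="reflect")

vol = np.random.rand(64, 64, 64)
print(elastic_deform(vol).shape)
```

Here alpha scales the displacement magnitude and sigma controls how smooth (locally coherent) the deformation is.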
Abstract:Training segmentation networks requires large annotated datasets, but manual annotation is time consuming and costly. Here we investigate whether the combination of a noise-to-image GAN and an image-to-image GAN can be used to synthesize realistic brain tumor images, as well as the corresponding tumor annotations (labels), to substantially increase the number of training images. The noise-to-image GAN is used to synthesize new label images, while the image-to-image GAN generates the corresponding MR image from the label image. Our results indicate that the two GANs can synthesize label images and MR images that look realistic, and that adding synthetic images improves segmentation performance, although the effect is small.
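A minimal sketch of the two-stage pipeline, with placeholder callables standing in for the trained noise-to-image and image-to-image generators; the function name synthesize_pair and all shapes are hypothetical:

```python
# Minimal sketch: stage 1 synthesizes a tumor label image from noise,
# stage 2 translates the label image into the corresponding MR image.
import torch

def synthesize_pair(label_generator, image_generator, latent_dim=512):
    z = torch.randn(1, latent_dim)
    label = label_generator(z)          # stage 1: noise -> tumor annotation
    image = image_generator(label)      # stage 2: annotation -> MR image
    return image, label

# Placeholders standing in for e.g. a trained StyleGAN and a pix2pix model.
label_gan = lambda z: torch.randint(0, 4, (1, 1, 256, 256)).float()
image_gan = lambda lab: torch.rand(1, 4, 256, 256)

img, lab = synthesize_pair(label_gan, image_gan)
print(img.shape, lab.shape)
```

Repeating this sampling step yields arbitrarily many paired image/label examples that can be mixed into the real training set.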