Abstract: The recognition performance of biometric systems strongly depends on the quality of the compared biometric samples. Motivated by the goal of establishing a common understanding of face image quality and enabling system interoperability, the committee draft of ISO/IEC 29794-5 introduces expression neutrality as one of many component quality elements affecting recognition performance. In this study, we train classifiers to assess facial expression neutrality using seven datasets. We conduct extensive benchmarking to evaluate both their classification accuracy and their ability to predict face recognition utility. Our experiments reveal significant differences in how each classifier distinguishes "neutral" from "non-neutral" expressions. While Random Forest and AdaBoost classifiers distinguish neutral from non-neutral facial expressions with the highest accuracy, they underperform Support Vector Machines in predicting face recognition utility.
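To make the comparison concrete, below is a minimal sketch of how such a classifier benchmark could be set up with scikit-learn. The features, labels, and hyperparameters are illustrative placeholders, not the study's actual pipeline; in practice the feature extraction from face images would happen upstream.

```python
# Minimal sketch: comparing expression-neutrality classifiers.
# X would hold per-image features (e.g. landmarks or embeddings) extracted
# elsewhere; y labels each sample neutral (1) or non-neutral (0).
# Random placeholders stand in for real data here.
import numpy as np
from sklearn.ensemble import RandomForestClassifier, AdaBoostClassifier
from sklearn.svm import SVC
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 64))        # placeholder feature vectors
y = rng.integers(0, 2, size=500)      # placeholder neutrality labels

for name, clf in [
    ("RandomForest", RandomForestClassifier(n_estimators=200, random_state=0)),
    ("AdaBoost", AdaBoostClassifier(random_state=0)),
    ("SVM", SVC(kernel="rbf", random_state=0)),
]:
    acc = cross_val_score(clf, X, y, cv=5).mean()
    print(f"{name}: mean CV accuracy = {acc:.3f}")
```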
Abstract: A morph is a combination of two separate facial images and contains identity information of two different people. When used in an identity document, both people can be authenticated by a biometric Face Recognition (FR) system. Morphs can be generated using either a landmark-based approach or deep-learning-based approaches such as Generative Adversarial Networks (GANs). In a recent paper, we introduced a \emph{worst-case} upper bound on how challenging morphing attacks can be for an FR system: the closer morphs are to this upper bound, the bigger the challenge they pose to FR. We introduced an approach that could generate morphs approximating this upper bound for a known FR system (white box), but not for unknown (black-box) FR systems. In this paper, we introduce a morph generation method that can approximate worst-case morphs even when the FR system is not known. A key contribution is that we include the goal of generating difficult morphs \emph{during} training. Our method is based on Adversarially Learned Inference (ALI) and incorporates concepts from Wasserstein GANs trained with Gradient Penalty, which were introduced to stabilise GAN training; we adopt these concepts for a similar improvement in training stability and call the resulting method Wasserstein ALI (WALI). We finetune WALI using loss functions designed specifically to improve the ability to manipulate identity information in facial images, and show that it generates morphs that are more challenging for FR systems than landmark- or GAN-based morphs. We also show how our findings can be used to improve MIPGAN, an existing StyleGAN-based morph generator.
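As an illustration of the worst-case idea (not the paper's exact formulation), a morphing loss can reward an embedding for staying close to both contributing identities at once, for example by penalising the weaker of the two similarities. The embeddings and function names in the sketch below are assumptions for illustration; any differentiable FR feature extractor could supply the embeddings.

```python
# Illustrative sketch: a "worst-case" morph embedding maximises its
# similarity to BOTH contributing identities, so a natural training signal
# penalises the lower of the two cosine similarities.
import torch
import torch.nn.functional as F

def worst_case_morph_loss(e_morph, e_subj1, e_subj2):
    """e_morph, e_subj1, e_subj2: (batch, dim) FR embeddings.

    Returns a loss that is small only when BOTH similarities are high,
    pushing the morph toward the worst case for the FR system.
    """
    sim1 = F.cosine_similarity(e_morph, e_subj1, dim=1)
    sim2 = F.cosine_similarity(e_morph, e_subj2, dim=1)
    # min() selects the weaker match; raising it approximates a morph
    # that passes verification against either subject.
    return (1.0 - torch.minimum(sim1, sim2)).mean()

# Random stand-in embeddings; a naive normalised midpoint as starting point.
e1, e2 = torch.randn(4, 512), torch.randn(4, 512)
e_m = F.normalize(e1 + e2, dim=1)
print(worst_case_morph_loss(e_m, e1, e2))
```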
Abstract: We propose a multi-planar pulmonary nodule detection system using convolutional neural networks. For the candidate detection task, a 2-D convolutional neural network, U-Net++, was trained on axial, coronal, and sagittal slices, and all nodule candidates from the three planes are combined. For false positive reduction, we apply 3-D multi-scale dense convolutional neural networks to efficiently remove false positive candidates. We use the public LIDC-IDRI dataset, which includes 888 CT scans with 1186 nodules annotated by four radiologists. After ten-fold cross-validation, our proposed system achieves a sensitivity of 95.3% at 0.5 false positives per scan and a sensitivity of 96.2% at 1.0 false positives per scan. Although small nodules (diameter < 6 mm) are difficult to detect, our CAD system reaches a sensitivity of 93.8% (94.6%) for these small nodules at an overall rate of 0.5 (1.0) false positives per scan. At the candidate detection stage, the proposed system detected 98.1% of nodules after merging the predictions from all three planes, whereas using only the 1 mm axial slices detected 91.1% of nodules, which is better than using solely the coronal or sagittal slices. These results show that a multi-planar method detects more nodules than any single plane. Our approach achieves state-of-the-art performance on this dataset, demonstrating the effectiveness and efficiency of our CAD system for lung nodule detection.
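The plane-merging step could look like the following sketch, which pools world-coordinate candidates (x, y, z, score) from the three detectors and greedily fuses candidates that fall within a merge radius. The radius and the greedy scheme are illustrative assumptions, not necessarily the system's exact procedure.

```python
# Sketch of merging nodule candidates from axial, coronal and sagittal
# detectors, assuming each emits (x, y, z, score) in a common coordinate
# system (mm). Nearby detections are fused into one candidate.
import numpy as np

def merge_candidates(per_plane, radius_mm=5.0):
    """Pool candidates from all planes, then greedily fuse nearby ones."""
    pooled = np.concatenate(per_plane, axis=0)      # (N, 4)
    pooled = pooled[np.argsort(-pooled[:, 3])]      # highest score first
    used = np.zeros(len(pooled), dtype=bool)
    merged = []
    for i in range(len(pooled)):
        if used[i]:
            continue
        d = np.linalg.norm(pooled[:, :3] - pooled[i, :3], axis=1)
        group = (d < radius_mm) & ~used             # neighbours incl. i itself
        used |= group
        merged.append(pooled[group].mean(axis=0))   # average position & score
    return np.array(merged)

axial    = np.array([[10.0, 20.0, 30.0, 0.9]])
coronal  = np.array([[10.5, 19.5, 30.2, 0.8]])
sagittal = np.array([[50.0, 60.0, 10.0, 0.7]])
print(merge_candidates([axial, coronal, sagittal]))  # two fused candidates
```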
Abstract: Accurate pulmonary nodule detection in computed tomography (CT) scans is a crucial step in lung cancer screening. Despite their potential benefits, computer-aided detection (CAD) systems are not routinely used by radiologists for pulmonary nodule detection in clinical practice. Maximum intensity projection (MIP) images improve the detection of pulmonary nodules in radiological evaluation of CT scans. In this work, we explore the feasibility of utilizing MIP images to improve the effectiveness of automatic lung nodule detection by convolutional neural networks (CNNs). We propose a CNN-based approach that takes MIP images of different slab thicknesses (5 mm, 10 mm, 15 mm) and 1 mm plain multiplanar reconstruction (MPR) images as input. This augments the 2-D CT slice images with more representative spatial information, which helps discriminate nodules from vessels through their morphologies. We use the publicly available LUNA16 dataset, collected from seven academic centers, to train and test our approach. Our proposed method achieves a sensitivity of 91.13% with 1 false positive per scan and a sensitivity of 94.13% with 4 false positives per scan for lung nodule detection on this dataset. Using the thick MIP images improves the detection of small pulmonary nodules (3 mm-10 mm) and yields fewer false positives. Experimental results show that applying MIP images can increase sensitivity and lower the number of false positives, demonstrating the effectiveness and significance of the proposed maximum intensity projection based CNN framework for automatic pulmonary nodule detection in CT scans. Index Terms: Computer-aided detection (CAD), convolutional neural networks (CNNs), computed tomography scans, maximum intensity projection (MIP), pulmonary nodule detection
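For reference, computing an axial MIP over a given slab thickness is a max-projection along the z-axis. The sketch below builds the kind of multi-channel input described above (1 mm plain slice plus 5/10/15 mm MIPs); the voxel spacing, array shapes, and function name are illustrative assumptions.

```python
# Minimal sketch of building MIP inputs from a CT volume (z, y, x) of HU
# values. Slab thicknesses follow the abstract; spacing is an assumption.
import numpy as np

def axial_mip(volume, center_idx, slab_mm, spacing_mm=1.0):
    """Max-project an axial slab of slab_mm centred on slice center_idx."""
    half = max(int(round(slab_mm / (2 * spacing_mm))), 0)
    lo = max(center_idx - half, 0)
    hi = min(center_idx + half + 1, volume.shape[0])
    return volume[lo:hi].max(axis=0)

ct = np.random.normal(-700, 100, size=(120, 256, 256))   # fake CT volume
channels = [ct[60]] + [axial_mip(ct, 60, t) for t in (5, 10, 15)]
stacked = np.stack(channels)        # (4, 256, 256) multi-channel CNN input
print(stacked.shape)
```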