Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohammad Mahdi Dehshibi

On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process

Dec 18, 2024

Gereziher Adhane, Mohammad Mahdi Dehshibi, Dennis Vetter, David Masip, Gemma Roig

Figure 1 for On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process

Figure 2 for On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process

Figure 3 for On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process

Figure 4 for On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process

Abstract:Knowledge distillation (KD) remains challenging due to the opaque nature of the knowledge transfer process from a Teacher to a Student, making it difficult to address certain issues related to KD. To address this, we proposed UniCAM, a novel gradient-based visual explanation method, which effectively interprets the knowledge learned during KD. Our experimental results demonstrate that with the guidance of the Teacher's knowledge, the Student model becomes more efficient, learning more relevant features while discarding those that are not relevant. We refer to the features learned with the Teacher's guidance as distilled features and the features irrelevant to the task and ignored by the Student as residual features. Distilled features focus on key aspects of the input, such as textures and parts of objects. In contrast, residual features demonstrate more diffused attention, often targeting irrelevant areas, including the backgrounds of the target objects. In addition, we proposed two novel metrics: the feature similarity score (FSS) and the relevance score (RS), which quantify the relevance of the distilled knowledge. Experiments on the CIFAR10, ASIRRA, and Plant Disease datasets demonstrate that UniCAM and the two metrics offer valuable insights to explain the KD process.

* Accepted to 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV'25). Includes 5 pages of supplementary material

Via

Access Paper or Ask Questions

STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data

Jul 11, 2024

Freek Hens, Mohammad Mahdi Dehshibi, Leila Bagheriye, Mahyar Shahsavari, Ana Tajadura-Jiménez

Figure 1 for STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data

Figure 2 for STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data

Figure 3 for STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data

Figure 4 for STAL: Spike Threshold Adaptive Learning Encoder for Classification of Pain-Related Biosignal Data

Abstract:This paper presents the first application of spiking neural networks (SNNs) for the classification of chronic lower back pain (CLBP) using the EmoPain dataset. Our work has two main contributions. We introduce Spike Threshold Adaptive Learning (STAL), a trainable encoder that effectively converts continuous biosignals into spike trains. Additionally, we propose an ensemble of Spiking Recurrent Neural Network (SRNN) classifiers for the multi-stream processing of sEMG and IMU data. To tackle the challenges of small sample size and class imbalance, we implement minority over-sampling with weighted sample replacement during batch creation. Our method achieves outstanding performance with an accuracy of 80.43%, AUC of 67.90%, F1 score of 52.60%, and Matthews Correlation Coefficient (MCC) of 0.437, surpassing traditional rate-based and latency-based encoding methods. The STAL encoder shows superior performance in preserving temporal dynamics and adapting to signal characteristics. Importantly, our approach (STAL-SRNN) outperforms the best deep learning method in terms of MCC, indicating better balanced class prediction. This research contributes to the development of neuromorphic computing for biosignal analysis. It holds promise for energy-efficient, wearable solutions in chronic pain management.

Via

Access Paper or Ask Questions

Spatial-aware Transformer-GRU Framework for Enhanced Glaucoma Diagnosis from 3D OCT Imaging

Mar 08, 2024

Mona Ashtari-Majlan, Mohammad Mahdi Dehshibi, David Masip

Abstract:Glaucoma, a leading cause of irreversible blindness, necessitates early detection for accurate and timely intervention to prevent irreversible vision loss. In this study, we present a novel deep learning framework that leverages the diagnostic value of 3D Optical Coherence Tomography (OCT) imaging for automated glaucoma detection. In this framework, we integrate a pre-trained Vision Transformer on retinal data for rich slice-wise feature extraction and a bidirectional Gated Recurrent Unit for capturing inter-slice spatial dependencies. This dual-component approach enables comprehensive analysis of local nuances and global structural integrity, crucial for accurate glaucoma diagnosis. Experimental results on a large dataset demonstrate the superior performance of the proposed method over state-of-the-art ones, achieving an F1-score of 93.58%, Matthews Correlation Coefficient (MCC) of 73.54%, and AUC of 95.24%. The framework's ability to leverage the valuable information in 3D OCT data holds significant potential for enhancing clinical decision support systems and improving patient outcomes in glaucoma management.

Via

Access Paper or Ask Questions

BEE-NET: A deep neural network to identify in-the-wild Bodily Expression of Emotions

Feb 21, 2024

Mohammad Mahdi Dehshibi, David Masip

Figure 1 for BEE-NET: A deep neural network to identify in-the-wild Bodily Expression of Emotions

Figure 2 for BEE-NET: A deep neural network to identify in-the-wild Bodily Expression of Emotions

Figure 3 for BEE-NET: A deep neural network to identify in-the-wild Bodily Expression of Emotions

Figure 4 for BEE-NET: A deep neural network to identify in-the-wild Bodily Expression of Emotions

Abstract:In this study, we investigate how environmental factors, specifically the scenes and objects involved, can affect the expression of emotions through body language. To this end, we introduce a novel multi-stream deep convolutional neural network named BEE-NET. We also propose a new late fusion strategy that incorporates meta-information on places and objects as prior knowledge in the learning process. Our proposed probabilistic pooling model leverages this information to generate a joint probability distribution of both available and anticipated non-available contextual information in latent space. Importantly, our fusion strategy is differentiable, allowing for end-to-end training and capturing of hidden associations among data points without requiring further post-processing or regularisation. To evaluate our deep model, we use the Body Language Database (BoLD), which is currently the largest available database for the Automatic Identification of the in-the-wild Bodily Expression of Emotions (AIBEE). Our experimental results demonstrate that our proposed approach surpasses the current state-of-the-art in AIBEE by a margin of 2.07%, achieving an Emotional Recognition Score of 66.33%.

Via

Access Paper or Ask Questions

Deep Learning and Computer Vision for Glaucoma Detection: A Review

Jul 31, 2023

Mona Ashtari-Majlan, Mohammad Mahdi Dehshibi, David Masip

Abstract:Glaucoma is the leading cause of irreversible blindness worldwide and poses significant diagnostic challenges due to its reliance on subjective evaluation. However, recent advances in computer vision and deep learning have demonstrated the potential for automated assessment. In this paper, we survey recent studies on AI-based glaucoma diagnosis using fundus, optical coherence tomography, and visual field images, with a particular emphasis on deep learning-based methods. We provide an updated taxonomy that organizes methods into architectural paradigms and includes links to available source code to enhance the reproducibility of the methods. Through rigorous benchmarking on widely-used public datasets, we reveal performance gaps in generalizability, uncertainty estimation, and multimodal integration. Additionally, our survey curates key datasets while highlighting limitations such as scale, labeling inconsistencies, and bias. We outline open research challenges and detail promising directions for future studies. This survey is expected to be useful for both AI researchers seeking to translate advances into practice and ophthalmologists aiming to improve clinical workflows and diagnosis using the latest AI outcomes.

Via

Access Paper or Ask Questions

Pain level and pain-related behaviour classification using GRU-based sparsely-connected RNNs

Dec 20, 2022

Mohammad Mahdi Dehshibi, Temitayo Olugbade, Fernando Diaz-de-Maria, Nadia Bianchi-Berthouze, Ana Tajadura-Jiménez

Figure 1 for Pain level and pain-related behaviour classification using GRU-based sparsely-connected RNNs

Figure 2 for Pain level and pain-related behaviour classification using GRU-based sparsely-connected RNNs

Figure 3 for Pain level and pain-related behaviour classification using GRU-based sparsely-connected RNNs

Figure 4 for Pain level and pain-related behaviour classification using GRU-based sparsely-connected RNNs

Abstract:There is a growing body of studies on applying deep learning to biometrics analysis. Certain circumstances, however, could impair the objective measures and accuracy of the proposed biometric data analysis methods. For instance, people with chronic pain (CP) unconsciously adapt specific body movements to protect themselves from injury or additional pain. Because there is no dedicated benchmark database to analyse this correlation, we considered one of the specific circumstances that potentially influence a person's biometrics during daily activities in this study and classified pain level and pain-related behaviour in the EmoPain database. To achieve this, we proposed a sparsely-connected recurrent neural networks (s-RNNs) ensemble with the gated recurrent unit (GRU) that incorporates multiple autoencoders using a shared training framework. This architecture is fed by multidimensional data collected from inertial measurement unit (IMU) and surface electromyography (sEMG) sensors. Furthermore, to compensate for variations in the temporal dimension that may not be perfectly represented in the latent space of s-RNNs, we fused hand-crafted features derived from information-theoretic approaches with represented features in the shared hidden state. We conducted several experiments which indicate that the proposed method outperforms the state-of-the-art approaches in classifying both pain level and pain-related behaviour.

Via

Access Paper or Ask Questions

A multi-stream convolutional neural network for classification of progressive MCI in Alzheimer's disease using structural MRI images

Mar 03, 2022

Mona Ashtari-Majlan, Abbas Seifi, Mohammad Mahdi Dehshibi

Figure 1 for A multi-stream convolutional neural network for classification of progressive MCI in Alzheimer's disease using structural MRI images

Figure 2 for A multi-stream convolutional neural network for classification of progressive MCI in Alzheimer's disease using structural MRI images

Figure 3 for A multi-stream convolutional neural network for classification of progressive MCI in Alzheimer's disease using structural MRI images

Figure 4 for A multi-stream convolutional neural network for classification of progressive MCI in Alzheimer's disease using structural MRI images

Abstract:Early diagnosis of Alzheimer's disease and its prodromal stage, also known as mild cognitive impairment (MCI), is critical since some patients with progressive MCI will develop the disease. We propose a multi-stream deep convolutional neural network fed with patch-based imaging data to classify stable MCI and progressive MCI. First, we compare MRI images of Alzheimer's disease with cognitively normal subjects to identify distinct anatomical landmarks using a multivariate statistical test. These landmarks are then used to extract patches that are fed into the proposed multi-stream convolutional neural network to classify MRI images. Next, we train the architecture in a separate scenario using samples from Alzheimer's disease images, which are anatomically similar to the progressive MCI ones and cognitively normal images to compensate for the lack of progressive MCI training data. Finally, we transfer the trained model weights to the proposed architecture in order to fine-tune the model using progressive MCI and stable MCI data. Experimental results on the ADNI-1 dataset indicate that our method outperforms existing methods for MCI classification, with an F1-score of 85.96%.

Via

Access Paper or Ask Questions

ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks

Mar 02, 2022

Mohammad Mahdi Dehshibi, Mona Ashtari-Majlan, Gereziher Adhane, David Masip

Figure 1 for ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks

Figure 2 for ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks

Figure 3 for ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks

Figure 4 for ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks

Abstract:To equip Convolutional Neural Networks (CNNs) with explainability, it is essential to interpret how opaque models take specific decisions, understand what causes the errors, improve the architecture design, and identify unethical biases in the classifiers. This paper introduces ADVISE, a new explainability method that quantifies and leverages the relevance of each unit of the feature map to provide better visual explanations. To this end, we propose using adaptive bandwidth kernel density estimation to assign a relevance score to each unit of the feature map with respect to the predicted class. We also propose an evaluation protocol to quantitatively assess the visual explainability of CNN models. We extensively evaluate our idea in the image classification task using AlexNet, VGG16, ResNet50, and Xception pretrained on ImageNet. We compare ADVISE with the state-of-the-art visual explainable methods and show that the proposed method outperforms competing approaches in quantifying feature-relevance and visual explainability while maintaining competitive time complexity. Our experiments further show that ADVISE fulfils the sensitivity and implementation independence axioms while passing the sanity checks. The implementation is accessible for reproducibility purposes on https://github.com/dehshibi/ADVISE.

Via

Access Paper or Ask Questions

A deep convolutional neural network for classification of Aedes albopictus mosquitoes

Oct 29, 2021

Gereziher Adhane, Mohammad Mahdi Dehshibi, David Masip

Figure 1 for A deep convolutional neural network for classification of Aedes albopictus mosquitoes

Figure 2 for A deep convolutional neural network for classification of Aedes albopictus mosquitoes

Figure 3 for A deep convolutional neural network for classification of Aedes albopictus mosquitoes

Figure 4 for A deep convolutional neural network for classification of Aedes albopictus mosquitoes

Abstract:Monitoring the spread of disease-carrying mosquitoes is a first and necessary step to control severe diseases such as dengue, chikungunya, Zika or yellow fever. Previous citizen science projects have been able to obtain large image datasets with linked geo-tracking information. As the number of international collaborators grows, the manual annotation by expert entomologists of the large amount of data gathered by these users becomes too time demanding and unscalable, posing a strong need for automated classification of mosquito species from images. We introduce the application of two Deep Convolutional Neural Networks in a comparative study to automate this classification task. We use the transfer learning principle to train two state-of-the-art architectures on the data provided by the Mosquito Alert project, obtaining testing accuracy of 94%. In addition, we applied explainable models based on the Grad-CAM algorithm to visualise the most discriminant regions of the classified images, which coincide with the white band stripes located at the legs, abdomen, and thorax of mosquitoes of the Aedes albopictus species. The model allows us to further analyse the classification errors. Visual Grad-CAM models show that they are linked to poor acquisition conditions and strong image occlusions.

* IEEE Access, vol. 9, pp. 72681-72690, 2021

Via

Access Paper or Ask Questions

On the use of uncertainty in classifying Aedes Albopictus mosquitoes

Oct 29, 2021

Gereziher Adhane, Mohammad Mahdi Dehshibi, David Masip

Figure 1 for On the use of uncertainty in classifying Aedes Albopictus mosquitoes

Figure 2 for On the use of uncertainty in classifying Aedes Albopictus mosquitoes

Figure 3 for On the use of uncertainty in classifying Aedes Albopictus mosquitoes

Figure 4 for On the use of uncertainty in classifying Aedes Albopictus mosquitoes

Abstract:The re-emergence of mosquito-borne diseases (MBDs), which kill hundreds of thousands of people each year, has been attributed to increased human population, migration, and environmental changes. Convolutional neural networks (CNNs) have been used by several studies to recognise mosquitoes in images provided by projects such as Mosquito Alert to assist entomologists in identifying, monitoring, and managing MBD. Nonetheless, utilising CNNs to automatically label input samples could involve incorrect predictions, which may mislead future epidemiological studies. Furthermore, CNNs require large numbers of manually annotated data. In order to address the mentioned issues, this paper proposes using the Monte Carlo Dropout method to estimate the uncertainty scores in order to rank the classified samples to reduce the need for human supervision in recognising Aedes albopictus mosquitoes. The estimated uncertainty was also used in an active learning framework, where just a portion of the data from large training sets was manually labelled. The experimental results show that the proposed classification method with rejection outperforms the competing methods by improving overall performance and reducing entomologist annotation workload. We also provide explainable visualisations of the different regions that contribute to a set of samples' uncertainty assessment.

* This article has been accepted for publication in a future issue of IEEE Journal of Selected Topics in Signal Processing

Via

Access Paper or Ask Questions