Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yassine Himeur

Exploring Image Transforms derived from Eye Gaze Variables for Progressive Autism Diagnosis

Jun 07, 2025

Abigail Copiaco, Christian Ritz, Yassine Himeur, Valsamma Eapen, Ammar Albanna, Wathiq Mansoor

Abstract:The prevalence of Autism Spectrum Disorder (ASD) has surged rapidly over the past decade, posing significant challenges in communication, behavior, and focus for affected individuals. Current diagnostic techniques, though effective, are time-intensive, leading to high social and economic costs. This work introduces an AI-powered assistive technology designed to streamline ASD diagnosis and management, enhancing convenience for individuals with ASD and efficiency for caregivers and therapists. The system integrates transfer learning with image transforms derived from eye gaze variables to diagnose ASD. This facilitates and opens opportunities for in-home periodical diagnosis, reducing stress for individuals and caregivers, while also preserving user privacy through the use of image transforms. The accessibility of the proposed method also offers opportunities for improved communication between guardians and therapists, ensuring regular updates on progress and evolving support needs. Overall, the approach proposed in this work ensures timely, accessible diagnosis while protecting the subjects' privacy, improving outcomes for individuals with ASD.

* 6 pages, 8 figures, and 1 table

Via

Access Paper or Ask Questions

Hybrid Vision Transformer-Mamba Framework for Autism Diagnosis via Eye-Tracking Analysis

Jun 07, 2025

Wafaa Kasri, Yassine Himeur, Abigail Copiaco, Wathiq Mansoor, Ammar Albanna, Valsamma Eapen

Abstract:Accurate Autism Spectrum Disorder (ASD) diagnosis is vital for early intervention. This study presents a hybrid deep learning framework combining Vision Transformers (ViT) and Vision Mamba to detect ASD using eye-tracking data. The model uses attention-based fusion to integrate visual, speech, and facial cues, capturing both spatial and temporal dynamics. Unlike traditional handcrafted methods, it applies state-of-the-art deep learning and explainable AI techniques to enhance diagnostic accuracy and transparency. Tested on the Saliency4ASD dataset, the proposed ViT-Mamba model outperformed existing methods, achieving 0.96 accuracy, 0.95 F1-score, 0.97 sensitivity, and 0.94 specificity. These findings show the model's promise for scalable, interpretable ASD screening, especially in resource-constrained or remote clinical settings where access to expert diagnosis is limited.

* 7 pages, 4 figures and 2 tables

Via

Access Paper or Ask Questions

End-to-End Deep Learning in Phase Noisy Coherent Optical Link

Feb 28, 2025

Omar Alnaseri, Yassine Himeur

Abstract:In coherent optical orthogonal frequency-division multiplexing (CO-OFDM) fiber communications, a novel end-to-end learning framework to mitigate Laser Phase Noise (LPN) impairments is proposed in this paper. Inspired by Autoencoder (AE) principles, the proposed approach trains a model to learn robust symbol sequences capable of combat LPN, even from low-cost distributed feedback (DFB) lasers with linewidths up to 2 MHz. This allows for the use of high-level modulation formats and large-scale Fast Fourier Transform (FFT) processing, maximizing spectral efficiency in CO-OFDM systems. By eliminating the need for complex traditional techniques, this approach offers a potentially more efficient and streamlined solution for CO-OFDM systems. The most significant achievement of this study is the demonstration that the proposed AE-based model can enhance system performance by reducing the bit error rate (BER) to below the threshold of forward error correction (FEC), even under severe phase noise conditions, thus proving its effectiveness and efficiency in practical deployment scenarios.

Via

Access Paper or Ask Questions

A Review on Deep Learning Autoencoder in the Design of Next-Generation Communication Systems

Dec 18, 2024

Omar Alnaseri, Laith Alzubaidi, Yassine Himeur, Jens Timmermann

Abstract:Traditional mathematical models used in designing next-generation communication systems often fall short due to inherent simplifications, narrow scope, and computational limitations. In recent years, the incorporation of deep learning (DL) methodologies into communication systems has made significant progress in system design and performance optimisation. Autoencoders (AEs) have become essential, enabling end-to-end learning that allows for the combined optimisation of transmitters and receivers. Consequently, AEs offer a data-driven methodology capable of bridging the gap between theoretical models and real-world complexities. The paper presents a comprehensive survey of the application of AEs within communication systems, with a particular focus on their architectures, associated challenges, and future directions. We examine 120 recent studies across wireless, optical, semantic, and quantum communication fields, categorising them according to transceiver design, channel modelling, digital signal processing, and computational complexity. This paper further examines the challenges encountered in the implementation of AEs, including the need for extensive training data, the risk of overfitting, and the requirement for differentiable channel models. Through data-driven approaches, AEs provide robust solutions for end-to-end system optimisation, surpassing traditional mathematical models confined by simplifying assumptions. This paper also summarises the computational complexity associated with AE-based systems by conducting an in-depth analysis employing the metric of floating-point operations per second (FLOPS). This analysis encompasses the evaluation of matrix multiplications, bias additions, and activation functions. This survey aims to establish a roadmap for future research, emphasising the transformative potential of AEs in the formulation of next-generation communication systems.

Via

Access Paper or Ask Questions

Automatic Speech Recognition with BERT and CTC Transformers: A Review

Oct 12, 2024

Noussaiba Djeffal, Hamza Kheddar, Djamel Addou, Ahmed Cherif Mazari, Yassine Himeur

Figure 1 for Automatic Speech Recognition with BERT and CTC Transformers: A Review

Figure 2 for Automatic Speech Recognition with BERT and CTC Transformers: A Review

Figure 3 for Automatic Speech Recognition with BERT and CTC Transformers: A Review

Figure 4 for Automatic Speech Recognition with BERT and CTC Transformers: A Review

Abstract:This review paper provides a comprehensive analysis of recent advances in automatic speech recognition (ASR) with bidirectional encoder representations from transformers BERT and connectionist temporal classification (CTC) transformers. The paper first introduces the fundamental concepts of ASR and discusses the challenges associated with it. It then explains the architecture of BERT and CTC transformers and their potential applications in ASR. The paper reviews several studies that have used these models for speech recognition tasks and discusses the results obtained. Additionally, the paper highlights the limitations of these models and outlines potential areas for further research. All in all, this review provides valuable insights for researchers and practitioners who are interested in ASR with BERT and CTC transformers.

* 2023 2nd International Conference on Electronics, Energy and Measurement (IC2EM)

Via

Access Paper or Ask Questions

Applications of Knowledge Distillation in Remote Sensing: A Survey

Sep 18, 2024

Yassine Himeur, Nour Aburaed, Omar Elharrouss, Iraklis Varlamis, Shadi Atalla, Wathiq Mansoor, Hussain Al Ahmad

Figure 1 for Applications of Knowledge Distillation in Remote Sensing: A Survey

Figure 2 for Applications of Knowledge Distillation in Remote Sensing: A Survey

Figure 3 for Applications of Knowledge Distillation in Remote Sensing: A Survey

Figure 4 for Applications of Knowledge Distillation in Remote Sensing: A Survey

Abstract:With the ever-growing complexity of models in the field of remote sensing (RS), there is an increasing demand for solutions that balance model accuracy with computational efficiency. Knowledge distillation (KD) has emerged as a powerful tool to meet this need, enabling the transfer of knowledge from large, complex models to smaller, more efficient ones without significant loss in performance. This review article provides an extensive examination of KD and its innovative applications in RS. KD, a technique developed to transfer knowledge from a complex, often cumbersome model (teacher) to a more compact and efficient model (student), has seen significant evolution and application across various domains. Initially, we introduce the fundamental concepts and historical progression of KD methods. The advantages of employing KD are highlighted, particularly in terms of model compression, enhanced computational efficiency, and improved performance, which are pivotal for practical deployments in RS scenarios. The article provides a comprehensive taxonomy of KD techniques, where each category is critically analyzed to demonstrate the breadth and depth of the alternative options, and illustrates specific case studies that showcase the practical implementation of KD methods in RS tasks, such as instance segmentation and object detection. Further, the review discusses the challenges and limitations of KD in RS, including practical constraints and prospective future directions, providing a comprehensive overview for researchers and practitioners in the field of RS. Through this organization, the paper not only elucidates the current state of research in KD but also sets the stage for future research opportunities, thereby contributing significantly to both academic research and real-world applications.

* 50 pages, 11 figures and 9 tables

Via

Access Paper or Ask Questions

PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Aug 26, 2024

Omar Alnaseri, Ibtesam R. K. Al-Saedi, Yassine Himeur, Hongxiang Li

Figure 1 for PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Figure 2 for PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Figure 3 for PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Figure 4 for PAPR Reduction based on Deep Learning Autoencoder in Coherent Optical OFDM Systems

Abstract:This paper presents an innovative approach to reducing Peak-to-Average Power Ratio (PAPR) in Coherent Optical Orthogonal Frequency Division Multiplexing (CO-OFDM) systems. The proposed deep learning autoencoder-based model eliminates the computational complexity of existing PAPR reduction techniques, such as Selective Mapping (SLM), by leveraging a novel decoder architecture at the receiver. In addition, No side information is needed in our approach, unlike SLM which requires knowledge of the PAPR distribution. Simulation results demonstrate significant improvements in both PAPR reduction and Bit Error Rate (BER) performance compared to traditional techniques. It achieves error-free transmission with over 10 dB PAPR reduction compared to unmitigated and 1 dB gain over SLM technique. Furthermore, our approach exhibits robustness against noise and nonlinearity effects, enabling reliable transmission over optical channels with varying levels of impairment. The proposed technique has far-reaching implications for next-generation optical communication systems, where efficient PAPR reduction is crucial for ensuring reliable data transfer.

* submitted to ICSPIS 2024

Via

Access Paper or Ask Questions

Deep Transfer Learning for Kidney Cancer Diagnosis

Aug 08, 2024

Yassine Habchi, Hamza Kheddar, Yassine Himeur, Abdelkrim Boukabou, Shadi Atalla, Wathiq Mansoor, Hussain Al-Ahmad

Abstract:Many incurable diseases prevalent across global societies stem from various influences, including lifestyle choices, economic conditions, social factors, and genetics. Research predominantly focuses on these diseases due to their widespread nature, aiming to decrease mortality, enhance treatment options, and improve healthcare standards. Among these, kidney disease stands out as a particularly severe condition affecting men and women worldwide. Nonetheless, there is a pressing need for continued research into innovative, early diagnostic methods to develop more effective treatments for such diseases. Recently, automatic diagnosis of Kidney Cancer has become an important challenge especially when using deep learning (DL) due to the importance of training medical datasets, which in most cases are difficult and expensive to obtain. Furthermore, in most cases, algorithms require data from the same domain and a powerful computer with efficient storage capacity. To overcome this issue, a new type of learning known as transfer learning (TL) has been proposed that can produce impressive results based on other different pre-trained data. This paper presents, to the best of the authors' knowledge, the first comprehensive survey of DL-based TL frameworks for kidney cancer diagnosis. This is a strong contribution to help researchers understand the current challenges and perspectives of this topic. Hence, the main limitations and advantages of each framework are identified and detailed critical analyses are provided. Looking ahead, the article identifies promising directions for future research. Moving on, the discussion is concluded by reflecting on the pivotal role of TL in the development of precision medicine and its effects on clinical practice and research in oncology.

* 32 pages, 8 figures and 8 tables

Via

Access Paper or Ask Questions

Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey

Jul 25, 2024

Shahab Saquib Sohail, Yassine Himeur, Hamza Kheddar, Abbes Amira, Fodil Fadli, Shadi Atalla, Abigail Copiaco, Wathiq Mansoor

Figure 1 for Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey

Figure 2 for Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey

Figure 3 for Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey

Figure 4 for Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey

Abstract:The 3D point cloud (3DPC) has significantly evolved and benefited from the advance of deep learning (DL). However, the latter faces various issues, including the lack of data or annotated data, the existence of a significant gap between training data and test data, and the requirement for high computational resources. To that end, deep transfer learning (DTL), which decreases dependency and costs by utilizing knowledge gained from a source data/task in training a target data/task, has been widely investigated. Numerous DTL frameworks have been suggested for aligning point clouds obtained from several scans of the same scene. Additionally, DA, which is a subset of DTL, has been modified to enhance the point cloud data's quality by dealing with noise and missing points. Ultimately, fine-tuning and DA approaches have demonstrated their effectiveness in addressing the distinct difficulties inherent in point cloud data. This paper presents the first review shedding light on this aspect. it provides a comprehensive overview of the latest techniques for understanding 3DPC using DTL and domain adaptation (DA). Accordingly, DTL's background is first presented along with the datasets and evaluation metrics. A well-defined taxonomy is introduced, and detailed comparisons are presented, considering different aspects such as different knowledge transfer strategies, and performance. The paper covers various applications, such as 3DPC object detection, semantic labeling, segmentation, classification, registration, downsampling/upsampling, and denoising. Furthermore, the article discusses the advantages and limitations of the presented frameworks, identifies open challenges, and suggests potential research directions.

* 55 pages, 9 tables, and 15 figures

Via

Access Paper or Ask Questions

AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI

Jun 11, 2024

Ayman Al-Kababji, Faycal Bensaali, Sarada Prasad Dakua, Yassine Himeur

Figure 1 for AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI

Figure 2 for AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI

Figure 3 for AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI

Figure 4 for AI Radiologist: Revolutionizing Liver Tissue Segmentation with Convolutional Neural Networks and a Clinician-Friendly GUI

Abstract:Artificial Intelligence (AI) is a pervasive research topic, permeating various sectors and applications. In this study, we harness the power of AI, specifically convolutional neural networks (ConvNets), for segmenting liver tissues. It also focuses on developing a user-friendly graphical user interface (GUI) tool, "AI Radiologist", enabling clinicians to effectively delineate different liver tissues (parenchyma, tumors, and vessels), thereby saving lives. This endeavor bridges the gap between academic research and practical, industrial applications. The GUI is a single-page application and is designed using the PyQt5 Python framework. The offline-available AI Radiologist resorts to three ConvNet models trained to segment all liver tissues. With respect to the Dice metric, the best liver ConvNet scores 98.16%, the best tumor ConvNet scores 65.95%, and the best vessel ConvNet scores 51.94%. It outputs 2D slices of the liver, tumors, and vessels, along with 3D interpolations in .obj and .mtl formats, which can be visualized/printed using any 3D-compatible software. Thus, the AI Radiologist offers a convenient tool for clinicians to perform liver tissue segmentation and 3D interpolation employing state-of-the-art models for tissues segmentation. With the provided capacity to select the volumes and pre-trained models, the clinicians can leave the rest to the AI Radiologist.

* 38 pages, 19 figures, 7 tables submitted to journal

Via

Access Paper or Ask Questions