Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Aite Zhao

SleepGMUformer: A gated multimodal temporal neural network for sleep staging

Feb 20, 2025

Chenjun Zhao, Xuesen Niu, Xinglin Yu, Long Chen, Na Lv, Huiyu Zhou, Aite Zhao

Figure 1 for SleepGMUformer: A gated multimodal temporal neural network for sleep staging

Figure 2 for SleepGMUformer: A gated multimodal temporal neural network for sleep staging

Figure 3 for SleepGMUformer: A gated multimodal temporal neural network for sleep staging

Figure 4 for SleepGMUformer: A gated multimodal temporal neural network for sleep staging

Abstract:Sleep staging is a key method for assessing sleep quality and diagnosing sleep disorders. However, current deep learning methods face challenges: 1) postfusion techniques ignore the varying contributions of different modalities; 2) unprocessed sleep data can interfere with frequency-domain information. To tackle these issues, this paper proposes a gated multimodal temporal neural network for multidomain sleep data, including heart rate, motion, steps, EEG (Fpz-Cz, Pz-Oz), and EOG from WristHR-Motion-Sleep and SleepEDF-78. The model integrates: 1) a pre-processing module for feature alignment, missing value handling, and EEG de-trending; 2) a feature extraction module for complex sleep features in the time dimension; and 3) a dynamic fusion module for real-time modality weighting.Experiments show classification accuracies of 85.03% on SleepEDF-78 and 94.54% on WristHR-Motion-Sleep datasets. The model handles heterogeneous datasets and outperforms state-of-the-art models by 1.00%-4.00%.

Via

Access Paper or Ask Questions

Multimodal Gait Recognition for Neurodegenerative Diseases

Jan 07, 2021

Aite Zhao, Jianbo Li, Junyu Dong, Lin Qi, Qianni Zhang, Ning Li, Xin Wang, Huiyu Zhou

Figure 1 for Multimodal Gait Recognition for Neurodegenerative Diseases

Figure 2 for Multimodal Gait Recognition for Neurodegenerative Diseases

Figure 3 for Multimodal Gait Recognition for Neurodegenerative Diseases

Figure 4 for Multimodal Gait Recognition for Neurodegenerative Diseases

Abstract:In recent years, single modality based gait recognition has been extensively explored in the analysis of medical images or other sensory data, and it is recognised that each of the established approaches has different strengths and weaknesses. As an important motor symptom, gait disturbance is usually used for diagnosis and evaluation of diseases; moreover, the use of multi-modality analysis of the patient's walking pattern compensates for the one-sidedness of single modality gait recognition methods that only learn gait changes in a single measurement dimension. The fusion of multiple measurement resources has demonstrated promising performance in the identification of gait patterns associated with individual diseases. In this paper, as a useful tool, we propose a novel hybrid model to learn the gait differences between three neurodegenerative diseases, between patients with different severity levels of Parkinson's disease and between healthy individuals and patients, by fusing and aggregating data from multiple sensors. A spatial feature extractor (SFE) is applied to generating representative features of images or signals. In order to capture temporal information from the two modality data, a new correlative memory neural network (CorrMNN) architecture is designed for extracting temporal features. Afterwards, we embed a multi-switch discriminator to associate the observations with individual state estimations. Compared with several state-of-the-art techniques, our proposed framework shows more accurate classification results.

Via

Access Paper or Ask Questions

Associated Spatio-Temporal Capsule Network for Gait Recognition

Jan 07, 2021

Aite Zhao, Junyu Dong, Jianbo Li, Lin Qi, Huiyu Zhou

Figure 1 for Associated Spatio-Temporal Capsule Network for Gait Recognition

Figure 2 for Associated Spatio-Temporal Capsule Network for Gait Recognition

Figure 3 for Associated Spatio-Temporal Capsule Network for Gait Recognition

Figure 4 for Associated Spatio-Temporal Capsule Network for Gait Recognition

Abstract:It is a challenging task to identify a person based on her/his gait patterns. State-of-the-art approaches rely on the analysis of temporal or spatial characteristics of gait, and gait recognition is usually performed on single modality data (such as images, skeleton joint coordinates, or force signals). Evidence has shown that using multi-modality data is more conducive to gait research. Therefore, we here establish an automated learning system, with an associated spatio-temporal capsule network (ASTCapsNet) trained on multi-sensor datasets, to analyze multimodal information for gait recognition. Specifically, we first design a low-level feature extractor and a high-level feature extractor for spatio-temporal feature extraction of gait with a novel recurrent memory unit and a relationship layer. Subsequently, a Bayesian model is employed for the decision-making of class labels. Extensive experiments on several public datasets (normal and abnormal gait) validate the effectiveness of the proposed ASTCapsNet, compared against several state-of-the-art methods.

Via

Access Paper or Ask Questions

Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

Nov 04, 2020

Zheheng Jiang, Feixiang Zhou, Aite Zhao, Xin Li, Ling Li, Dacheng Tao, Xuelong Li, Huiyu Zhou

Figure 1 for Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

Figure 2 for Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

Figure 3 for Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

Figure 4 for Muti-view Mouse Social Behaviour Recognition with Deep Graphical Model

Abstract:Home-cage social behaviour analysis of mice is an invaluable tool to assess therapeutic efficacy of neurodegenerative diseases. Despite tremendous efforts made within the research community, single-camera video recordings are mainly used for such analysis. Because of the potential to create rich descriptions of mouse social behaviors, the use of multi-view video recordings for rodent observations is increasingly receiving much attention. However, identifying social behaviours from various views is still challenging due to the lack of correspondence across data sources. To address this problem, we here propose a novel multiview latent-attention and dynamic discriminative model that jointly learns view-specific and view-shared sub-structures, where the former captures unique dynamics of each view whilst the latter encodes the interaction between the views. Furthermore, a novel multi-view latent-attention variational autoencoder model is introduced in learning the acquired features, enabling us to learn discriminative features in each view. Experimental results on the standard CRMI13 and our multi-view Parkinson's Disease Mouse Behaviour (PDMB) datasets demonstrate that our model outperforms the other state of the arts technologies and effectively deals with the imbalanced data problem.

* 17 pages, 11 figures

Via

Access Paper or Ask Questions

Perceptual underwater image enhancement with deep learning and physical priors

Sep 26, 2020

Long Chen, Zheheng Jiang, Lei Tong, Zhihua Liu, Aite Zhao, Qianni Zhang, Junyu Dong, Huiyu Zhou

Figure 1 for Perceptual underwater image enhancement with deep learning and physical priors

Figure 2 for Perceptual underwater image enhancement with deep learning and physical priors

Figure 3 for Perceptual underwater image enhancement with deep learning and physical priors

Figure 4 for Perceptual underwater image enhancement with deep learning and physical priors

Abstract:Underwater image enhancement, as a pre-processing step to improve the accuracy of the following object detection task, has drawn considerable attention in the field of underwater navigation and ocean exploration. However, most of the existing underwater image enhancement strategies tend to consider enhancement and detection as two independent modules with no interaction, and the practice of separate optimization does not always help the underwater object detection task. In this paper, we propose two perceptual enhancement models, each of which uses a deep enhancement model with a detection perceptor. The detection perceptor provides coherent information in the form of gradients to the enhancement model, guiding the enhancement model to generate patch level visually pleasing images or detection favourable images. In addition, due to the lack of training data, a hybrid underwater image synthesis model, which fuses physical priors and data-driven cues, is proposed to synthesize training data and generalise our enhancement model for real-world underwater images. Experimental results show the superiority of our proposed method over several state-of-the-art methods on both real-world and synthetic underwater datasets.

Via

Access Paper or Ask Questions