Abstract: Imaging mass cytometry (IMC) is a relatively new technique for imaging biological tissue at subcellular resolution. In recent years, learning-based segmentation methods have enabled precise quantification of cell type and morphology, but they typically rely on large datasets with fully annotated ground truth (GT) labels. This paper explores the effects of imperfect labels on learning-based segmentation models and evaluates the generalisability of these models to different tissue types. Our results show that removing 50% of cell annotations from GT masks only reduces the Dice similarity coefficient (DSC) to 0.874 (from 0.889 achieved by a model trained on fully annotated GT masks). This implies that annotation time can in fact be reduced by at least half without detrimentally affecting performance. Furthermore, training our single-tissue model on imperfect labels only decreases the DSC by 0.031 on an unseen tissue type compared to its multi-tissue counterpart, with negligible qualitative differences in segmentation. Additionally, bootstrapping the worst-performing model (trained with only 5% of cell annotations) a total of ten times improves its original DSC of 0.720 to 0.829. These findings imply that comparable segmentation models can be produced with less time and effort; this includes eliminating the need for multiple IMC tissue types during training, whilst also offering the potential for models trained with very few labels to improve on themselves. Source code is available on GitHub: https://github.com/kimberley/ISBI2024.
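For reference, the DSC reported throughout is the standard overlap measure between a predicted segmentation mask and the GT mask. A minimal NumPy sketch (the function name and the epsilon smoothing are illustrative, not taken from the released code):

    import numpy as np

    def dice_similarity(pred, gt, eps=1e-7):
        # Dice similarity coefficient between two binary masks.
        pred, gt = pred.astype(bool), gt.astype(bool)
        intersection = np.logical_and(pred, gt).sum()
        return (2.0 * intersection + eps) / (pred.sum() + gt.sum() + eps)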
Abstract: Colonoscopy frames are complex, with intrinsic image-formation artefacts such as light reflections and a wide diversity of polyp types and shapes; in addition, the publicly available polyp segmentation training datasets are small and imbalanced. As a result, automated polyp segmentation with deep neural networks remains an open challenge, because training on small datasets leads to overfitting. We propose a simple yet effective polyp segmentation pipeline that couples a segmentation (FCN) task with a classification (CNN) task. We find that interactive weight transfer between the dense and coarse vision tasks mitigates overfitting during learning, which motivates a new training scheme within our segmentation pipeline. Our method is evaluated on the CVC-EndoSceneStill and Kvasir-SEG datasets, where it achieves Polyp-IoU improvements of 4.34% and 5.70%, respectively, over state-of-the-art methods.
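One way to realise such interactive weight transfer is to give the dense (segmentation) and coarse (classification) networks a shared encoder and copy its weights between the two tasks at each training phase. A minimal PyTorch sketch with illustrative module names (a sketch of the idea, not the paper's actual architecture):

    import torch.nn as nn

    class Encoder(nn.Module):
        # Shared backbone used by both the FCN and the CNN classifier.
        def __init__(self):
            super().__init__()
            self.features = nn.Sequential(
                nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
                nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            )

        def forward(self, x):
            return self.features(x)

    def transfer_weights(src: Encoder, dst: Encoder):
        # Copy shared encoder weights between tasks, e.g. initialise the
        # segmentation encoder after a classification training phase.
        dst.load_state_dict(src.state_dict())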
Abstract: This paper presents a spatiotemporal deep learning approach for mouse behavioural classification in the home cage. Using a series of dual-stream architectures with assorted modifications to increase performance, we introduce a novel feature-sharing approach that jointly processes the two streams at regular intervals throughout the network. Using a publicly available labelled dataset of singly housed mice, we achieve a prediction accuracy of 86.47% with an ensemble of Inception-based networks that use feature sharing. We also demonstrate through ablation studies that, for all models, the feature-sharing architectures consistently outperform conventional ones with separate streams. The best-performing models were further evaluated on other activity datasets, both mouse and human, and achieved state-of-the-art results. Future work will investigate the effectiveness of feature sharing for behavioural classification in the unsupervised anomaly-detection domain.
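The core idea of feature sharing can be illustrated as a stage that fuses the two streams' feature maps and feeds the fused result back to both streams before the next stage. A minimal PyTorch sketch under our own naming (not the exact Inception-based blocks used in the paper):

    import torch
    import torch.nn as nn

    class SharedStage(nn.Module):
        # One stage of a dual-stream network with feature sharing: each
        # stream is processed separately, then a fused map is added back
        # to both streams before the next stage.
        def __init__(self, channels):
            super().__init__()
            self.stream_a = nn.Conv2d(channels, channels, 3, padding=1)
            self.stream_b = nn.Conv2d(channels, channels, 3, padding=1)
            self.fuse = nn.Conv2d(2 * channels, channels, 1)

        def forward(self, a, b):
            a = torch.relu(self.stream_a(a))
            b = torch.relu(self.stream_b(b))
            shared = self.fuse(torch.cat([a, b], dim=1))
            return a + shared, b + shared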
Abstract: This paper explores the application of unsupervised learning to detecting anomalies in mouse video data. The two models presented in this paper are a dual-stream 3D convolutional autoencoder (with residual connections) and a dual-stream 2D convolutional autoencoder. The publicly available dataset used here contains twelve videos of singly housed, home-caged mice alongside frame-level annotations. Under the pretext that the autoencoder only sees normal events, the video data were curated so that each behaviour could be treated as a pseudo-anomaly by withholding it from training. Results are presented for one conspicuous behaviour (hang) and one inconspicuous behaviour (groom). The performance of these models is compared to that of a single-stream autoencoder and a supervised learning model, both based on the same custom convolutional autoencoder (CAE). Both models were also tested on the CUHK Avenue dataset and were found to perform as well as some state-of-the-art architectures.
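The anomaly-scoring principle behind such autoencoders can be sketched as follows: frames that a model trained only on normal events reconstructs poorly are flagged as anomalous. A minimal illustration (the Keras-style predict call and the min-max normalisation are assumptions, not the paper's code):

    import numpy as np

    def anomaly_scores(autoencoder, frames):
        # frames: (n, h, w, c) array of video frames.
        recon = autoencoder.predict(frames)  # assumed Keras-style API
        errors = np.mean((frames - recon) ** 2, axis=(1, 2, 3))
        # Min-max normalise per clip, a common convention in the field.
        return (errors - errors.min()) / (errors.max() - errors.min() + 1e-7)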
Abstract: In this paper we propose a 2D deep residual Unet with 104 convolutional layers (DR-Unet104) for lesion segmentation in brain MRIs. We make multiple additions to the Unet architecture, including adding 'bottleneck' residual blocks to the Unet encoder and adding dropout after each convolution block stack. We verified the effect of introducing dropout regularisation with a small rate (e.g. 0.2), and found that a dropout of 0.2 improved overall performance compared to no dropout or a dropout of 0.5. We evaluated the proposed architecture as part of the Multimodal Brain Tumor Segmentation (BraTS) 2020 Challenge and compared our method to DeepLabV3+ with a ResNet-V2-152 backbone. On the validation data, DR-Unet104 achieved mean Dice similarity coefficients (DSC) of 0.8862, 0.6756 and 0.6721 for the whole tumor, enhancing tumor and tumor core respectively, an overall improvement over the 0.8770, 0.65242 and 0.68134 achieved by DeepLabV3+. Our method produced a final mean DSC of 0.8673, 0.7514 and 0.7983 for the whole tumor, enhancing tumor and tumor core on the challenge's testing data. We present this as a state-of-the-art 2D lesion segmentation architecture that can be used on lower-power computers than a 3D architecture. The source code and trained model for this work are openly available at https://github.com/jordan-colman/DR-Unet104.
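A 'bottleneck' residual block follows the 1x1 -> 3x3 -> 1x1 convolution pattern of deep ResNets. The PyTorch sketch below shows the kind of block described above with 2D dropout applied after it; channel sizes and layer ordering are illustrative, not the exact DR-Unet104 implementation:

    import torch.nn as nn

    class BottleneckBlock(nn.Module):
        # 1x1 reduce -> 3x3 -> 1x1 expand, with an identity skip connection
        # and dropout (rate 0.2, the best-performing setting above).
        def __init__(self, channels, reduced):
            super().__init__()
            self.body = nn.Sequential(
                nn.Conv2d(channels, reduced, 1), nn.BatchNorm2d(reduced), nn.ReLU(),
                nn.Conv2d(reduced, reduced, 3, padding=1), nn.BatchNorm2d(reduced), nn.ReLU(),
                nn.Conv2d(reduced, channels, 1), nn.BatchNorm2d(channels),
            )
            self.relu = nn.ReLU()
            self.drop = nn.Dropout2d(p=0.2)

        def forward(self, x):
            return self.drop(self.relu(x + self.body(x)))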
Abstract: In this paper, we propose a novel learning-based method for automated segmentation of brain tumors in multimodal MRI images, which incorporates two sets of features: machine-learned and hand-crafted. A fully convolutional network (FCN) provides the machine-learned features, while texton-based features serve as the hand-crafted features. A random forest (RF) is used to classify the MRI image voxels into normal brain tissues and different parts of the tumor, i.e. edema, necrosis and enhancing tumor. The method was evaluated on the BRATS 2017 challenge dataset. The results show that the proposed method provides promising segmentations. The mean Dice overlap measure for automatic brain tumor segmentation against ground truth is 0.86, 0.78 and 0.66 for whole tumor, core and enhancing tumor, respectively.
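The voxel-wise classification step can be sketched by concatenating the two feature sets and training a random forest with scikit-learn; the feature dimensions, forest size and random stand-in data below are placeholders, not values from the paper:

    import numpy as np
    from sklearn.ensemble import RandomForestClassifier

    rng = np.random.default_rng(0)
    fcn_features = rng.normal(size=(1000, 16))    # stand-in machine-learned features
    texton_features = rng.normal(size=(1000, 8))  # stand-in hand-crafted features
    labels = rng.integers(0, 4, size=1000)        # normal / edema / necrosis / enhancing

    X = np.concatenate([fcn_features, texton_features], axis=1)
    rf = RandomForestClassifier(n_estimators=100).fit(X, labels)
    voxel_classes = rf.predict(X)  # per-voxel tissue class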
Abstract: One trend in recent healthcare transformations is that people are encouraged to monitor and manage their health based on their daily diets and physical activity habits. However, most attention in the use of operational research and analytical models in healthcare has been paid to the systemic level, such as country or regional policy making or organisational issues. This paper proposes a model concerned with healthcare analytics at the individual level, which can predict human physical activity status from sequential lifelogging data collected from wearable sensors. The model has a two-stage hybrid structure (in short, MOGP-HMM): a multi-objective genetic programming (MOGP) algorithm in the first stage to reduce the dimensionality of the lifelogging data, and a hidden Markov model (HMM) in the second stage for activity status prediction over time. It can be used as a decision support tool to provide real-time monitoring, statistical analysis and personalised advice to individuals, encouraging positive attitudes towards healthy lifestyles. We validate the model with real data collected from a group of participants in the UK and compare it with other popular two-stage hybrid models. Our experimental results show that the MOGP-HMM achieves comparable performance. To the best of our knowledge, this is the first study that uses MOGP in a hybrid two-stage structure for individuals' activity status prediction. It fits seamlessly with the current trend in the UK healthcare transformation of patient empowerment, as well as contributing to a strategic development for more efficient and cost-effective provision of healthcare.
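The second-stage prediction can be illustrated with an off-the-shelf Gaussian HMM, here from the hmmlearn library, used purely as a stand-in; the paper's own HMM and the MOGP reduction step are not reproduced:

    import numpy as np
    from hmmlearn import hmm

    rng = np.random.default_rng(0)
    reduced = rng.normal(size=(500, 3))  # lifelogging features after MOGP reduction

    model = hmm.GaussianHMM(n_components=4, covariance_type="diag", n_iter=50)
    model.fit(reduced)
    states = model.predict(reduced)  # inferred activity status over time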
Abstract: Individual pig detection and tracking is an important requirement in many video-based pig monitoring applications. However, it remains a challenging task in complex scenes due to light fluctuation, the similar appearance of pigs, shape deformations and occlusions. To tackle these problems, we propose a robust real-time method for detecting and tracking multiple pigs that requires neither manual marking nor physical identification of the pigs, and works under both daylight and infrared light conditions. Our method couples a CNN-based detector and a correlation filter-based tracker via a novel hierarchical data association algorithm. The detector achieves the best accuracy/speed trade-off by using features derived from multiple layers at different scales in a one-stage prediction network. We define a tag-box for each pig as the tracking target, and multiple object tracking is conducted in a key-points tracking manner using learned correlation filters. Under challenging conditions, tracking failures are modelled based on the relations between the responses of the detector and the tracker; the data association algorithm allows the detection hypotheses to be refined, while drifted tracks are corrected by probing the tracking failures and re-initialising the tracking. As a result, optimal tracklets grow sequentially with online-refined detections, and tracking fragments are correctly integrated into their respective tracks while keeping the original identities. Experiments with a dataset captured from a commercial farm show that our method can robustly detect and track multiple pigs under challenging conditions. The promising performance of the proposed method also demonstrates the feasibility of long-term individual pig tracking in a complex environment, and thus its commercial potential.
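The essence of associating per-frame detections with existing tracks can be sketched with a greedy IoU matcher; this is a deliberately simplified stand-in for the hierarchical data association algorithm described above:

    def iou(a, b):
        # Intersection-over-union of two (x1, y1, x2, y2) boxes.
        x1, y1 = max(a[0], b[0]), max(a[1], b[1])
        x2, y2 = min(a[2], b[2]), min(a[3], b[3])
        inter = max(0, x2 - x1) * max(0, y2 - y1)
        union = (a[2]-a[0])*(a[3]-a[1]) + (b[2]-b[0])*(b[3]-b[1]) - inter
        return inter / (union + 1e-7)

    def associate(track_boxes, detections, threshold=0.5):
        # Greedy detection-to-track assignment; unmatched detections can
        # seed new tracks and unmatched tracks are probed for failure.
        matches, unmatched = [], list(range(len(detections)))
        for t, tb in enumerate(track_boxes):
            scored = [(iou(tb, detections[d]), d) for d in unmatched]
            if scored:
                best, d = max(scored)
                if best >= threshold:
                    matches.append((t, d))
                    unmatched.remove(d)
        return matches, unmatched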
Abstract: Purpose: Atrial fibrillation (AF) is the most common cardiac arrhythmia and is correlated with increased morbidity and mortality. It is associated with atrial fibrosis, which may be assessed non-invasively using late gadolinium-enhanced (LGE) magnetic resonance imaging (MRI), where scar tissue is visualised as a region of signal enhancement. In this study, we propose a novel fully automatic pipeline for accurate and objective segmentation and assessment of atrial scarring in LGE MRI scans of AF patients. Methods: Our fully automatic pipeline uniquely combines: (1) a multi-atlas based whole heart segmentation (MA-WHS), which determines the cardiac anatomy from an MRI Roadmap acquisition that is then mapped to the LGE MRI, and (2) a super-pixel and supervised learning based approach to delineate the distribution and extent of atrial scarring in the LGE MRI. Results: Our MA-WHS and atrial scarring segmentations showed accurate delineation of the cardiac anatomy (mean Dice = 89%) and of the atrial scarring (mean Dice = 79%), respectively, compared to the established ground truth from manual segmentation. Compared with previously studied methods requiring manual intervention, our pipeline demonstrated comparable results while being computed fully automatically. Conclusion: The proposed segmentation methods allow LGE MRI to be used as an objective assessment tool for localisation, visualisation and quantification of atrial scarring.
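The scar-delineation stage can be illustrated by over-segmenting the image into super-pixels and classifying each one from its intensity features; the SLIC implementation, the threshold standing in for the trained classifier, and the random image below are all illustrative:

    import numpy as np
    from skimage.segmentation import slic

    rng = np.random.default_rng(0)
    lge = rng.random((128, 128))  # stand-in for an LGE MRI slice

    # Over-segment into super-pixels, then summarise each by mean intensity.
    segments = slic(lge, n_segments=200, compactness=0.1, channel_axis=None)
    means = {s: lge[segments == s].mean() for s in np.unique(segments)}
    # A supervised classifier trained on labelled super-pixels would replace
    # this simple threshold for marking enhanced (scar) regions.
    scar_mask = np.isin(segments, [s for s, m in means.items() if m > 0.6])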
Abstract: Fast Magnetic Resonance Imaging (MRI) is in high demand for many clinical applications, in order to reduce scanning costs and improve the patient experience. It can also potentially increase image quality by reducing motion artefacts and contrast washout. However, once the image field of view and the desired resolution are chosen, the minimum scanning time is normally determined by the requirement to acquire sufficient raw data to meet the Nyquist-Shannon sampling criteria. Compressive Sensing (CS) theory is well matched to MRI scanning sequence design, requiring far less raw data for image reconstruction. Inspired by recent advances in deep learning for solving various inverse problems, we propose a conditional Generative Adversarial Network-based deep learning framework for de-aliasing and reconstructing MRI images from highly undersampled data, with great promise to accelerate the data acquisition process. By coupling an innovative content loss with the adversarial loss, our de-aliasing results are more realistic. Furthermore, we propose a refinement learning procedure for training the generator network, which stabilises the training with fast convergence and less parameter tuning. We demonstrate that the proposed framework outperforms state-of-the-art CS-MRI methods in terms of reconstruction error and perceptual image quality. In addition, our method can reconstruct each image in 0.22 ms to 0.37 ms, which is promising for real-time applications.
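The coupling of content and adversarial losses can be sketched as a weighted sum in the generator objective; the use of plain MSE as the content term and the weighting below are illustrative assumptions, not the paper's exact loss:

    import torch
    import torch.nn.functional as F

    def generator_loss(d_fake_logits, recon, fully_sampled, lam=0.01):
        # Content term keeps the de-aliased image faithful to the target;
        # the adversarial term pushes it towards realistic MR appearance.
        content = F.mse_loss(recon, fully_sampled)
        adversarial = F.binary_cross_entropy_with_logits(
            d_fake_logits, torch.ones_like(d_fake_logits))
        return content + lam * adversarial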