Barry Cardiff

ORXE: Orchestrating Experts for Dynamically Configurable Efficiency

May 07, 2025

Qingyuan Wang, Guoxin Wang, Barry Cardiff, Deepu John

Abstract: This paper presents ORXE, a modular and adaptable framework for achieving real-time configurable efficiency in AI models. By leveraging a collection of pre-trained experts with diverse computational costs and performance levels, ORXE dynamically adjusts inference pathways based on the complexity of input samples. Unlike conventional approaches that require complex metamodel training, ORXE achieves high efficiency and flexibility without complicating the development process. The proposed system utilizes a confidence-based gating mechanism to allocate appropriate computational resources for each input. ORXE also supports adjustments to the preference between inference cost and prediction performance across a wide range during runtime. We implemented a training-free ORXE system for image classification tasks, evaluating its efficiency and accuracy across various devices. The results demonstrate that ORXE achieves superior performance compared to individual experts and other dynamic models in most cases. This approach can be extended to other applications, providing a scalable solution for diverse real-world deployment scenarios.
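
The confidence-based gating described in this abstract can be pictured with a small sketch. This is not the authors' implementation: the two toy experts, the function name cascade_predict, and the threshold values are all hypothetical, chosen only to show how a runtime threshold shifts work between a cheap and an expensive expert.

```python
from typing import Callable, List, Sequence, Tuple

# An expert maps an input to (predicted_label, confidence in [0, 1]).
Expert = Callable[[Sequence[float]], Tuple[str, float]]


def cascade_predict(x: Sequence[float], experts: List[Expert],
                    thresholds: List[float]) -> Tuple[str, int]:
    """Try experts from cheapest to most expensive.

    Each expert except the last has a confidence threshold. The first expert
    whose confidence meets its threshold answers; the last expert always
    answers. Returns (label, index of the expert that answered).
    """
    if len(thresholds) != len(experts) - 1:
        raise ValueError("need one threshold per expert except the last")
    for index, threshold in enumerate(thresholds):
        label, confidence = experts[index](x)
        if confidence >= threshold:
            return label, index
    label, _ = experts[-1](x)
    return label, len(experts) - 1


def tiny_expert(x: Sequence[float]) -> Tuple[str, float]:
    # Hypothetical cheap model: looks at one feature only and is
    # confident only when that feature is far from zero.
    score = x[0]
    return ("pos" if score > 0 else "neg", min(1.0, abs(score)))


def large_expert(x: Sequence[float]) -> Tuple[str, float]:
    # Hypothetical expensive model: uses every feature.
    score = sum(x)
    return ("pos" if score > 0 else "neg", 1.0)
```

With the threshold set to 0.8, the input [0.2, -0.5] is passed on to the large expert; lowering the threshold to 0.1 at runtime lets the tiny expert answer instead, trading accuracy for cost without retraining anything.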

A Review on Multisensor Data Fusion for Wearable Health Monitoring

Dec 08, 2024

Arlene John, Barry Cardiff, Deepu John

Abstract: The growing demand for accurate, continuous, and non-invasive health monitoring has propelled multi-sensor data fusion to the forefront of healthcare technology. This review aims to provide an overview of the development of fusion frameworks and of the common terminology used in the fusion literature. The review introduces the fusion classification standards and methods that are most relevant from an algorithm development perspective. Applications of the reviewed fusion frameworks in fields such as defense, autonomous driving, robotics, and image fusion are also discussed to provide contextual information on the various fusion methodologies that have been developed in this field. This review provides a comprehensive analysis of multi-sensor data fusion methods applied to health monitoring systems, focusing on key algorithms, applications, challenges, and future directions. We examine commonly used fusion techniques, including Kalman filters, Bayesian networks, and machine learning models. By integrating data from various sources, these fusion approaches enhance the reliability, accuracy, and resilience of health monitoring systems. However, challenges such as data quality and differences in acquisition systems remain, which has motivated the development of intelligent fusion algorithms in recent years. The review finally converges on applications of fusion algorithms in biomedical inference tasks like heartbeat detection, respiration rate estimation, sleep apnea detection, arrhythmia detection, and atrial fibrillation detection.
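
As a minimal illustration of the Kalman-filter style of fusion mentioned in this abstract, the sketch below applies a single scalar Kalman measurement update to two noisy estimates of the same quantity. The sensor names (ECG and PPG), the heart-rate values, and the variances are assumptions made up for the example and are not taken from the review.

```python
def fuse_two_estimates(mean_a: float, var_a: float,
                       mean_b: float, var_b: float) -> tuple:
    """Scalar Kalman measurement update.

    Estimate A is treated as the prior and estimate B as a new measurement
    of the same quantity. The result weights each source by the inverse of
    its variance, so the less noisy sensor dominates.
    """
    gain = var_a / (var_a + var_b)          # Kalman gain
    fused_mean = mean_a + gain * (mean_b - mean_a)
    fused_var = (1.0 - gain) * var_a        # never larger than either input
    return fused_mean, fused_var


# Hypothetical readings: heart rate from ECG (72 bpm, variance 4)
# and from PPG (76 bpm, variance 12).
fused_mean, fused_var = fuse_two_estimates(72.0, 4.0, 76.0, 12.0)
```

Here the gain is 4 / (4 + 12) = 0.25, so the fused estimate is 73 bpm with variance 3, which is lower than the variance of either sensor alone.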

Tiny Models are the Computational Saver for Large Models

Mar 26, 2024

Qingyuan Wang, Barry Cardiff, Antoine Frappé, Benoit Larras, Deepu John

Abstract: This paper introduces TinySaver, an early-exit-like dynamic model compression approach which employs tiny models to substitute large models adaptively. Distinct from traditional compression techniques, dynamic methods like TinySaver can leverage the difficulty differences to allow certain inputs to complete their inference processes early, thereby conserving computational resources. Most existing early exit designs are implemented by attaching additional network branches to the model's backbone. Our study, however, reveals that completely independent tiny models can replace a substantial portion of the larger models' job with minimal impact on performance. Employing them as the first exit can remarkably enhance computational efficiency. By searching and employing the most appropriate tiny model as the computational saver for a given large model, the proposed approach works as a novel and generic method for model compression. This finding will help the research community in exploring new compression methods to address the escalating computational demands posed by rapidly evolving AI models. Our evaluation of this approach in ImageNet-1k classification demonstrates its potential to reduce the number of compute operations by up to 90%, with only negligible losses in performance, across various modern vision models. The code for this work will be made available.
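
The arithmetic behind using a tiny model as a first exit can be sketched as follows. This is an illustrative cost model only, not the search procedure from the paper: the candidate names, costs, and exit rates are invented, and each exit rate is assumed to have been measured on validation data at a confidence threshold that keeps the accuracy loss acceptable.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SaverCandidate:
    name: str
    cost: float       # compute per sample (hypothetical units, e.g. GFLOPs)
    exit_rate: float  # fraction of samples the tiny model answers on its own


def expected_cost(candidate: SaverCandidate, large_cost: float) -> float:
    """Every sample pays for the tiny model; only the samples it cannot
    answer confidently also pay for the large model."""
    return candidate.cost + (1.0 - candidate.exit_rate) * large_cost


def pick_saver(candidates: List[SaverCandidate], large_cost: float) -> SaverCandidate:
    """Choose the candidate with the lowest expected per-sample compute."""
    return min(candidates, key=lambda c: expected_cost(c, large_cost))


# Hypothetical candidates for a large model costing 10 units per sample.
candidates = [
    SaverCandidate("tiny_a", cost=0.2, exit_rate=0.5),
    SaverCandidate("tiny_b", cost=0.5, exit_rate=0.8),
    SaverCandidate("tiny_c", cost=2.0, exit_rate=0.9),
]
best = pick_saver(candidates, large_cost=10.0)
```

In this toy setting tiny_b wins with an expected cost of about 2.5 units, a 75% reduction, even though tiny_c answers more samples: its own cost outweighs the extra exits.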

DyCE: Dynamic Configurable Exiting for Deep Learning Compression and Scaling

Mar 04, 2024

Qingyuan Wang, Barry Cardiff, Antoine Frappé, Benoit Larras, Deepu John

Abstract: Modern deep learning (DL) models necessitate the employment of scaling and compression techniques for effective deployment in resource-constrained environments. Most existing techniques, such as pruning and quantization, are generally static. On the other hand, dynamic compression methods, such as early exits, reduce complexity by recognizing the difficulty of input samples and allocating computation as needed. Dynamic methods, despite their superior flexibility and potential for co-existing with static methods, pose significant challenges in terms of implementation because any change in the dynamic parts influences subsequent processes. Moreover, most current dynamic compression designs are monolithic and tightly integrated with base models, thereby complicating the adaptation to novel base models. This paper introduces DyCE, a dynamic configurable early-exit framework that decouples design considerations from each other and from the base model. Utilizing this framework, various types and positions of exits can be organized according to predefined configurations, which can be dynamically switched in real-time to accommodate evolving performance-complexity requirements. We also propose techniques for generating optimized configurations based on any desired trade-off between performance and computational complexity. This empowers future researchers to focus on the improvement of individual exits without latent compromise of overall system performance. The efficacy of this approach is demonstrated through image classification tasks with deep CNNs. DyCE significantly reduces the computational complexity of ResNet152 by 23.5% and of ConvNextv2-tiny by 25.9% on ImageNet, with accuracy reductions of less than 0.5%. Furthermore, DyCE offers advantages over existing dynamic methods in terms of real-time configuration and fine-grained performance tuning.
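
One way to picture predefined exit configurations that are switched at runtime is a lookup table of per-exit confidence thresholds. The sketch below is an assumption-laden illustration, not DyCE itself: the configuration names, threshold values, and the function run_with_exits are hypothetical, and the exit logits are passed in precomputed for simplicity, whereas a real system would stop computing the backbone once an exit fires.

```python
import math
from typing import Dict, List, Optional, Tuple


def softmax_confidence(logits: List[float]) -> Tuple[int, float]:
    """Return (argmax class, its softmax probability)."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]  # shift for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]
    best = max(range(len(probs)), key=probs.__getitem__)
    return best, probs[best]


# Hypothetical configurations: one threshold per early exit, in backbone
# order. A threshold above 1.0 can never be met, so it disables that exit.
CONFIGS: Dict[str, List[float]] = {
    "accurate": [1.1, 0.99],
    "balanced": [0.90, 0.80],
    "fast": [0.60, 0.50],
}


def run_with_exits(exit_logits: List[List[float]], final_logits: List[float],
                   config_name: str) -> Tuple[int, Optional[int]]:
    """Return (predicted class, index of the exit used, or None if the
    final classifier head produced the answer)."""
    thresholds = CONFIGS[config_name]
    for position, (logits, threshold) in enumerate(zip(exit_logits, thresholds)):
        label, confidence = softmax_confidence(logits)
        if confidence >= threshold:
            return label, position
    label, _ = softmax_confidence(final_logits)
    return label, None
```

Switching from "accurate" to "fast" only changes which row of the table is read, so the trade-off can be adjusted between requests without touching the base model or the exits.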

Classification of ECG based on Hybrid Features using CNNs for Wearable Applications

Jun 14, 2022

Li Xiaolin, Fang Xiang, Rajesh C. Panicker, Barry Cardiff, Deepu John

Abstract: Sudden cardiac death and arrhythmia account for a large percentage of all deaths worldwide. Electrocardiography (ECG) is the most widely used screening tool for cardiovascular diseases. Traditionally, ECG signals are classified manually, requiring experience and great skill, while being time-consuming and prone to error. Thus, machine learning algorithms have been widely adopted because of their ability to perform complex data analysis. Features derived from the points of interest in ECG, mainly Q, R, and S, are widely used for arrhythmia detection. In this work, we demonstrate improved performance for ECG classification using hybrid features and three different models, building on a 1-D convolutional neural network (CNN) model that we had proposed in the past. A model based on RR interval features, proposed in this work, achieved an accuracy of 98.98%, which is an improvement over the baseline model. To make the model immune to noise, we updated the model using frequency features and achieved good sustained performance in the presence of noise with a slightly lower accuracy of 98.69%. Further, another model combining the frequency features and the RR interval features was developed, which achieved a high accuracy of 99% with good sustained performance in noisy environments. Due to its high accuracy and noise immunity, the proposed model, which combines multiple hybrid features, is well suited for ambulatory wearable sensing applications.
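
The abstract does not list which RR interval features the model uses. As a hedged sketch, the function below computes a few features that are commonly derived from R-peak timestamps (previous interval, next interval, and a local average); the function name, the window size, and the example timestamps are assumptions made for illustration.

```python
from typing import Dict, List


def rr_features(r_peak_times: List[float], beat_index: int,
                window: int = 8) -> Dict[str, float]:
    """Compute RR interval features for one beat.

    r_peak_times: R-peak timestamps in seconds, in ascending order.
    beat_index:   index of the beat of interest; it needs a neighbour
                  on both sides.
    window:       how many of the most recent intervals to average.
    """
    if not 0 < beat_index < len(r_peak_times) - 1:
        raise ValueError("beat needs a neighbouring R peak on both sides")
    pre_rr = r_peak_times[beat_index] - r_peak_times[beat_index - 1]
    post_rr = r_peak_times[beat_index + 1] - r_peak_times[beat_index]
    # Average over up to `window` intervals ending at this beat.
    start = max(1, beat_index - window + 1)
    recent = [r_peak_times[i] - r_peak_times[i - 1]
              for i in range(start, beat_index + 1)]
    local_mean = sum(recent) / len(recent)
    return {
        "pre_rr": pre_rr,
        "post_rr": post_rr,
        "local_mean_rr": local_mean,
        "pre_to_local": pre_rr / local_mean,
    }


# Hypothetical rhythm: steady 0.8 s beats, then one beat arriving early.
features = rr_features([0.0, 0.8, 1.6, 2.0, 3.2], beat_index=3)
```

For the early beat at 2.0 s, the previous interval (0.4 s) is short and the following one (1.2 s) is long, which is the kind of timing pattern such features expose to a classifier alongside the waveform itself.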

Multistage Pruning of CNN Based ECG Classifiers for Edge Devices

Aug 31, 2021

Xiaolin Li, Rajesh Panicker, Barry Cardiff, Deepu John

Abstract: Using smart wearable devices to monitor patients' electrocardiograms (ECG) for real-time detection of arrhythmias can significantly improve healthcare outcomes. Convolutional neural network (CNN) based deep learning has been used successfully to detect anomalous beats in ECG. However, the computational complexity of existing CNN models prohibits them from being implemented in low-powered edge devices. Usually, such models are complex, with many parameters, which results in a large number of computations and high memory and power usage in edge devices. Network pruning techniques can reduce model complexity at the expense of performance in CNN models. This paper presents a novel multistage pruning technique that reduces CNN model complexity with negligible loss in performance compared to existing pruning techniques. An existing CNN model for ECG classification is used as a baseline reference. At 60% sparsity, the proposed technique achieves 97.7% accuracy and an F1 score of 93.59% for ECG classification tasks. This is an improvement of 3.3% and 9% for accuracy and F1 score respectively, compared to the traditional pruning-with-fine-tuning approach. Compared to the baseline model, we also achieve a 60.4% decrease in run-time complexity.

* 4 pages
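
The abstract above does not spell out the stages of the proposed technique. As a generic stand-in, the sketch below shows plain magnitude pruning applied in gradually increasing steps toward a target sparsity, with an optional fine-tuning hook between steps. The function names, the linear schedule, and the example weights are assumptions and should not be read as the paper's method.

```python
from typing import Callable, List, Optional


def magnitude_prune(weights: List[float], sparsity: float) -> List[float]:
    """Zero the smallest-magnitude weights so that roughly `sparsity`
    (a fraction between 0 and 1) of all weights are zero."""
    n_prune = int(round(sparsity * len(weights)))
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def multistage_prune(weights: List[float], target_sparsity: float, stages: int,
                     fine_tune: Optional[Callable[[List[float]], List[float]]] = None
                     ) -> List[float]:
    """Raise sparsity linearly over `stages` steps. `fine_tune` is a
    hypothetical hook standing in for retraining between steps."""
    current = list(weights)
    for stage in range(1, stages + 1):
        current = magnitude_prune(current, target_sparsity * stage / stages)
        if fine_tune is not None:
            current = fine_tune(current)
    return current


# Hypothetical layer with ten weights, pruned to 60% sparsity in three steps.
layer = [0.5, -0.1, 0.05, 0.9, -0.3, 0.02, 0.7, -0.6, 0.2, 0.01]
pruned_layer = multistage_prune(layer, target_sparsity=0.6, stages=3)
```

Without a fine-tuning hook the staged result equals one-shot pruning; the point of staging is that retraining between steps lets the surviving weights compensate before the next cut.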

SomnNET: An SpO2 Based Deep Learning Network for Sleep Apnea Detection in Smartwatches

Aug 25, 2021

Arlene John, Koushik Kumar Nundy, Barry Cardiff, Deepu John

Abstract: The abnormal pause or rate reduction in breathing is known as the sleep apnea-hypopnea syndrome and affects the quality of sleep of an individual. A novel method for the detection of sleep apnea events (pause in breathing) from peripheral oxygen saturation (SpO2) signals obtained from wearable devices is discussed in this paper. The paper details an apnea detection algorithm of a very high resolution on a per-second basis, for which a 1-dimensional convolutional neural network, which we termed SomnNET, is developed. This network exhibits an accuracy of 97.08% and outperforms several lower resolution state-of-the-art apnea detection methods. The feasibility of model pruning and binarization to reduce the computational complexity is explored. The pruned network with 80% sparsity exhibited an accuracy of 89.75%, and the binarized network exhibited an accuracy of 68.22%. The performance of the proposed networks is compared against several state-of-the-art algorithms.

* Accepted for discussion at the IEEE Engineering in Medicine and Biology Conference (EMBC) 2021
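
The SomnNET abstract mentions binarization without naming a scheme. One common choice, shown here purely as an assumption, is scaled sign binarization: each weight keeps only its sign, and a single shared scale (the mean absolute weight) preserves the overall magnitude. The function name and the example weights are hypothetical.

```python
from typing import List, Tuple


def binarize(weights: List[float]) -> Tuple[List[float], float]:
    """Replace every weight by +alpha or -alpha.

    alpha is the mean absolute value of the original weights, so the layer
    keeps its average magnitude while each weight needs only one sign bit.
    Returns (binarized weights, alpha).
    """
    alpha = sum(abs(w) for w in weights) / len(weights)
    return [alpha if w >= 0 else -alpha for w in weights], alpha


# Hypothetical filter weights.
binary_weights, alpha = binarize([0.5, -1.5, 1.0, -1.0])
```

In this example alpha is 1.0 and the weights become [1.0, -1.0, 1.0, -1.0]. Each weight then needs one bit plus a shared scale instead of a 32-bit float, and the information discarded in the process is consistent with the accuracy drop the abstract reports for the binarized network.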

A 1D-CNN Based Deep Learning Technique for Sleep Apnea Detection in IoT Sensors

May 02, 2021

Arlene John, Barry Cardiff, Deepu John

Abstract: Internet of Things (IoT) enabled wearable sensors for health monitoring are widely used to reduce the cost of personal healthcare and improve quality of life. The sleep apnea-hypopnea syndrome, characterized by the abnormal reduction or pause in breathing, greatly affects the quality of sleep of an individual. This paper introduces a novel method for apnea detection (pause in breathing) from electrocardiogram (ECG) signals obtained from wearable devices. The novelty stems from the high resolution of apnea detection on a second-by-second basis, and this is achieved using a 1-dimensional convolutional neural network for feature extraction and detection of sleep apnea events. The proposed method exhibits an accuracy of 99.56% and a sensitivity of 96.05%. This model outperforms several lower resolution state-of-the-art apnea detection methods. The complexity of the proposed model is analyzed. We also analyze the feasibility of model pruning and binarization to reduce the resource requirements on a wearable IoT device. The pruned model with 80% sparsity exhibited an accuracy of 97.34% and a sensitivity of 86.48%. The binarized model exhibited an accuracy of 75.59% and sensitivity of 63.23%. The performance of low complexity patient-specific models derived from the generic model is also studied to analyze the feasibility of retraining existing models to fit patient-specific requirements. The patient-specific models on average exhibited an accuracy of 97.79% and sensitivity of 92.23%. The source code for this work is made publicly available.

* Accepted for discussion at the IEEE International Symposium on Circuits and Systems (ISCAS) 2021
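
The abstract above reports both accuracy and sensitivity for second-by-second detection. The sketch below shows how the two figures are computed from per-second labels and why they can diverge; the label sequences are invented for illustration and are not data from the paper.

```python
from typing import List, Tuple


def accuracy_and_sensitivity(predicted: List[int], actual: List[int]) -> Tuple[float, float]:
    """Compute accuracy and sensitivity from per-second labels.

    Labels: 1 = apnea, 0 = normal breathing.
    accuracy    = correctly labelled seconds / all seconds
    sensitivity = detected apnea seconds / all true apnea seconds
    """
    if len(predicted) != len(actual) or not actual:
        raise ValueError("label lists must be non-empty and equally long")
    pairs = list(zip(predicted, actual))
    true_pos = sum(1 for p, a in pairs if p == 1 and a == 1)
    true_neg = sum(1 for p, a in pairs if p == 0 and a == 0)
    false_neg = sum(1 for p, a in pairs if p == 0 and a == 1)
    accuracy = (true_pos + true_neg) / len(actual)
    apnea_seconds = true_pos + false_neg
    sensitivity = true_pos / apnea_seconds if apnea_seconds else float("nan")
    return accuracy, sensitivity


# Hypothetical ten-second excerpt: four apnea seconds, one missed,
# plus one false alarm.
actual_labels = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
predicted_labels = [0, 0, 0, 0, 0, 1, 1, 1, 1, 0]
accuracy, sensitivity = accuracy_and_sensitivity(predicted_labels, actual_labels)
```

Here accuracy is 0.8 while sensitivity is 0.75. Because apnea seconds are usually a minority of the recording, accuracy alone can stay high while events are missed, which is why sensitivity is quoted alongside accuracy for the full, pruned, and binarized models.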
