Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Oleksandr Rokovyi

Prediction of Physical Load Level by Machine Learning Analysis of Heart Activity after Exercises

Dec 20, 2019

Peng Gang, Wei Zeng, Yuri Gordienko, Oleksandr Rokovyi, Oleg Alienin, Sergii Stirenko

Figure 1 for Prediction of Physical Load Level by Machine Learning Analysis of Heart Activity after Exercises

Figure 2 for Prediction of Physical Load Level by Machine Learning Analysis of Heart Activity after Exercises

Figure 3 for Prediction of Physical Load Level by Machine Learning Analysis of Heart Activity after Exercises

Figure 4 for Prediction of Physical Load Level by Machine Learning Analysis of Heart Activity after Exercises

Abstract:The assessment of energy expenditure in real life is of great importance for monitoring the current physical state of people, especially in work, sport, elderly care, health care, and everyday life even. This work reports about application of some machine learning methods (linear regression, linear discriminant analysis, k-nearest neighbors, decision tree, random forest, Gaussian naive Bayes, support-vector machine) for monitoring energy expenditures in athletes. The classification problem was to predict the known level of the in-exercise loads (in three categories by calories) by the heart rate activity features measured during the short period of time (1 minute only) after training, i.e by features of the post-exercise load. The results obtained shown that the post-exercise heart activity features preserve the information of the in-exercise training loads and allow us to predict their actual in-exercise levels. The best performance can be obtained by the random forest classifier with all 8 heart rate features (micro-averaged area under curve value AUCmicro = 0.87 and macro-averaged one AUCmacro = 0.88) and the k-nearest neighbors classifier with 4 most important heart rate features (AUCmicro = 0.91 and AUCmacro = 0.89). The limitations and perspectives of the ML methods used are outlined, and some practical advices are proposed as to their improvement and implementation for the better prediction of in-exercise energy expenditures.

* 6 pages, 8 figures, 3 tables; preprint of paper for 2019 IEEE Symposium Series on Computational Intelligence (SSCI), December 6-9 2019, Xiamen, China

Via

Access Paper or Ask Questions

Open Source Dataset and Machine Learning Techniques for Automatic Recognition of Historical Graffiti

Aug 31, 2018

Nikita Gordienko, Peng Gang, Yuri Gordienko, Wei Zeng, Oleg Alienin, Oleksandr Rokovyi, Sergii Stirenko

Figure 1 for Open Source Dataset and Machine Learning Techniques for Automatic Recognition of Historical Graffiti

Figure 2 for Open Source Dataset and Machine Learning Techniques for Automatic Recognition of Historical Graffiti

Figure 3 for Open Source Dataset and Machine Learning Techniques for Automatic Recognition of Historical Graffiti

Figure 4 for Open Source Dataset and Machine Learning Techniques for Automatic Recognition of Historical Graffiti

Abstract:Machine learning techniques are presented for automatic recognition of the historical letters (XI-XVIII centuries) carved on the stoned walls of St.Sophia cathedral in Kyiv (Ukraine). A new image dataset of these carved Glagolitic and Cyrillic letters (CGCL) was assembled and pre-processed for recognition and prediction by machine learning methods. The dataset consists of more than 4000 images for 34 types of letters. The explanatory data analysis of CGCL and notMNIST datasets shown that the carved letters can hardly be differentiated by dimensionality reduction methods, for example, by t-distributed stochastic neighbor embedding (tSNE) due to the worse letter representation by stone carving in comparison to hand writing. The multinomial logistic regression (MLR) and a 2D convolutional neural network (CNN) models were applied. The MLR model demonstrated the area under curve (AUC) values for receiver operating characteristic (ROC) are not lower than 0.92 and 0.60 for notMNIST and CGCL, respectively. The CNN model gave AUC values close to 0.99 for both notMNIST and CGCL (despite the much smaller size and quality of CGCL in comparison to notMNIST) under condition of the high lossy data augmentation. CGCL dataset was published to be available for the data science community as an open source resource.

* 11 pages, 9 figures, accepted for 25th International Conference on Neural Information Processing (ICONIP 2018), 14-16 December, 2018 (Siem Reap, Cambodia)

Via

Access Paper or Ask Questions

Parallel Statistical and Machine Learning Methods for Estimation of Physical Load

Aug 14, 2018

Sergii Stirenko, Gang Peng, Wei Zeng, Yuri Gordienko, Oleg Alienin, Oleksandr Rokovyi, Nikita Gordienko

Figure 1 for Parallel Statistical and Machine Learning Methods for Estimation of Physical Load

Figure 2 for Parallel Statistical and Machine Learning Methods for Estimation of Physical Load

Figure 3 for Parallel Statistical and Machine Learning Methods for Estimation of Physical Load

Figure 4 for Parallel Statistical and Machine Learning Methods for Estimation of Physical Load

Abstract:Several statistical and machine learning methods are proposed to estimate the type and intensity of physical load and accumulated fatigue . They are based on the statistical analysis of accumulated and moving window data subsets with construction of a kurtosis-skewness diagram. This approach was applied to the data gathered by the wearable heart monitor for various types and levels of physical activities, and for people with various physical conditions. The different levels of physical activities, loads, and fitness can be distinguished from the kurtosis-skewness diagram, and their evolution can be monitored. Several metrics for estimation of the instant effect and accumulated effect (physical fatigue) of physical loads were proposed. The data and results presented allow to extend application of these methods for modeling and characterization of complex human activity patterns, for example, to estimate the actual and accumulated physical load and fatigue, model the potential dangerous development, and give cautions and advice in real time.

* 15 pages, 8 figures, accepted for 18th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP) 15-17 November, 2018 (Guangzhou, China)

Via

Access Paper or Ask Questions

Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation

Mar 03, 2018

Sergii Stirenko, Yuriy Kochura, Oleg Alienin, Oleksandr Rokovyi, Peng Gang, Wei Zeng, Yuri Gordienko

Figure 1 for Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation

Figure 2 for Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation

Figure 3 for Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation

Figure 4 for Chest X-Ray Analysis of Tuberculosis by Deep Learning with Segmentation and Augmentation

Abstract:The results of chest X-ray (CXR) analysis of 2D images to get the statistically reliable predictions (availability of tuberculosis) by computer-aided diagnosis (CADx) on the basis of deep learning are presented. They demonstrate the efficiency of lung segmentation, lossless and lossy data augmentation for CADx of tuberculosis by deep convolutional neural network (CNN) applied to the small and not well-balanced dataset even. CNN demonstrates ability to train (despite overfitting) on the pre-processed dataset obtained after lung segmentation in contrast to the original not-segmented dataset. Lossless data augmentation of the segmented dataset leads to the lowest validation loss (without overfitting) and nearly the same accuracy (within the limits of standard deviation) in comparison to the original and other pre-processed datasets after lossy data augmentation. The additional limited lossy data augmentation results in the lower validation loss, but with a decrease of the validation accuracy. In conclusion, besides the more complex deep CNNs and bigger datasets, the better progress of CADx for the small and not well-balanced datasets even could be obtained by better segmentation, data augmentation, dataset stratification, and exclusion of non-evident outliers.

* 6 pages, 11 figures, 1 table

Via

Access Paper or Ask Questions