Cargando…

Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction

This study looked at novel data sources for cardiovascular risk prediction including detailed lifestyle questionnaire and continuous blood pressure monitoring, using ensemble machine learning algorithms (MLAs). The reference conventional risk score compared against was the Framingham Risk Score (FRS...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Weiting, Ying, Tan Wei, Chin, Woon Loong Calvin, Baskaran, Lohendran, Marcus, Ong Eng Hock, Yeo, Khung Keong, Kiong, Ng See
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776753/
https://www.ncbi.nlm.nih.gov/pubmed/35058500
http://dx.doi.org/10.1038/s41598-021-04649-y
_version_ 1784636903763476480
author Huang, Weiting
Ying, Tan Wei
Chin, Woon Loong Calvin
Baskaran, Lohendran
Marcus, Ong Eng Hock
Yeo, Khung Keong
Kiong, Ng See
author_facet Huang, Weiting
Ying, Tan Wei
Chin, Woon Loong Calvin
Baskaran, Lohendran
Marcus, Ong Eng Hock
Yeo, Khung Keong
Kiong, Ng See
author_sort Huang, Weiting
collection PubMed
description This study looked at novel data sources for cardiovascular risk prediction including detailed lifestyle questionnaire and continuous blood pressure monitoring, using ensemble machine learning algorithms (MLAs). The reference conventional risk score compared against was the Framingham Risk Score (FRS). The outcome variables were low or high risk based on calcium score 0 or calcium score 100 and above. Ensemble MLAs were built based on naive bayes, random forest and support vector classifier for low risk and generalized linear regression, support vector regressor and stochastic gradient descent regressor for high risk categories. MLAs were trained on 600 Southeast Asians aged 21 to 69 years free of cardiovascular disease. All MLAs outperformed the FRS for low and high-risk categories. MLA based on lifestyle questionnaire only achieved AUC of 0.715 (95% CI 0.681, 0.750) and 0.710 (95% CI 0.653, 0.766) for low and high risk respectively. Combining all groups of risk factors (lifestyle survey questionnaires, clinical blood tests, 24-h ambulatory blood pressure and heart rate monitoring) along with feature selection, prediction of low and high CVD risk groups were further enhanced to 0.791 (95% CI 0.759, 0.822) and 0.790 (95% CI 0.745, 0.836). Besides conventional predictors, self-reported physical activity, average daily heart rate, awake blood pressure variability and percentage time in diastolic hypertension were important contributors to CVD risk classification.
format Online
Article
Text
id pubmed-8776753
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-87767532022-01-24 Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction Huang, Weiting Ying, Tan Wei Chin, Woon Loong Calvin Baskaran, Lohendran Marcus, Ong Eng Hock Yeo, Khung Keong Kiong, Ng See Sci Rep Article This study looked at novel data sources for cardiovascular risk prediction including detailed lifestyle questionnaire and continuous blood pressure monitoring, using ensemble machine learning algorithms (MLAs). The reference conventional risk score compared against was the Framingham Risk Score (FRS). The outcome variables were low or high risk based on calcium score 0 or calcium score 100 and above. Ensemble MLAs were built based on naive bayes, random forest and support vector classifier for low risk and generalized linear regression, support vector regressor and stochastic gradient descent regressor for high risk categories. MLAs were trained on 600 Southeast Asians aged 21 to 69 years free of cardiovascular disease. All MLAs outperformed the FRS for low and high-risk categories. MLA based on lifestyle questionnaire only achieved AUC of 0.715 (95% CI 0.681, 0.750) and 0.710 (95% CI 0.653, 0.766) for low and high risk respectively. Combining all groups of risk factors (lifestyle survey questionnaires, clinical blood tests, 24-h ambulatory blood pressure and heart rate monitoring) along with feature selection, prediction of low and high CVD risk groups were further enhanced to 0.791 (95% CI 0.759, 0.822) and 0.790 (95% CI 0.745, 0.836). Besides conventional predictors, self-reported physical activity, average daily heart rate, awake blood pressure variability and percentage time in diastolic hypertension were important contributors to CVD risk classification. Nature Publishing Group UK 2022-01-20 /pmc/articles/PMC8776753/ /pubmed/35058500 http://dx.doi.org/10.1038/s41598-021-04649-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Huang, Weiting
Ying, Tan Wei
Chin, Woon Loong Calvin
Baskaran, Lohendran
Marcus, Ong Eng Hock
Yeo, Khung Keong
Kiong, Ng See
Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title_full Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title_fullStr Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title_full_unstemmed Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title_short Application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
title_sort application of ensemble machine learning algorithms on lifestyle factors and wearables for cardiovascular risk prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8776753/
https://www.ncbi.nlm.nih.gov/pubmed/35058500
http://dx.doi.org/10.1038/s41598-021-04649-y
work_keys_str_mv AT huangweiting applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT yingtanwei applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT chinwoonloongcalvin applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT baskaranlohendran applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT marcusongenghock applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT yeokhungkeong applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction
AT kiongngsee applicationofensemblemachinelearningalgorithmsonlifestylefactorsandwearablesforcardiovascularriskprediction