Cargando…
Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method
Hypertension is a widespread chronic disease. Risk prediction of hypertension is an intervention that contributes to the early prevention and management of hypertension. The implementation of such intervention requires an effective and easy-to-implement hypertension risk prediction model. This study...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8497705/ https://www.ncbi.nlm.nih.gov/pubmed/34631636 http://dx.doi.org/10.3389/fpubh.2021.619429 |
_version_ | 1784580009252356096 |
---|---|
author | Zhao, Huanhuan Zhang, Xiaoyu Xu, Yang Gao, Lisheng Ma, Zuchang Sun, Yining Wang, Weimin |
author_facet | Zhao, Huanhuan Zhang, Xiaoyu Xu, Yang Gao, Lisheng Ma, Zuchang Sun, Yining Wang, Weimin |
author_sort | Zhao, Huanhuan |
collection | PubMed |
description | Hypertension is a widespread chronic disease. Risk prediction of hypertension is an intervention that contributes to the early prevention and management of hypertension. The implementation of such intervention requires an effective and easy-to-implement hypertension risk prediction model. This study evaluated and compared the performance of four machine learning algorithms on predicting the risk of hypertension based on easy-to-collect risk factors. A dataset of 29,700 samples collected through a physical examination was used for model training and testing. Firstly, we identified easy-to-collect risk factors of hypertension, through univariate logistic regression analysis. Then, based on the selected features, 10-fold cross-validation was utilized to optimize four models, random forest (RF), CatBoost, MLP neural network and logistic regression (LR), to find the best hyper-parameters on the training set. Finally, the performance of models was evaluated by AUC, accuracy, sensitivity and specificity on the test set. The experimental results showed that the RF model outperformed the other three models, and achieved an AUC of 0.92, an accuracy of 0.82, a sensitivity of 0.83 and a specificity of 0.81. In addition, Body Mass Index (BMI), age, family history and waist circumference (WC) are the four primary risk factors of hypertension. These findings reveal that it is feasible to use machine learning algorithms, especially RF, to predict hypertension risk without clinical or genetic data. The technique can provide a non-invasive and economical way for the prevention and management of hypertension in a large population. |
format | Online Article Text |
id | pubmed-8497705 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84977052021-10-09 Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method Zhao, Huanhuan Zhang, Xiaoyu Xu, Yang Gao, Lisheng Ma, Zuchang Sun, Yining Wang, Weimin Front Public Health Public Health Hypertension is a widespread chronic disease. Risk prediction of hypertension is an intervention that contributes to the early prevention and management of hypertension. The implementation of such intervention requires an effective and easy-to-implement hypertension risk prediction model. This study evaluated and compared the performance of four machine learning algorithms on predicting the risk of hypertension based on easy-to-collect risk factors. A dataset of 29,700 samples collected through a physical examination was used for model training and testing. Firstly, we identified easy-to-collect risk factors of hypertension, through univariate logistic regression analysis. Then, based on the selected features, 10-fold cross-validation was utilized to optimize four models, random forest (RF), CatBoost, MLP neural network and logistic regression (LR), to find the best hyper-parameters on the training set. Finally, the performance of models was evaluated by AUC, accuracy, sensitivity and specificity on the test set. The experimental results showed that the RF model outperformed the other three models, and achieved an AUC of 0.92, an accuracy of 0.82, a sensitivity of 0.83 and a specificity of 0.81. In addition, Body Mass Index (BMI), age, family history and waist circumference (WC) are the four primary risk factors of hypertension. These findings reveal that it is feasible to use machine learning algorithms, especially RF, to predict hypertension risk without clinical or genetic data. The technique can provide a non-invasive and economical way for the prevention and management of hypertension in a large population. Frontiers Media S.A. 2021-09-24 /pmc/articles/PMC8497705/ /pubmed/34631636 http://dx.doi.org/10.3389/fpubh.2021.619429 Text en Copyright © 2021 Zhao, Zhang, Xu, Gao, Ma, Sun and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Zhao, Huanhuan Zhang, Xiaoyu Xu, Yang Gao, Lisheng Ma, Zuchang Sun, Yining Wang, Weimin Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title | Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title_full | Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title_fullStr | Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title_full_unstemmed | Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title_short | Predicting the Risk of Hypertension Based on Several Easy-to-Collect Risk Factors: A Machine Learning Method |
title_sort | predicting the risk of hypertension based on several easy-to-collect risk factors: a machine learning method |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8497705/ https://www.ncbi.nlm.nih.gov/pubmed/34631636 http://dx.doi.org/10.3389/fpubh.2021.619429 |
work_keys_str_mv | AT zhaohuanhuan predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT zhangxiaoyu predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT xuyang predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT gaolisheng predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT mazuchang predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT sunyining predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod AT wangweimin predictingtheriskofhypertensionbasedonseveraleasytocollectriskfactorsamachinelearningmethod |