Cargando…
Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol
Purpose: Cardiovascular disease (CVD) is a major worldwide health burden. As the risk factors of CVD, hypertension, and hyperlipidemia are most mentioned. Early stage hypertension in the population with dyslipidemia is an important public health hazard. This study was the application of data-driven...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407063/ https://www.ncbi.nlm.nih.gov/pubmed/36010315 http://dx.doi.org/10.3390/diagnostics12081965 |
_version_ | 1784774272924778496 |
---|---|
author | Liao, Pen-Chih Chen, Ming-Shu Jhou, Mao-Jhen Chen, Tsan-Chi Yang, Chih-Te Lu, Chi-Jie |
author_facet | Liao, Pen-Chih Chen, Ming-Shu Jhou, Mao-Jhen Chen, Tsan-Chi Yang, Chih-Te Lu, Chi-Jie |
author_sort | Liao, Pen-Chih |
collection | PubMed |
description | Purpose: Cardiovascular disease (CVD) is a major worldwide health burden. As the risk factors of CVD, hypertension, and hyperlipidemia are most mentioned. Early stage hypertension in the population with dyslipidemia is an important public health hazard. This study was the application of data-driven machine learning (ML), demonstrating complex relationships between risk factors and outcomes and promising predictive performance with vast amounts of medical data, aimed to investigate the association between dyslipidemia and the incidence of early stage hypertension in a large cohort with normal blood pressure at baseline. Methods: This study analyzed annual health screening data for 71,108 people from 2005 to 2017, including data for 27 risk-related indicators, sourced from the MJ Group, a major health screening center in Taiwan. We used five machine learning (ML) methods—stochastic gradient boosting (SGB), multivariate adaptive regression splines (MARS), least absolute shrinkage and selection operator regression (Lasso), ridge regression (Ridge), and gradient boosting with categorical features support (CatBoost)—to develop a multi-stage ML algorithm-based prediction scheme and then evaluate important risk factors at the early stage of hypertension, especially for groups with high-density lipoprotein cholesterol (HDL-C) and low-density lipoprotein cholesterol (LDL-C) levels within or out of the reference range. Results: Age, body mass index, waist circumference, waist-to-hip ratio, fasting plasma glucose, and C-reactive protein (CRP) were associated with hypertension. The hemoglobin level was also a positive contributor to blood pressure elevation and it appeared among the top three important risk factors in all LDL-C/HDL-C groups; therefore, these variables may be important in affecting blood pressure in the early stage of hypertension. A residual contribution to blood pressure elevation was found in groups with increased LDL-C. This suggests that LDL-C levels are associated with CPR levels, and that the LDL-C level may be an important factor for predicting the development of hypertension. Conclusion: The five prediction models provided similar classifications of risk factors. The results of this study show that an increase in LDL-C is more important than the start of a drop in HDL-C in health screening of sub-healthy adults. The findings of this study should be of value to health awareness raising about hypertension and further discussion and follow-up research. |
format | Online Article Text |
id | pubmed-9407063 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94070632022-08-26 Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol Liao, Pen-Chih Chen, Ming-Shu Jhou, Mao-Jhen Chen, Tsan-Chi Yang, Chih-Te Lu, Chi-Jie Diagnostics (Basel) Article Purpose: Cardiovascular disease (CVD) is a major worldwide health burden. As the risk factors of CVD, hypertension, and hyperlipidemia are most mentioned. Early stage hypertension in the population with dyslipidemia is an important public health hazard. This study was the application of data-driven machine learning (ML), demonstrating complex relationships between risk factors and outcomes and promising predictive performance with vast amounts of medical data, aimed to investigate the association between dyslipidemia and the incidence of early stage hypertension in a large cohort with normal blood pressure at baseline. Methods: This study analyzed annual health screening data for 71,108 people from 2005 to 2017, including data for 27 risk-related indicators, sourced from the MJ Group, a major health screening center in Taiwan. We used five machine learning (ML) methods—stochastic gradient boosting (SGB), multivariate adaptive regression splines (MARS), least absolute shrinkage and selection operator regression (Lasso), ridge regression (Ridge), and gradient boosting with categorical features support (CatBoost)—to develop a multi-stage ML algorithm-based prediction scheme and then evaluate important risk factors at the early stage of hypertension, especially for groups with high-density lipoprotein cholesterol (HDL-C) and low-density lipoprotein cholesterol (LDL-C) levels within or out of the reference range. Results: Age, body mass index, waist circumference, waist-to-hip ratio, fasting plasma glucose, and C-reactive protein (CRP) were associated with hypertension. The hemoglobin level was also a positive contributor to blood pressure elevation and it appeared among the top three important risk factors in all LDL-C/HDL-C groups; therefore, these variables may be important in affecting blood pressure in the early stage of hypertension. A residual contribution to blood pressure elevation was found in groups with increased LDL-C. This suggests that LDL-C levels are associated with CPR levels, and that the LDL-C level may be an important factor for predicting the development of hypertension. Conclusion: The five prediction models provided similar classifications of risk factors. The results of this study show that an increase in LDL-C is more important than the start of a drop in HDL-C in health screening of sub-healthy adults. The findings of this study should be of value to health awareness raising about hypertension and further discussion and follow-up research. MDPI 2022-08-14 /pmc/articles/PMC9407063/ /pubmed/36010315 http://dx.doi.org/10.3390/diagnostics12081965 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liao, Pen-Chih Chen, Ming-Shu Jhou, Mao-Jhen Chen, Tsan-Chi Yang, Chih-Te Lu, Chi-Jie Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title | Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title_full | Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title_fullStr | Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title_full_unstemmed | Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title_short | Integrating Health Data-Driven Machine Learning Algorithms to Evaluate Risk Factors of Early Stage Hypertension at Different Levels of HDL and LDL Cholesterol |
title_sort | integrating health data-driven machine learning algorithms to evaluate risk factors of early stage hypertension at different levels of hdl and ldl cholesterol |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9407063/ https://www.ncbi.nlm.nih.gov/pubmed/36010315 http://dx.doi.org/10.3390/diagnostics12081965 |
work_keys_str_mv | AT liaopenchih integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol AT chenmingshu integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol AT jhoumaojhen integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol AT chentsanchi integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol AT yangchihte integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol AT luchijie integratinghealthdatadrivenmachinelearningalgorithmstoevaluateriskfactorsofearlystagehypertensionatdifferentlevelsofhdlandldlcholesterol |