Cargando…
Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Tehran University of Medical Sciences
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884380/ https://www.ncbi.nlm.nih.gov/pubmed/36743358 http://dx.doi.org/10.18502/ijph.v51i9.10565 |
_version_ | 1784879705860603904 |
---|---|
author | Bai, Xue Liu, Wenjun Huang, Hui You, Huan |
author_facet | Bai, Xue Liu, Wenjun Huang, Hui You, Huan |
author_sort | Bai, Xue |
collection | PubMed |
description | BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried a variety of feature selection methods to improve the accuracy of logistic regression (LR). METHODS: We collected 397 samples from Nanjing, Jiangsu, China between Jan 2016 and Dec 2017, including 178 hypertension samples and 219 control samples. It includes not only clinical and laboratory data, but also imaging data. We focused on the difference of imaging attributes between the control group and the hypertension group, and analyzed the correlation coefficients of all attributes. In order to establish the optimal LR model, this study tried three different feature selection methods, including statistical analysis, random forest (RF) and extreme gradient boosting (XGBoost). The area under the ROC curve (AUC) and accuracy were used as the main criterion for model evaluation. RESULTS: In the prediction of hypertension, the performance of LR with RF as the feature selection method (accuracy: 0.910; AUC: 0.924) was better than the performance of LR with XGBoost as the feature selection method (accuracy: 0.897; AUC: 0.915) and the performance of LR with statistical analysis as the feature selection method (accuracy: 0.872; AUC: 0.926). CONCLUSION: LR with RF as the feature selection method may provide accurate results in predicting hypertension. Carotid intima-media thickness (cIMT) and pulse wave velocity at the end of systole (ESPWV) are two key imaging indicators in the prediction of hypertension. |
format | Online Article Text |
id | pubmed-9884380 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Tehran University of Medical Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-98843802023-02-03 Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension Bai, Xue Liu, Wenjun Huang, Hui You, Huan Iran J Public Health Original Article BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried a variety of feature selection methods to improve the accuracy of logistic regression (LR). METHODS: We collected 397 samples from Nanjing, Jiangsu, China between Jan 2016 and Dec 2017, including 178 hypertension samples and 219 control samples. It includes not only clinical and laboratory data, but also imaging data. We focused on the difference of imaging attributes between the control group and the hypertension group, and analyzed the correlation coefficients of all attributes. In order to establish the optimal LR model, this study tried three different feature selection methods, including statistical analysis, random forest (RF) and extreme gradient boosting (XGBoost). The area under the ROC curve (AUC) and accuracy were used as the main criterion for model evaluation. RESULTS: In the prediction of hypertension, the performance of LR with RF as the feature selection method (accuracy: 0.910; AUC: 0.924) was better than the performance of LR with XGBoost as the feature selection method (accuracy: 0.897; AUC: 0.915) and the performance of LR with statistical analysis as the feature selection method (accuracy: 0.872; AUC: 0.926). CONCLUSION: LR with RF as the feature selection method may provide accurate results in predicting hypertension. Carotid intima-media thickness (cIMT) and pulse wave velocity at the end of systole (ESPWV) are two key imaging indicators in the prediction of hypertension. Tehran University of Medical Sciences 2022-09 /pmc/articles/PMC9884380/ /pubmed/36743358 http://dx.doi.org/10.18502/ijph.v51i9.10565 Text en Copyright © 2022 Bai et al. Published by Tehran University of Medical Sciences https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license (https://creativecommons.org/licenses/by-nc/4.0/). Non-commercial uses of the work are permitted, provided the original work is properly cited. |
spellingShingle | Original Article Bai, Xue Liu, Wenjun Huang, Hui You, Huan Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title | Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title_full | Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title_fullStr | Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title_full_unstemmed | Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title_short | Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension |
title_sort | logistic regression model based on ultrafast pulse wave velocity and different feature selection methods to predict the risk of hypertension |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884380/ https://www.ncbi.nlm.nih.gov/pubmed/36743358 http://dx.doi.org/10.18502/ijph.v51i9.10565 |
work_keys_str_mv | AT baixue logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension AT liuwenjun logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension AT huanghui logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension AT youhuan logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension |