Cargando…

Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension

BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried...

Descripción completa

Detalles Bibliográficos
Autores principales: Bai, Xue, Liu, Wenjun, Huang, Hui, You, Huan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Tehran University of Medical Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884380/
https://www.ncbi.nlm.nih.gov/pubmed/36743358
http://dx.doi.org/10.18502/ijph.v51i9.10565
_version_ 1784879705860603904
author Bai, Xue
Liu, Wenjun
Huang, Hui
You, Huan
author_facet Bai, Xue
Liu, Wenjun
Huang, Hui
You, Huan
author_sort Bai, Xue
collection PubMed
description BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried a variety of feature selection methods to improve the accuracy of logistic regression (LR). METHODS: We collected 397 samples from Nanjing, Jiangsu, China between Jan 2016 and Dec 2017, including 178 hypertension samples and 219 control samples. It includes not only clinical and laboratory data, but also imaging data. We focused on the difference of imaging attributes between the control group and the hypertension group, and analyzed the correlation coefficients of all attributes. In order to establish the optimal LR model, this study tried three different feature selection methods, including statistical analysis, random forest (RF) and extreme gradient boosting (XGBoost). The area under the ROC curve (AUC) and accuracy were used as the main criterion for model evaluation. RESULTS: In the prediction of hypertension, the performance of LR with RF as the feature selection method (accuracy: 0.910; AUC: 0.924) was better than the performance of LR with XGBoost as the feature selection method (accuracy: 0.897; AUC: 0.915) and the performance of LR with statistical analysis as the feature selection method (accuracy: 0.872; AUC: 0.926). CONCLUSION: LR with RF as the feature selection method may provide accurate results in predicting hypertension. Carotid intima-media thickness (cIMT) and pulse wave velocity at the end of systole (ESPWV) are two key imaging indicators in the prediction of hypertension.
format Online
Article
Text
id pubmed-9884380
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Tehran University of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-98843802023-02-03 Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension Bai, Xue Liu, Wenjun Huang, Hui You, Huan Iran J Public Health Original Article BACKGROUND: Hypertension is the main reason why the incidence of cardiovascular disease has increased year-by-year and early diagnosis of hypertension is necessary to reducing the incidence of cardiovascular disease. This also puts forward higher requirements for the accuracy of diagnosis. We tried a variety of feature selection methods to improve the accuracy of logistic regression (LR). METHODS: We collected 397 samples from Nanjing, Jiangsu, China between Jan 2016 and Dec 2017, including 178 hypertension samples and 219 control samples. It includes not only clinical and laboratory data, but also imaging data. We focused on the difference of imaging attributes between the control group and the hypertension group, and analyzed the correlation coefficients of all attributes. In order to establish the optimal LR model, this study tried three different feature selection methods, including statistical analysis, random forest (RF) and extreme gradient boosting (XGBoost). The area under the ROC curve (AUC) and accuracy were used as the main criterion for model evaluation. RESULTS: In the prediction of hypertension, the performance of LR with RF as the feature selection method (accuracy: 0.910; AUC: 0.924) was better than the performance of LR with XGBoost as the feature selection method (accuracy: 0.897; AUC: 0.915) and the performance of LR with statistical analysis as the feature selection method (accuracy: 0.872; AUC: 0.926). CONCLUSION: LR with RF as the feature selection method may provide accurate results in predicting hypertension. Carotid intima-media thickness (cIMT) and pulse wave velocity at the end of systole (ESPWV) are two key imaging indicators in the prediction of hypertension. Tehran University of Medical Sciences 2022-09 /pmc/articles/PMC9884380/ /pubmed/36743358 http://dx.doi.org/10.18502/ijph.v51i9.10565 Text en Copyright © 2022 Bai et al. Published by Tehran University of Medical Sciences https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license (https://creativecommons.org/licenses/by-nc/4.0/). Non-commercial uses of the work are permitted, provided the original work is properly cited.
spellingShingle Original Article
Bai, Xue
Liu, Wenjun
Huang, Hui
You, Huan
Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title_full Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title_fullStr Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title_full_unstemmed Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title_short Logistic Regression Model Based on Ultrafast Pulse Wave Velocity and Different Feature Selection Methods to Predict the Risk of Hypertension
title_sort logistic regression model based on ultrafast pulse wave velocity and different feature selection methods to predict the risk of hypertension
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9884380/
https://www.ncbi.nlm.nih.gov/pubmed/36743358
http://dx.doi.org/10.18502/ijph.v51i9.10565
work_keys_str_mv AT baixue logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension
AT liuwenjun logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension
AT huanghui logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension
AT youhuan logisticregressionmodelbasedonultrafastpulsewavevelocityanddifferentfeatureselectionmethodstopredicttheriskofhypertension