Cargando…
Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a coho...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Wolters Kluwer Health
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5008529/ https://www.ncbi.nlm.nih.gov/pubmed/27583845 http://dx.doi.org/10.1097/MD.0000000000004143 |
_version_ | 1782451389146857472 |
---|---|
author | Ramezankhani, Azra Kabir, Ali Pournik, Omid Azizi, Fereidoun Hadaegh, Farzad |
author_facet | Ramezankhani, Azra Kabir, Ali Pournik, Omid Azizi, Fereidoun Hadaegh, Farzad |
author_sort | Ramezankhani, Azra |
collection | PubMed |
description | Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population. Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set. The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years. Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension. |
format | Online Article Text |
id | pubmed-5008529 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Wolters Kluwer Health |
record_format | MEDLINE/PubMed |
spelling | pubmed-50085292016-09-10 Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study Ramezankhani, Azra Kabir, Ali Pournik, Omid Azizi, Fereidoun Hadaegh, Farzad Medicine (Baltimore) 3400 Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population. Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set. The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years. Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension. Wolters Kluwer Health 2016-09-02 /pmc/articles/PMC5008529/ /pubmed/27583845 http://dx.doi.org/10.1097/MD.0000000000004143 Text en Copyright © 2016 the Author(s). Published by Wolters Kluwer Health, Inc. All rights reserved. http://creativecommons.org/licenses/by-nd/4.0 This is an open access article distributed under the Creative Commons Attribution-NoDerivatives License 4.0, which allows for redistribution, commercial and non-commercial, as long as it is passed along unchanged and in whole, with credit to the author. http://creativecommons.org/licenses/by-nd/4.0 |
spellingShingle | 3400 Ramezankhani, Azra Kabir, Ali Pournik, Omid Azizi, Fereidoun Hadaegh, Farzad Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title | Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title_full | Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title_fullStr | Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title_full_unstemmed | Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title_short | Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study |
title_sort | classification-based data mining for identification of risk patterns associated with hypertension in middle eastern population: a 12-year longitudinal study |
topic | 3400 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5008529/ https://www.ncbi.nlm.nih.gov/pubmed/27583845 http://dx.doi.org/10.1097/MD.0000000000004143 |
work_keys_str_mv | AT ramezankhaniazra classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy AT kabirali classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy AT pournikomid classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy AT azizifereidoun classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy AT hadaeghfarzad classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy |