Cargando…

Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study

Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a coho...

Descripción completa

Detalles Bibliográficos
Autores principales: Ramezankhani, Azra, Kabir, Ali, Pournik, Omid, Azizi, Fereidoun, Hadaegh, Farzad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer Health 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5008529/
https://www.ncbi.nlm.nih.gov/pubmed/27583845
http://dx.doi.org/10.1097/MD.0000000000004143
_version_ 1782451389146857472
author Ramezankhani, Azra
Kabir, Ali
Pournik, Omid
Azizi, Fereidoun
Hadaegh, Farzad
author_facet Ramezankhani, Azra
Kabir, Ali
Pournik, Omid
Azizi, Fereidoun
Hadaegh, Farzad
author_sort Ramezankhani, Azra
collection PubMed
description Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population. Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set. The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years. Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension.
format Online
Article
Text
id pubmed-5008529
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Wolters Kluwer Health
record_format MEDLINE/PubMed
spelling pubmed-50085292016-09-10 Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study Ramezankhani, Azra Kabir, Ali Pournik, Omid Azizi, Fereidoun Hadaegh, Farzad Medicine (Baltimore) 3400 Hypertension is a critical public health concern worldwide. Identification of risk factors using traditional multivariable models has been a field of active research. The present study was undertaken to identify risk patterns associated with hypertension incidence using data mining methods in a cohort of Iranian adult population. Data on 6205 participants (44% men) age > 20 years, free from hypertension at baseline with no history of cardiovascular disease, were used to develop a series of prediction models by 3 types of decision tree (DT) algorithms. The performances of all classifiers were evaluated on the testing data set. The Quick Unbiased Efficient Statistical Tree algorithm among men and women and Classification and Regression Tree among the total population had the best performance. The C-statistic and sensitivity for the prediction models were (0.70 and 71%) in men, (0.79 and 71%) in women, and (0.78 and 72%) in total population, respectively. In DT models, systolic blood pressure (SBP), diastolic blood pressure, age, and waist circumference significantly contributed to the risk of incident hypertension in both genders and total population, wrist circumference and 2-h postchallenge plasma glucose among women and fasting plasma glucose among men. In men, the highest hypertension risk was seen in those with SBP > 115 mm Hg and age > 30 years. In women those with SBP > 114 mm Hg and age > 33 years had the highest risk for hypertension. For the total population, higher risk was observed in those with SBP > 114 mm Hg and age > 38 years. Our study emphasizes the utility of DTs for prediction of hypertension and exploring interaction between predictors. DT models used the easily available variables to identify homogeneous subgroups with different risk pattern for the hypertension. Wolters Kluwer Health 2016-09-02 /pmc/articles/PMC5008529/ /pubmed/27583845 http://dx.doi.org/10.1097/MD.0000000000004143 Text en Copyright © 2016 the Author(s). Published by Wolters Kluwer Health, Inc. All rights reserved. http://creativecommons.org/licenses/by-nd/4.0 This is an open access article distributed under the Creative Commons Attribution-NoDerivatives License 4.0, which allows for redistribution, commercial and non-commercial, as long as it is passed along unchanged and in whole, with credit to the author. http://creativecommons.org/licenses/by-nd/4.0
spellingShingle 3400
Ramezankhani, Azra
Kabir, Ali
Pournik, Omid
Azizi, Fereidoun
Hadaegh, Farzad
Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title_full Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title_fullStr Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title_full_unstemmed Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title_short Classification-based data mining for identification of risk patterns associated with hypertension in Middle Eastern population: A 12-year longitudinal study
title_sort classification-based data mining for identification of risk patterns associated with hypertension in middle eastern population: a 12-year longitudinal study
topic 3400
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5008529/
https://www.ncbi.nlm.nih.gov/pubmed/27583845
http://dx.doi.org/10.1097/MD.0000000000004143
work_keys_str_mv AT ramezankhaniazra classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy
AT kabirali classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy
AT pournikomid classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy
AT azizifereidoun classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy
AT hadaeghfarzad classificationbaseddataminingforidentificationofriskpatternsassociatedwithhypertensioninmiddleeasternpopulationa12yearlongitudinalstudy