Cargando…

Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study

OBJECTIVE: We build classification models and risk assessment tools for diabetes, hypertension and comorbidity using machine-learning algorithms on data from Kuwait. We model the increased proneness in diabetic patients to develop hypertension and vice versa. We ascertain the importance of ethnicity...

Descripción completa

Detalles Bibliográficos
Autores principales: Farran, Bassam, Channanath, Arshad Mohamed, Behbehani, Kazem, Thanaraj, Thangavel Alphonse
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3657675/
https://www.ncbi.nlm.nih.gov/pubmed/23676796
http://dx.doi.org/10.1136/bmjopen-2012-002457
_version_ 1782270162687229952
author Farran, Bassam
Channanath, Arshad Mohamed
Behbehani, Kazem
Thanaraj, Thangavel Alphonse
author_facet Farran, Bassam
Channanath, Arshad Mohamed
Behbehani, Kazem
Thanaraj, Thangavel Alphonse
author_sort Farran, Bassam
collection PubMed
description OBJECTIVE: We build classification models and risk assessment tools for diabetes, hypertension and comorbidity using machine-learning algorithms on data from Kuwait. We model the increased proneness in diabetic patients to develop hypertension and vice versa. We ascertain the importance of ethnicity (and natives vs expatriate migrants) and of using regional data in risk assessment. DESIGN: Retrospective cohort study. Four machine-learning techniques were used: logistic regression, k-nearest neighbours (k-NN), multifactor dimensionality reduction and support vector machines. The study uses fivefold cross validation to obtain generalisation accuracies and errors. SETTING: Kuwait Health Network (KHN) that integrates data from primary health centres and hospitals in Kuwait. PARTICIPANTS: 270 172 hospital visitors (of which, 89 858 are diabetic, 58 745 hypertensive and 30 522 comorbid) comprising Kuwaiti natives, Asian and Arab expatriates. OUTCOME MEASURES: Incident type 2 diabetes, hypertension and comorbidity. RESULTS: Classification accuracies of >85% (for diabetes) and >90% (for hypertension) are achieved using only simple non-laboratory-based parameters. Risk assessment tools based on k-NN classification models are able to assign ‘high’ risk to 75% of diabetic patients and to 94% of hypertensive patients. Only 5% of diabetic patients are seen assigned ‘low’ risk. Asian-specific models and assessments perform even better. Pathological conditions of diabetes in the general population or in hypertensive population and those of hypertension are modelled. Two-stage aggregate classification models and risk assessment tools, built combining both the component models on diabetes (or on hypertension), perform better than individual models. CONCLUSIONS: Data on diabetes, hypertension and comorbidity from the cosmopolitan State of Kuwait are available for the first time. This enabled us to apply four different case–control models to assess risks. These tools aid in the preliminary non-intrusive assessment of the population. Ethnicity is seen significant to the predictive models. Risk assessments need to be developed using regional data as we demonstrate the applicability of the American Diabetes Association online calculator on data from Kuwait.
format Online
Article
Text
id pubmed-3657675
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-36576752013-05-21 Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study Farran, Bassam Channanath, Arshad Mohamed Behbehani, Kazem Thanaraj, Thangavel Alphonse BMJ Open Health Informatics OBJECTIVE: We build classification models and risk assessment tools for diabetes, hypertension and comorbidity using machine-learning algorithms on data from Kuwait. We model the increased proneness in diabetic patients to develop hypertension and vice versa. We ascertain the importance of ethnicity (and natives vs expatriate migrants) and of using regional data in risk assessment. DESIGN: Retrospective cohort study. Four machine-learning techniques were used: logistic regression, k-nearest neighbours (k-NN), multifactor dimensionality reduction and support vector machines. The study uses fivefold cross validation to obtain generalisation accuracies and errors. SETTING: Kuwait Health Network (KHN) that integrates data from primary health centres and hospitals in Kuwait. PARTICIPANTS: 270 172 hospital visitors (of which, 89 858 are diabetic, 58 745 hypertensive and 30 522 comorbid) comprising Kuwaiti natives, Asian and Arab expatriates. OUTCOME MEASURES: Incident type 2 diabetes, hypertension and comorbidity. RESULTS: Classification accuracies of >85% (for diabetes) and >90% (for hypertension) are achieved using only simple non-laboratory-based parameters. Risk assessment tools based on k-NN classification models are able to assign ‘high’ risk to 75% of diabetic patients and to 94% of hypertensive patients. Only 5% of diabetic patients are seen assigned ‘low’ risk. Asian-specific models and assessments perform even better. Pathological conditions of diabetes in the general population or in hypertensive population and those of hypertension are modelled. Two-stage aggregate classification models and risk assessment tools, built combining both the component models on diabetes (or on hypertension), perform better than individual models. CONCLUSIONS: Data on diabetes, hypertension and comorbidity from the cosmopolitan State of Kuwait are available for the first time. This enabled us to apply four different case–control models to assess risks. These tools aid in the preliminary non-intrusive assessment of the population. Ethnicity is seen significant to the predictive models. Risk assessments need to be developed using regional data as we demonstrate the applicability of the American Diabetes Association online calculator on data from Kuwait. BMJ Publishing Group 2013-05-14 /pmc/articles/PMC3657675/ /pubmed/23676796 http://dx.doi.org/10.1136/bmjopen-2012-002457 Text en Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions This is an open-access article distributed under the terms of the Creative Commons Attribution Non-commercial License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited, the use is non commercial and is otherwise in compliance with the license. See: http://creativecommons.org/licenses/by-nc/3.0/ and http://creativecommons.org/licenses/by-nc/3.0/legalcode
spellingShingle Health Informatics
Farran, Bassam
Channanath, Arshad Mohamed
Behbehani, Kazem
Thanaraj, Thangavel Alphonse
Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title_full Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title_fullStr Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title_full_unstemmed Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title_short Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait—a cohort study
title_sort predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from kuwait—a cohort study
topic Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3657675/
https://www.ncbi.nlm.nih.gov/pubmed/23676796
http://dx.doi.org/10.1136/bmjopen-2012-002457
work_keys_str_mv AT farranbassam predictivemodelstoassessriskoftype2diabeteshypertensionandcomorbiditymachinelearningalgorithmsandvalidationusingnationalhealthdatafromkuwaitacohortstudy
AT channanatharshadmohamed predictivemodelstoassessriskoftype2diabeteshypertensionandcomorbiditymachinelearningalgorithmsandvalidationusingnationalhealthdatafromkuwaitacohortstudy
AT behbehanikazem predictivemodelstoassessriskoftype2diabeteshypertensionandcomorbiditymachinelearningalgorithmsandvalidationusingnationalhealthdatafromkuwaitacohortstudy
AT thanarajthangavelalphonse predictivemodelstoassessriskoftype2diabeteshypertensionandcomorbiditymachinelearningalgorithmsandvalidationusingnationalhealthdatafromkuwaitacohortstudy