Risk Stratification for Early Detection of Diabetes and Hypertension in Resource-Limited Settings: Machine Learning Analysis

BACKGROUND: The impending scale-up of noncommunicable disease screening programs in low- and middle-income countries, coupled with limited health resources, requires that such programs be as accurate as possible at identifying patients at high risk. OBJECTIVE: The aim of this study was to develop machi...

Bibliographic Details
Main authors:	Boutilier, Justin J; Chan, Timothy C Y; Ranjan, Manish; Deo, Sarang
Format:	Online Article Text
Language:	English
Published:	JMIR Publications 2021
Subjects:	Original Paper
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7862003/ https://www.ncbi.nlm.nih.gov/pubmed/33475518 http://dx.doi.org/10.2196/20123

_version_	1783647193799852032
author	Boutilier, Justin J; Chan, Timothy C Y; Ranjan, Manish; Deo, Sarang
author_sort	Boutilier, Justin J
collection	PubMed
description	BACKGROUND: The impending scale-up of noncommunicable disease screening programs in low- and middle-income countries, coupled with limited health resources, requires that such programs be as accurate as possible at identifying patients at high risk. OBJECTIVE: The aim of this study was to develop machine learning–based risk stratification algorithms for diabetes and hypertension that are tailored for the at-risk population served by community-based screening programs in low-resource settings. METHODS: We trained and tested our models by using data from 2278 patients collected by community health workers through door-to-door and camp-based screenings in the urban slums of Hyderabad, India between July 14, 2015 and April 21, 2018. We determined the best models for predicting short-term (2-month) risk of diabetes and hypertension (a model for diabetes and a model for hypertension) and compared these models to previously developed risk scores from the United States and the United Kingdom by using prediction accuracy as characterized by the area under the receiver operating characteristic curve (AUC) and the number of false negatives. RESULTS: We found that models based on random forest had the highest prediction accuracy for both diseases and were able to outperform the US and UK risk scores in terms of AUC by 35.5% for diabetes (improvement of 0.239 from 0.671 to 0.910) and 13.5% for hypertension (improvement of 0.094 from 0.698 to 0.792). For a fixed screening specificity of 0.9, the random forest model was able to reduce the expected number of false negatives by 620 patients per 1000 screenings for diabetes and 220 patients per 1000 screenings for hypertension. This improvement reduces the cost of incorrect risk stratification by US $1.99 (or 35%) per screening for diabetes and US $1.60 (or 21%) per screening for hypertension. CONCLUSIONS: In the next decade, health systems in many countries are planning to spend significant resources on noncommunicable disease screening programs, and our study demonstrates that machine learning models can be leveraged by these programs to effectively utilize limited resources by improving risk stratification.
format	Online Article Text
id	pubmed-7862003
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-7862003 2021-02-10 Risk Stratification for Early Detection of Diabetes and Hypertension in Resource-Limited Settings: Machine Learning Analysis Boutilier, Justin J; Chan, Timothy C Y; Ranjan, Manish; Deo, Sarang J Med Internet Res Original Paper JMIR Publications 2021-01-21 /pmc/articles/PMC7862003/ /pubmed/33475518 http://dx.doi.org/10.2196/20123 Text en ©Justin J Boutilier, Timothy C Y Chan, Manish Ranjan, Sarang Deo. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 21.01.2021. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
title	Risk Stratification for Early Detection of Diabetes and Hypertension in Resource-Limited Settings: Machine Learning Analysis
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7862003/ https://www.ncbi.nlm.nih.gov/pubmed/33475518 http://dx.doi.org/10.2196/20123
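
The AUC comparison and the fixed-specificity false-negative count described in the abstract can be illustrated with a short sketch. The Python below is not the authors' code: the function names and the toy scores are hypothetical, and only the four AUC values come from this record. It recomputes the relative AUC gains from the rounded AUCs (roughly 35.6% and 13.5%; the record reports 35.5% for diabetes, presumably computed from unrounded values) and shows one simple way to count false negatives once a threshold is chosen to keep specificity at or above 0.9.

```python
import math


def relative_auc_gain(baseline_auc, model_auc):
    """Return (absolute gain, relative gain in percent) of a model over a baseline."""
    absolute = model_auc - baseline_auc
    return absolute, 100.0 * absolute / baseline_auc


def false_negatives_at_specificity(scores, labels, specificity=0.9):
    """Count positives missed at a threshold that keeps specificity >= the target.

    A patient is flagged as high risk when the score is strictly above the
    threshold. The threshold is the smallest negative-class score such that at
    least `specificity` of the negatives fall at or below it.
    """
    negatives = sorted(s for s, y in zip(scores, labels) if y == 0)
    positives = [s for s, y in zip(scores, labels) if y == 1]
    # Number of negatives that must be classified as negative.
    # round() guards against floating-point noise such as 27.000000000000004.
    k = math.ceil(round(specificity * len(negatives), 9))
    threshold = negatives[k - 1] if k > 0 else float("-inf")
    return sum(1 for s in positives if s <= threshold)


# AUC values quoted in the record (risk score baseline vs. random forest).
diabetes_abs, diabetes_rel = relative_auc_gain(0.671, 0.910)
hypertension_abs, hypertension_rel = relative_auc_gain(0.698, 0.792)
print(f"diabetes: +{diabetes_abs:.3f} AUC ({diabetes_rel:.1f}% relative)")
print(f"hypertension: +{hypertension_abs:.3f} AUC ({hypertension_rel:.1f}% relative)")

# Hypothetical toy data (NOT from the study): 10 negatives, 5 positives.
toy_scores = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.80,
              0.30, 0.42, 0.50, 0.60, 0.90]
toy_labels = [0] * 10 + [1] * 5
missed = false_negatives_at_specificity(toy_scores, toy_labels, specificity=0.9)
print(f"toy example: {missed} of 5 positives missed at specificity >= 0.9")
```

In practice the scores would come from a fitted classifier evaluated on held-out data; the fixed-specificity threshold makes two models comparable by the number of high-risk patients each one misses.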
