Developing a Risk Stratification Model Based on Machine Learning for Targeted Screening of Diabetic Retinopathy in the Indian Population

Objective: This study aimed to develop a predictive risk score model based on deep learning (DL) independent of fundus photography, totally reliant on systemic data through targeted screening from a population-based study to diagnose diabetic retinopathy (DR) in the Indian population. Methods: It in...

Bibliographic Details
Main authors: Surya, Janani; Kashyap, Himanshu; Nadig, Ramya R; Raman, Rajiv
Format: Online Article Text
Language: English
Published: Cureus 2023
Subjects: Public Health
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10595397/
https://www.ncbi.nlm.nih.gov/pubmed/37881381
http://dx.doi.org/10.7759/cureus.45853
_version_ 1785124860780871680
author Surya, Janani
Kashyap, Himanshu
Nadig, Ramya R
Raman, Rajiv
author_facet Surya, Janani
Kashyap, Himanshu
Nadig, Ramya R
Raman, Rajiv
author_sort Surya, Janani
collection PubMed
description Objective: This study aimed to develop a predictive risk score model based on deep learning (DL) independent of fundus photography, totally reliant on systemic data through targeted screening from a population-based study to diagnose diabetic retinopathy (DR) in the Indian population. Methods: It involved machine learning application on datasets of a cross-sectional population-based study. A total of 1425 subjects (1175 subjects with known diabetes and 250 with newly diagnosed diabetes) were included in the study. We applied five machine learning algorithms, random forest (RF), logistic regression (LR), support vector machines (SVM), artificial neural networks (ANN), and decision trees (DT), to predict diabetic retinopathy in our datasets. We incorporated a percentage split in the first experiment and randomly divided our data set into 80% as a training set and 20% as a test set. We performed a three-way data split in the second experiment to prevent overestimating predictive performance. We randomly divided our data set into 60% as a training set, 20% as a validation set, and 20% as the test set. Furthermore, we integrated five-fold cross-validation to split the percentage to evaluate our method. We judged the predictive performance based on the receiver operating characteristic (ROC) curve, the area under the curve (AUC), accuracy (Acc), sensitivity, and specificity. Results: The RF classifier achieved the best prediction performance with AUC, Acc, and sensitivity values of 0.91, 0.89, and 0.90, respectively, in the percentage split. Similarly, a three-way data split attained an outcome of 0.86 and 0.85 in AUC and Acc. Likewise, the five-fold cross-validation performed the best with results of 0.90, 0.97, 0.91, and 0.75 in AUC, Acc, sensitivity, and specificity, respectively. Conclusion: Since the RF classifier achieved the best performance, we propose it to identify diabetic retinopathy for targeted screening in the general population.
format Online
Article
Text
id pubmed-10595397
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cureus
record_format MEDLINE/PubMed
spelling pubmed-10595397 2023-10-25 Developing a Risk Stratification Model Based on Machine Learning for Targeted Screening of Diabetic Retinopathy in the Indian Population Surya, Janani Kashyap, Himanshu Nadig, Ramya R Raman, Rajiv Cureus Public Health Cureus 2023-09-24 /pmc/articles/PMC10595397/ /pubmed/37881381 http://dx.doi.org/10.7759/cureus.45853 Text en Copyright © 2023, Surya et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Public Health
Surya, Janani
Kashyap, Himanshu
Nadig, Ramya R
Raman, Rajiv
Developing a Risk Stratification Model Based on Machine Learning for Targeted Screening of Diabetic Retinopathy in the Indian Population
title Developing a Risk Stratification Model Based on Machine Learning for Targeted Screening of Diabetic Retinopathy in the Indian Population
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10595397/
https://www.ncbi.nlm.nih.gov/pubmed/37881381
http://dx.doi.org/10.7759/cureus.45853
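
Illustrative note on the evaluation protocol. The abstract in this record describes three ways of partitioning the 1425 subjects (an 80/20 percentage split, a 60/20/20 three-way split, and five-fold cross-validation) and reports accuracy, sensitivity, and specificity. The sketch below shows one way such partitions and metrics could be computed. It is not the authors' code: the function names, the fixed seed, and the use of index shuffling are assumptions made for illustration, and no classifier is trained here.

```python
import random


def percentage_split(n, train_pct=80, seed=0):
    """Shuffle indices 0..n-1 and split them into train/test parts.

    train_pct is an integer percentage (assumed helper, not from the paper).
    """
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    cut = n * train_pct // 100
    return idx[:cut], idx[cut:]


def three_way_split(n, train_pct=60, val_pct=20, seed=0):
    """Shuffle indices and split them into train/validation/test parts."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    a = n * train_pct // 100
    b = a + n * val_pct // 100
    return idx[:a], idx[a:b], idx[b:]


def k_fold_splits(n, k=5, seed=0):
    """Yield (train, test) index lists; each index is tested exactly once."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, folds[i]


def binary_metrics(y_true, y_pred):
    """Accuracy, sensitivity (recall on 1s), specificity (recall on 0s)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    nan = float("nan")
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else nan,
        "sensitivity": tp / (tp + fn) if (tp + fn) else nan,
        "specificity": tn / (tn + fp) if (tn + fp) else nan,
    }


if __name__ == "__main__":
    n = 1425  # number of subjects stated in the abstract
    train, test = percentage_split(n)
    print(len(train), len(test))             # 1140 285
    tr, va, te = three_way_split(n)
    print(len(tr), len(va), len(te))         # 855 285 285
    print([len(t) for _, t in k_fold_splits(n)])  # [285, 285, 285, 285, 285]
```

In the three-way variant, the validation part would typically be used for model selection so that the held-out test part is touched only once, which is the safeguard against overestimated performance that the abstract mentions.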