Cargando…

Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study

Purpose: To development and validation of machine learning-based classifiers based on simple non-ocular metrics for detecting referable diabetic retinopathy (RDR) in a large-scale Chinese population–based survey. Methods: The 1,418 patients with diabetes mellitus from 8,952 rural residents screened...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yang, Cheng, Liu, Qingyang, Guo, Haike, Zhang, Min, Zhang, Lixin, Zhang, Guanrong, Zeng, Jin, Huang, Zhongning, Meng, Qianli, Cui, Ying
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Medicine
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8717406/ https://www.ncbi.nlm.nih.gov/pubmed/34977075 http://dx.doi.org/10.3389/fmed.2021.773881

_version_	1784624525418168320
author	Yang, Cheng Liu, Qingyang Guo, Haike Zhang, Min Zhang, Lixin Zhang, Guanrong Zeng, Jin Huang, Zhongning Meng, Qianli Cui, Ying
author_facet	Yang, Cheng Liu, Qingyang Guo, Haike Zhang, Min Zhang, Lixin Zhang, Guanrong Zeng, Jin Huang, Zhongning Meng, Qianli Cui, Ying
author_sort	Yang, Cheng
collection	PubMed
description	Purpose: To development and validation of machine learning-based classifiers based on simple non-ocular metrics for detecting referable diabetic retinopathy (RDR) in a large-scale Chinese population–based survey. Methods: The 1,418 patients with diabetes mellitus from 8,952 rural residents screened in the population-based Dongguan Eye Study were used for model development and validation. Eight algorithms [extreme gradient boosting (XGBoost), random forest, naïve Bayes, k-nearest neighbor (KNN), AdaBoost, Light GBM, artificial neural network (ANN), and logistic regression] were used for modeling to detect RDR in individuals with diabetes. The area under the receiver operating characteristic curve (AUC) and their 95% confidential interval (95% CI) were estimated using five-fold cross-validation as well as an 80:20 ratio of training and validation. Results: The 10 most important features in machine learning models were duration of diabetes, HbA1c, systolic blood pressure, triglyceride, body mass index, serum creatine, age, educational level, duration of hypertension, and income level. Based on these top 10 variables, the XGBoost model achieved the best discriminative performance, with an AUC of 0.816 (95%CI: 0.812, 0.820). The AUCs for logistic regression, AdaBoost, naïve Bayes, and Random forest were 0.766 (95%CI: 0.756, 0.776), 0.754 (95%CI: 0.744, 0.764), 0.753 (95%CI: 0.743, 0.763), and 0.705 (95%CI: 0.697, 0.713), respectively. Conclusions: A machine learning–based classifier that used 10 easily obtained non-ocular variables was able to effectively detect RDR patients. The importance scores of the variables provide insight to prevent the occurrence of RDR. Screening RDR with machine learning provides a useful complementary tool for clinical practice in resource-poor areas with limited ophthalmic infrastructure.
format	Online Article Text
id	pubmed-8717406
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-87174062021-12-31 Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study Yang, Cheng Liu, Qingyang Guo, Haike Zhang, Min Zhang, Lixin Zhang, Guanrong Zeng, Jin Huang, Zhongning Meng, Qianli Cui, Ying Front Med (Lausanne) Medicine Purpose: To development and validation of machine learning-based classifiers based on simple non-ocular metrics for detecting referable diabetic retinopathy (RDR) in a large-scale Chinese population–based survey. Methods: The 1,418 patients with diabetes mellitus from 8,952 rural residents screened in the population-based Dongguan Eye Study were used for model development and validation. Eight algorithms [extreme gradient boosting (XGBoost), random forest, naïve Bayes, k-nearest neighbor (KNN), AdaBoost, Light GBM, artificial neural network (ANN), and logistic regression] were used for modeling to detect RDR in individuals with diabetes. The area under the receiver operating characteristic curve (AUC) and their 95% confidential interval (95% CI) were estimated using five-fold cross-validation as well as an 80:20 ratio of training and validation. Results: The 10 most important features in machine learning models were duration of diabetes, HbA1c, systolic blood pressure, triglyceride, body mass index, serum creatine, age, educational level, duration of hypertension, and income level. Based on these top 10 variables, the XGBoost model achieved the best discriminative performance, with an AUC of 0.816 (95%CI: 0.812, 0.820). The AUCs for logistic regression, AdaBoost, naïve Bayes, and Random forest were 0.766 (95%CI: 0.756, 0.776), 0.754 (95%CI: 0.744, 0.764), 0.753 (95%CI: 0.743, 0.763), and 0.705 (95%CI: 0.697, 0.713), respectively. Conclusions: A machine learning–based classifier that used 10 easily obtained non-ocular variables was able to effectively detect RDR patients. The importance scores of the variables provide insight to prevent the occurrence of RDR. Screening RDR with machine learning provides a useful complementary tool for clinical practice in resource-poor areas with limited ophthalmic infrastructure. Frontiers Media S.A. 2021-12-09 /pmc/articles/PMC8717406/ /pubmed/34977075 http://dx.doi.org/10.3389/fmed.2021.773881 Text en Copyright © 2021 Yang, Liu, Guo, Zhang, Zhang, Zhang, Zeng, Huang, Meng and Cui. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Medicine Yang, Cheng Liu, Qingyang Guo, Haike Zhang, Min Zhang, Lixin Zhang, Guanrong Zeng, Jin Huang, Zhongning Meng, Qianli Cui, Ying Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title	Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title_full	Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title_fullStr	Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title_full_unstemmed	Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title_short	Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study
title_sort	usefulness of machine learning for identification of referable diabetic retinopathy in a large-scale population-based study
topic	Medicine
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8717406/ https://www.ncbi.nlm.nih.gov/pubmed/34977075 http://dx.doi.org/10.3389/fmed.2021.773881
work_keys_str_mv	AT yangcheng usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT liuqingyang usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT guohaike usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT zhangmin usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT zhanglixin usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT zhangguanrong usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT zengjin usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT huangzhongning usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT mengqianli usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy AT cuiying usefulnessofmachinelearningforidentificationofreferablediabeticretinopathyinalargescalepopulationbasedstudy

Usefulness of Machine Learning for Identification of Referable Diabetic Retinopathy in a Large-Scale Population-Based Study

Ejemplares similares