Cargando…

Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study

BACKGROUND: Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Haotian, Long, Erping, Ding, Xiaohu, Diao, Hongxing, Chen, Zicong, Liu, Runzhong, Huang, Jialing, Cai, Jingheng, Xu, Shuangjuan, Zhang, Xiayin, Wang, Dongni, Chen, Kexin, Yu, Tongyong, Wu, Dongxuan, Zhao, Xutu, Liu, Zhenzhen, Wu, Xiaohang, Jiang, Yuzhen, Yang, Xiao, Cui, Dongmei, Liu, Wenyan, Zheng, Yingfeng, Luo, Lixia, Wang, Haibo, Chan, Chi-Chao, Morgan, Ian G., He, Mingguang, Liu, Yizhi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6219762/
https://www.ncbi.nlm.nih.gov/pubmed/30399150
http://dx.doi.org/10.1371/journal.pmed.1002674
_version_ 1783368712733065216
author Lin, Haotian
Long, Erping
Ding, Xiaohu
Diao, Hongxing
Chen, Zicong
Liu, Runzhong
Huang, Jialing
Cai, Jingheng
Xu, Shuangjuan
Zhang, Xiayin
Wang, Dongni
Chen, Kexin
Yu, Tongyong
Wu, Dongxuan
Zhao, Xutu
Liu, Zhenzhen
Wu, Xiaohang
Jiang, Yuzhen
Yang, Xiao
Cui, Dongmei
Liu, Wenyan
Zheng, Yingfeng
Luo, Lixia
Wang, Haibo
Chan, Chi-Chao
Morgan, Ian G.
He, Mingguang
Liu, Yizhi
author_facet Lin, Haotian
Long, Erping
Ding, Xiaohu
Diao, Hongxing
Chen, Zicong
Liu, Runzhong
Huang, Jialing
Cai, Jingheng
Xu, Shuangjuan
Zhang, Xiayin
Wang, Dongni
Chen, Kexin
Yu, Tongyong
Wu, Dongxuan
Zhao, Xutu
Liu, Zhenzhen
Wu, Xiaohang
Jiang, Yuzhen
Yang, Xiao
Cui, Dongmei
Liu, Wenyan
Zheng, Yingfeng
Luo, Lixia
Wang, Haibo
Chan, Chi-Chao
Morgan, Ian G.
He, Mingguang
Liu, Yizhi
author_sort Lin, Haotian
collection PubMed
description BACKGROUND: Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia is a common cause of vision loss. Current approaches to control myopia progression are effective but have significant side effects. Therefore, identifying those at greatest risk who should undergo targeted therapy is of great clinical importance. The objective of this study was to apply big data and machine learning technology to develop an algorithm that can predict the onset of high myopia, at specific future time points, among Chinese school-aged children. METHODS AND FINDINGS: Real-world clinical refraction data were derived from electronic medical record systems in 8 ophthalmic centres from January 1, 2005, to December 30, 2015. The variables of age, spherical equivalent (SE), and annual progression rate were used to develop an algorithm to predict SE and onset of high myopia (SE ≤ −6.0 dioptres) up to 10 years in the future. Random forest machine learning was used for algorithm training and validation. Electronic medical records from the Zhongshan Ophthalmic Centre (a major tertiary ophthalmic centre in China) were used as the training set. Ten-fold cross-validation and out-of-bag (OOB) methods were applied for internal validation. The remaining 7 independent datasets were used for external validation. Two population-based datasets, which had no participant overlap with the ophthalmic-centre-based datasets, were used for multi-resource validation testing. The main outcomes and measures were the area under the curve (AUC) values for predicting the onset of high myopia over 10 years and the presence of high myopia at 18 years of age. In total, 687,063 multiple visit records (≥3 records) of 129,242 individuals in the ophthalmic-centre-based electronic medical record databases and 17,113 follow-up records of 3,215 participants in population-based cohorts were included in the analysis. Our algorithm accurately predicted the presence of high myopia in internal validation (the AUC ranged from 0.903 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.888 for 8 years), external validation (the AUC ranged from 0.874 to 0.976 for 3 years, 0.847 to 0.921 for 5 years, and 0.802 to 0.886 for 8 years), and multi-resource testing (the AUC ranged from 0.752 to 0.869 for 4 years). With respect to the prediction of high myopia development by 18 years of age, as a surrogate of high myopia in adulthood, the algorithm provided clinically acceptable accuracy over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837). Meanwhile, our algorithm achieved clinically acceptable prediction of the actual refraction values at future time points, which is supported by the regressive performance and calibration curves. Although the algorithm achieved balanced and robust performance, concerns about the compromised quality of real-world clinical data and over-fitting issues should be cautiously considered. CONCLUSIONS: To our knowledge, this study, for the first time, used large-scale data collected from electronic health records to demonstrate the contribution of big data and machine learning approaches to improved prediction of myopia prognosis in Chinese school-aged children. This work provides evidence for transforming clinical practice, health policy-making, and precise individualised interventions regarding the practical control of school-aged myopia.
format Online
Article
Text
id pubmed-6219762
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62197622018-11-19 Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study Lin, Haotian Long, Erping Ding, Xiaohu Diao, Hongxing Chen, Zicong Liu, Runzhong Huang, Jialing Cai, Jingheng Xu, Shuangjuan Zhang, Xiayin Wang, Dongni Chen, Kexin Yu, Tongyong Wu, Dongxuan Zhao, Xutu Liu, Zhenzhen Wu, Xiaohang Jiang, Yuzhen Yang, Xiao Cui, Dongmei Liu, Wenyan Zheng, Yingfeng Luo, Lixia Wang, Haibo Chan, Chi-Chao Morgan, Ian G. He, Mingguang Liu, Yizhi PLoS Med Research Article BACKGROUND: Electronic medical records provide large-scale real-world clinical data for use in developing clinical decision systems. However, sophisticated methodology and analytical skills are required to handle the large-scale datasets necessary for the optimisation of prediction accuracy. Myopia is a common cause of vision loss. Current approaches to control myopia progression are effective but have significant side effects. Therefore, identifying those at greatest risk who should undergo targeted therapy is of great clinical importance. The objective of this study was to apply big data and machine learning technology to develop an algorithm that can predict the onset of high myopia, at specific future time points, among Chinese school-aged children. METHODS AND FINDINGS: Real-world clinical refraction data were derived from electronic medical record systems in 8 ophthalmic centres from January 1, 2005, to December 30, 2015. The variables of age, spherical equivalent (SE), and annual progression rate were used to develop an algorithm to predict SE and onset of high myopia (SE ≤ −6.0 dioptres) up to 10 years in the future. Random forest machine learning was used for algorithm training and validation. Electronic medical records from the Zhongshan Ophthalmic Centre (a major tertiary ophthalmic centre in China) were used as the training set. Ten-fold cross-validation and out-of-bag (OOB) methods were applied for internal validation. The remaining 7 independent datasets were used for external validation. Two population-based datasets, which had no participant overlap with the ophthalmic-centre-based datasets, were used for multi-resource validation testing. The main outcomes and measures were the area under the curve (AUC) values for predicting the onset of high myopia over 10 years and the presence of high myopia at 18 years of age. In total, 687,063 multiple visit records (≥3 records) of 129,242 individuals in the ophthalmic-centre-based electronic medical record databases and 17,113 follow-up records of 3,215 participants in population-based cohorts were included in the analysis. Our algorithm accurately predicted the presence of high myopia in internal validation (the AUC ranged from 0.903 to 0.986 for 3 years, 0.875 to 0.901 for 5 years, and 0.852 to 0.888 for 8 years), external validation (the AUC ranged from 0.874 to 0.976 for 3 years, 0.847 to 0.921 for 5 years, and 0.802 to 0.886 for 8 years), and multi-resource testing (the AUC ranged from 0.752 to 0.869 for 4 years). With respect to the prediction of high myopia development by 18 years of age, as a surrogate of high myopia in adulthood, the algorithm provided clinically acceptable accuracy over 3 years (the AUC ranged from 0.940 to 0.985), 5 years (the AUC ranged from 0.856 to 0.901), and even 8 years (the AUC ranged from 0.801 to 0.837). Meanwhile, our algorithm achieved clinically acceptable prediction of the actual refraction values at future time points, which is supported by the regressive performance and calibration curves. Although the algorithm achieved balanced and robust performance, concerns about the compromised quality of real-world clinical data and over-fitting issues should be cautiously considered. CONCLUSIONS: To our knowledge, this study, for the first time, used large-scale data collected from electronic health records to demonstrate the contribution of big data and machine learning approaches to improved prediction of myopia prognosis in Chinese school-aged children. This work provides evidence for transforming clinical practice, health policy-making, and precise individualised interventions regarding the practical control of school-aged myopia. Public Library of Science 2018-11-06 /pmc/articles/PMC6219762/ /pubmed/30399150 http://dx.doi.org/10.1371/journal.pmed.1002674 Text en © 2018 Lin et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lin, Haotian
Long, Erping
Ding, Xiaohu
Diao, Hongxing
Chen, Zicong
Liu, Runzhong
Huang, Jialing
Cai, Jingheng
Xu, Shuangjuan
Zhang, Xiayin
Wang, Dongni
Chen, Kexin
Yu, Tongyong
Wu, Dongxuan
Zhao, Xutu
Liu, Zhenzhen
Wu, Xiaohang
Jiang, Yuzhen
Yang, Xiao
Cui, Dongmei
Liu, Wenyan
Zheng, Yingfeng
Luo, Lixia
Wang, Haibo
Chan, Chi-Chao
Morgan, Ian G.
He, Mingguang
Liu, Yizhi
Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title_full Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title_fullStr Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title_full_unstemmed Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title_short Prediction of myopia development among Chinese school-aged children using refraction data from electronic medical records: A retrospective, multicentre machine learning study
title_sort prediction of myopia development among chinese school-aged children using refraction data from electronic medical records: a retrospective, multicentre machine learning study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6219762/
https://www.ncbi.nlm.nih.gov/pubmed/30399150
http://dx.doi.org/10.1371/journal.pmed.1002674
work_keys_str_mv AT linhaotian predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT longerping predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT dingxiaohu predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT diaohongxing predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT chenzicong predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT liurunzhong predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT huangjialing predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT caijingheng predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT xushuangjuan predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT zhangxiayin predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT wangdongni predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT chenkexin predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT yutongyong predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT wudongxuan predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT zhaoxutu predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT liuzhenzhen predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT wuxiaohang predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT jiangyuzhen predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT yangxiao predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT cuidongmei predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT liuwenyan predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT zhengyingfeng predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT luolixia predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT wanghaibo predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT chanchichao predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT morganiang predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT hemingguang predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy
AT liuyizhi predictionofmyopiadevelopmentamongchineseschoolagedchildrenusingrefractiondatafromelectronicmedicalrecordsaretrospectivemulticentremachinelearningstudy