Cargando…

Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study

PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collecte...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Wei, Wang, Fei, Sun, Shaoming, Sun, Yining, Chen, Jingcheng, Wang, Mu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182351/
https://www.ncbi.nlm.nih.gov/pubmed/37179307
http://dx.doi.org/10.1186/s12938-023-01109-8
_version_ 1785041753252823040
author Peng, Wei
Wang, Fei
Sun, Shaoming
Sun, Yining
Chen, Jingcheng
Wang, Mu
author_facet Peng, Wei
Wang, Fei
Sun, Shaoming
Sun, Yining
Chen, Jingcheng
Wang, Mu
author_sort Peng, Wei
collection PubMed
description PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collected through interviewing students and parents. One year after baseline, the incidence of myopia was evaluated based on visual acuity test and cycloplegic refraction measurement. Five algorithms, Random Forest, Support Vector Machines, Gradient Boosting Decision Tree, CatBoost and Logistic Regression were utilized to develop different models and their performance was validated by area under curve (AUC). Shapley Additive exPlanations was applied to interpret the model output on the individual and global level. RESULT: Of 2221 children, 260 (11.7%) developed myopia in 1 year. In univariable analysis, 26 features were associated with the myopia incidence. Catboost algorithm had the highest AUC of 0.951 in the model validation. The top 3 features for predicting myopia were parental myopia, grade and frequency of eye fatigue. A compact model using only 10 features was validated with an AUC of 0.891. CONCLUSION: The daily information contributed reliable predictors for childhood’s myopia onset. The interpretable Catboost model presented the best prediction performance. Oversampling technology greatly improved model performance. This model could be a tool in myopia preventing and intervention that can help identify children who are at risk of myopia, and provide personalized prevention strategies based on contributions of risk factors to the individual prediction result. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12938-023-01109-8.
format Online
Article
Text
id pubmed-10182351
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101823512023-05-14 Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study Peng, Wei Wang, Fei Sun, Shaoming Sun, Yining Chen, Jingcheng Wang, Mu Biomed Eng Online Research PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collected through interviewing students and parents. One year after baseline, the incidence of myopia was evaluated based on visual acuity test and cycloplegic refraction measurement. Five algorithms, Random Forest, Support Vector Machines, Gradient Boosting Decision Tree, CatBoost and Logistic Regression were utilized to develop different models and their performance was validated by area under curve (AUC). Shapley Additive exPlanations was applied to interpret the model output on the individual and global level. RESULT: Of 2221 children, 260 (11.7%) developed myopia in 1 year. In univariable analysis, 26 features were associated with the myopia incidence. Catboost algorithm had the highest AUC of 0.951 in the model validation. The top 3 features for predicting myopia were parental myopia, grade and frequency of eye fatigue. A compact model using only 10 features was validated with an AUC of 0.891. CONCLUSION: The daily information contributed reliable predictors for childhood’s myopia onset. The interpretable Catboost model presented the best prediction performance. Oversampling technology greatly improved model performance. This model could be a tool in myopia preventing and intervention that can help identify children who are at risk of myopia, and provide personalized prevention strategies based on contributions of risk factors to the individual prediction result. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12938-023-01109-8. BioMed Central 2023-05-13 /pmc/articles/PMC10182351/ /pubmed/37179307 http://dx.doi.org/10.1186/s12938-023-01109-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Peng, Wei
Wang, Fei
Sun, Shaoming
Sun, Yining
Chen, Jingcheng
Wang, Mu
Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title_full Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title_fullStr Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title_full_unstemmed Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title_short Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
title_sort does multidimensional daily information predict the onset of myopia? a 1-year prospective cohort study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182351/
https://www.ncbi.nlm.nih.gov/pubmed/37179307
http://dx.doi.org/10.1186/s12938-023-01109-8
work_keys_str_mv AT pengwei doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy
AT wangfei doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy
AT sunshaoming doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy
AT sunyining doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy
AT chenjingcheng doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy
AT wangmu doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy