Cargando…
Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study
PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collecte...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182351/ https://www.ncbi.nlm.nih.gov/pubmed/37179307 http://dx.doi.org/10.1186/s12938-023-01109-8 |
_version_ | 1785041753252823040 |
---|---|
author | Peng, Wei Wang, Fei Sun, Shaoming Sun, Yining Chen, Jingcheng Wang, Mu |
author_facet | Peng, Wei Wang, Fei Sun, Shaoming Sun, Yining Chen, Jingcheng Wang, Mu |
author_sort | Peng, Wei |
collection | PubMed |
description | PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collected through interviewing students and parents. One year after baseline, the incidence of myopia was evaluated based on visual acuity test and cycloplegic refraction measurement. Five algorithms, Random Forest, Support Vector Machines, Gradient Boosting Decision Tree, CatBoost and Logistic Regression were utilized to develop different models and their performance was validated by area under curve (AUC). Shapley Additive exPlanations was applied to interpret the model output on the individual and global level. RESULT: Of 2221 children, 260 (11.7%) developed myopia in 1 year. In univariable analysis, 26 features were associated with the myopia incidence. Catboost algorithm had the highest AUC of 0.951 in the model validation. The top 3 features for predicting myopia were parental myopia, grade and frequency of eye fatigue. A compact model using only 10 features was validated with an AUC of 0.891. CONCLUSION: The daily information contributed reliable predictors for childhood’s myopia onset. The interpretable Catboost model presented the best prediction performance. Oversampling technology greatly improved model performance. This model could be a tool in myopia preventing and intervention that can help identify children who are at risk of myopia, and provide personalized prevention strategies based on contributions of risk factors to the individual prediction result. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12938-023-01109-8. |
format | Online Article Text |
id | pubmed-10182351 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-101823512023-05-14 Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study Peng, Wei Wang, Fei Sun, Shaoming Sun, Yining Chen, Jingcheng Wang, Mu Biomed Eng Online Research PURPOSE: This study aimed to develop an interpretable machine learning model to predict the onset of myopia based on individual daily information. METHOD: This study was a prospective cohort study. At baseline, non-myopia children aged 6–13 years old were recruited, and individual data were collected through interviewing students and parents. One year after baseline, the incidence of myopia was evaluated based on visual acuity test and cycloplegic refraction measurement. Five algorithms, Random Forest, Support Vector Machines, Gradient Boosting Decision Tree, CatBoost and Logistic Regression were utilized to develop different models and their performance was validated by area under curve (AUC). Shapley Additive exPlanations was applied to interpret the model output on the individual and global level. RESULT: Of 2221 children, 260 (11.7%) developed myopia in 1 year. In univariable analysis, 26 features were associated with the myopia incidence. Catboost algorithm had the highest AUC of 0.951 in the model validation. The top 3 features for predicting myopia were parental myopia, grade and frequency of eye fatigue. A compact model using only 10 features was validated with an AUC of 0.891. CONCLUSION: The daily information contributed reliable predictors for childhood’s myopia onset. The interpretable Catboost model presented the best prediction performance. Oversampling technology greatly improved model performance. This model could be a tool in myopia preventing and intervention that can help identify children who are at risk of myopia, and provide personalized prevention strategies based on contributions of risk factors to the individual prediction result. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12938-023-01109-8. BioMed Central 2023-05-13 /pmc/articles/PMC10182351/ /pubmed/37179307 http://dx.doi.org/10.1186/s12938-023-01109-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Peng, Wei Wang, Fei Sun, Shaoming Sun, Yining Chen, Jingcheng Wang, Mu Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title | Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title_full | Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title_fullStr | Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title_full_unstemmed | Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title_short | Does multidimensional daily information predict the onset of myopia? A 1-year prospective cohort study |
title_sort | does multidimensional daily information predict the onset of myopia? a 1-year prospective cohort study |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182351/ https://www.ncbi.nlm.nih.gov/pubmed/37179307 http://dx.doi.org/10.1186/s12938-023-01109-8 |
work_keys_str_mv | AT pengwei doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy AT wangfei doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy AT sunshaoming doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy AT sunyining doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy AT chenjingcheng doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy AT wangmu doesmultidimensionaldailyinformationpredicttheonsetofmyopiaa1yearprospectivecohortstudy |