Cargando…

Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study

BACKGROUND/AIMS: To investigate the feasibility and accuracy of using machine learning (ML) techniques on self-reported questionnaire data to predict the 10-year risk of cataract surgery, and to identify meaningful predictors of cataract surgery in middle-aged and older Australians. METHODS: Baselin...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Wei, Han, Xiaotong, Zhang, Jiaqing, Shang, Xianwen, Ha, Jason, Liu, Zhenzhen, Zhang, Lei, Luo, Lixia, He, Mingguang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9606508/
https://www.ncbi.nlm.nih.gov/pubmed/34039562
http://dx.doi.org/10.1136/bjophthalmol-2020-318609
_version_ 1784818313252044800
author Wang, Wei
Han, Xiaotong
Zhang, Jiaqing
Shang, Xianwen
Ha, Jason
Liu, Zhenzhen
Zhang, Lei
Luo, Lixia
He, Mingguang
author_facet Wang, Wei
Han, Xiaotong
Zhang, Jiaqing
Shang, Xianwen
Ha, Jason
Liu, Zhenzhen
Zhang, Lei
Luo, Lixia
He, Mingguang
author_sort Wang, Wei
collection PubMed
description BACKGROUND/AIMS: To investigate the feasibility and accuracy of using machine learning (ML) techniques on self-reported questionnaire data to predict the 10-year risk of cataract surgery, and to identify meaningful predictors of cataract surgery in middle-aged and older Australians. METHODS: Baseline information regarding demographic, socioeconomic, medical history and family history, lifestyle, dietary and self-rated health status were collected as risk factors. Cataract surgery events were confirmed by the Medicare Benefits Schedule Claims dataset. Three ML algorithms (random forests [RF], gradient boosting machine and deep learning) and one traditional regression algorithm (logistic model) were compared on the accuracy of their predictions for the risk of cataract surgery. The performance was assessed using 10-fold cross-validation. The main outcome measures were areas under the receiver operating characteristic curves (AUCs). RESULTS: In total, 207 573 participants, aged 45 years and above without a history of cataract surgery at baseline, were recruited from the 45 and Up Study. The performance of gradient boosting machine (AUC 0.790, 95% CI 0.785 to 0.795), RF (AUC 0.785, 95% CI 0.780 to 0.790) and deep learning (AUC 0.781, 95% CI 0.775 to 61 0.786) were robust and outperformed the traditional logistic regression method (AUC 0.767, 95% CI 0.762 to 0.773, all p<0.05). Age, self-rated eye vision and health insurance were consistently identified as important predictors in all models. CONCLUSIONS: The study demonstrated that ML modelling was able to reasonably accurately predict the 10-year risk of cataract surgery based on questionnaire data alone and was marginally superior to the conventional logistic model.
format Online
Article
Text
id pubmed-9606508
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-96065082022-10-28 Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study Wang, Wei Han, Xiaotong Zhang, Jiaqing Shang, Xianwen Ha, Jason Liu, Zhenzhen Zhang, Lei Luo, Lixia He, Mingguang Br J Ophthalmol Clinical Science BACKGROUND/AIMS: To investigate the feasibility and accuracy of using machine learning (ML) techniques on self-reported questionnaire data to predict the 10-year risk of cataract surgery, and to identify meaningful predictors of cataract surgery in middle-aged and older Australians. METHODS: Baseline information regarding demographic, socioeconomic, medical history and family history, lifestyle, dietary and self-rated health status were collected as risk factors. Cataract surgery events were confirmed by the Medicare Benefits Schedule Claims dataset. Three ML algorithms (random forests [RF], gradient boosting machine and deep learning) and one traditional regression algorithm (logistic model) were compared on the accuracy of their predictions for the risk of cataract surgery. The performance was assessed using 10-fold cross-validation. The main outcome measures were areas under the receiver operating characteristic curves (AUCs). RESULTS: In total, 207 573 participants, aged 45 years and above without a history of cataract surgery at baseline, were recruited from the 45 and Up Study. The performance of gradient boosting machine (AUC 0.790, 95% CI 0.785 to 0.795), RF (AUC 0.785, 95% CI 0.780 to 0.790) and deep learning (AUC 0.781, 95% CI 0.775 to 61 0.786) were robust and outperformed the traditional logistic regression method (AUC 0.767, 95% CI 0.762 to 0.773, all p<0.05). Age, self-rated eye vision and health insurance were consistently identified as important predictors in all models. CONCLUSIONS: The study demonstrated that ML modelling was able to reasonably accurately predict the 10-year risk of cataract surgery based on questionnaire data alone and was marginally superior to the conventional logistic model. BMJ Publishing Group 2022-11 2021-05-26 /pmc/articles/PMC9606508/ /pubmed/34039562 http://dx.doi.org/10.1136/bjophthalmol-2020-318609 Text en © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Clinical Science
Wang, Wei
Han, Xiaotong
Zhang, Jiaqing
Shang, Xianwen
Ha, Jason
Liu, Zhenzhen
Zhang, Lei
Luo, Lixia
He, Mingguang
Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title_full Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title_fullStr Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title_full_unstemmed Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title_short Predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and Up Study
title_sort predicting the 10-year risk of cataract surgery using machine learning techniques on questionnaire data: findings from the 45 and up study
topic Clinical Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9606508/
https://www.ncbi.nlm.nih.gov/pubmed/34039562
http://dx.doi.org/10.1136/bjophthalmol-2020-318609
work_keys_str_mv AT wangwei predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT hanxiaotong predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT zhangjiaqing predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT shangxianwen predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT hajason predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT liuzhenzhen predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT zhanglei predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT luolixia predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy
AT hemingguang predictingthe10yearriskofcataractsurgeryusingmachinelearningtechniquesonquestionnairedatafindingsfromthe45andupstudy