Cargando…

Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey

BACKGROUND: Dry eye disease (DED) is a complex disease of the ocular surface, and its associated factors are important for understanding and effectively treating DED. OBJECTIVE: This study aimed to provide an integrative and personalized model of DED by making an explanatory model of DED using as ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Nam, Sang Min, Peterson, Thomas A, Butte, Atul J, Seo, Kyoung Yul, Han, Hyun Wook
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059080/
https://www.ncbi.nlm.nih.gov/pubmed/32130150
http://dx.doi.org/10.2196/16153
_version_ 1783503976018214912
author Nam, Sang Min
Peterson, Thomas A
Butte, Atul J
Seo, Kyoung Yul
Han, Hyun Wook
author_facet Nam, Sang Min
Peterson, Thomas A
Butte, Atul J
Seo, Kyoung Yul
Han, Hyun Wook
author_sort Nam, Sang Min
collection PubMed
description BACKGROUND: Dry eye disease (DED) is a complex disease of the ocular surface, and its associated factors are important for understanding and effectively treating DED. OBJECTIVE: This study aimed to provide an integrative and personalized model of DED by making an explanatory model of DED using as many factors as possible from the Korea National Health and Nutrition Examination Survey (KNHANES) data. METHODS: Using KNHANES data for 2012 (4391 sample cases), a point-based scoring system was created for ranking factors associated with DED and assessing patient-specific DED risk. First, decision trees and lasso were used to classify continuous factors and to select important factors, respectively. Next, a survey-weighted multiple logistic regression was trained using these factors, and points were assigned using the regression coefficients. Finally, network graphs of partial correlations between factors were utilized to study the interrelatedness of DED-associated factors. RESULTS: The point-based model achieved an area under the curve of 0.70 (95% CI 0.61-0.78), and 13 of 78 factors considered were chosen. Important factors included sex (+9 points for women), corneal refractive surgery (+9 points), current depression (+7 points), cataract surgery (+7 points), stress (+6 points), age (54-66 years; +4 points), rhinitis (+4 points), lipid-lowering medication (+4 points), and intake of omega-3 (0.43%-0.65% kcal/day; −4 points). Among these, the age group 54 to 66 years had high centrality in the network, whereas omega-3 had low centrality. CONCLUSIONS: Integrative understanding of DED was possible using the machine learning–based model and network-based factor analysis. This method for finding important risk factors and identifying patient-specific risk could be applied to other multifactorial diseases.
format Online
Article
Text
id pubmed-7059080
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-70590802020-03-16 Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey Nam, Sang Min Peterson, Thomas A Butte, Atul J Seo, Kyoung Yul Han, Hyun Wook JMIR Med Inform Original Paper BACKGROUND: Dry eye disease (DED) is a complex disease of the ocular surface, and its associated factors are important for understanding and effectively treating DED. OBJECTIVE: This study aimed to provide an integrative and personalized model of DED by making an explanatory model of DED using as many factors as possible from the Korea National Health and Nutrition Examination Survey (KNHANES) data. METHODS: Using KNHANES data for 2012 (4391 sample cases), a point-based scoring system was created for ranking factors associated with DED and assessing patient-specific DED risk. First, decision trees and lasso were used to classify continuous factors and to select important factors, respectively. Next, a survey-weighted multiple logistic regression was trained using these factors, and points were assigned using the regression coefficients. Finally, network graphs of partial correlations between factors were utilized to study the interrelatedness of DED-associated factors. RESULTS: The point-based model achieved an area under the curve of 0.70 (95% CI 0.61-0.78), and 13 of 78 factors considered were chosen. Important factors included sex (+9 points for women), corneal refractive surgery (+9 points), current depression (+7 points), cataract surgery (+7 points), stress (+6 points), age (54-66 years; +4 points), rhinitis (+4 points), lipid-lowering medication (+4 points), and intake of omega-3 (0.43%-0.65% kcal/day; −4 points). Among these, the age group 54 to 66 years had high centrality in the network, whereas omega-3 had low centrality. CONCLUSIONS: Integrative understanding of DED was possible using the machine learning–based model and network-based factor analysis. This method for finding important risk factors and identifying patient-specific risk could be applied to other multifactorial diseases. JMIR Publications 2020-02-20 /pmc/articles/PMC7059080/ /pubmed/32130150 http://dx.doi.org/10.2196/16153 Text en ©Sang Min Nam, Thomas A Peterson, Atul J Butte, Kyoung Yul Seo, Hyun Wook Han. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 20.02.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Nam, Sang Min
Peterson, Thomas A
Butte, Atul J
Seo, Kyoung Yul
Han, Hyun Wook
Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title_full Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title_fullStr Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title_full_unstemmed Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title_short Explanatory Model of Dry Eye Disease Using Health and Nutrition Examinations: Machine Learning and Network-Based Factor Analysis From a National Survey
title_sort explanatory model of dry eye disease using health and nutrition examinations: machine learning and network-based factor analysis from a national survey
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7059080/
https://www.ncbi.nlm.nih.gov/pubmed/32130150
http://dx.doi.org/10.2196/16153
work_keys_str_mv AT namsangmin explanatorymodelofdryeyediseaseusinghealthandnutritionexaminationsmachinelearningandnetworkbasedfactoranalysisfromanationalsurvey
AT petersonthomasa explanatorymodelofdryeyediseaseusinghealthandnutritionexaminationsmachinelearningandnetworkbasedfactoranalysisfromanationalsurvey
AT butteatulj explanatorymodelofdryeyediseaseusinghealthandnutritionexaminationsmachinelearningandnetworkbasedfactoranalysisfromanationalsurvey
AT seokyoungyul explanatorymodelofdryeyediseaseusinghealthandnutritionexaminationsmachinelearningandnetworkbasedfactoranalysisfromanationalsurvey
AT hanhyunwook explanatorymodelofdryeyediseaseusinghealthandnutritionexaminationsmachinelearningandnetworkbasedfactoranalysisfromanationalsurvey