Cargando…

Use of machine learning to identify risk factors for insomnia

IMPORTANCE: Sleep is critical to a person’s physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders. OBJECTIVE: The objective of this study was to identify risk factors for a sleep disorder through machine-learning and assess this methodology....

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Alexander A., Huang, Samuel Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096447/
https://www.ncbi.nlm.nih.gov/pubmed/37043435
http://dx.doi.org/10.1371/journal.pone.0282622
_version_ 1785024338545606656
author Huang, Alexander A.
Huang, Samuel Y.
author_facet Huang, Alexander A.
Huang, Samuel Y.
author_sort Huang, Alexander A.
collection PubMed
description IMPORTANCE: Sleep is critical to a person’s physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders. OBJECTIVE: The objective of this study was to identify risk factors for a sleep disorder through machine-learning and assess this methodology. DESIGN, SETTING, AND PARTICIPANTS: A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data. METHODS: A physician diagnosis of insomnia was the outcome of this study. Univariate logistic models, with insomnia as the outcome, were used to identify covariates that were associated with insomnia. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the cover statistic to identify risk factors for insomnia. Shapely Additive Explanations (SHAP) were utilized to visualize the relationship between these potential risk factors and insomnia. RESULTS: Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51% were female, 3,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. The machine learning model had 64 out of a total of 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.87, Sensitivity = 0.77, Specificity = 0.77 were observed. The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were the Patient Health Questionnaire depression survey (PHQ-9) (Cover = 31.1%), age (Cover = 7.54%), physician recommendation of exercise (Cover = 3.86%), weight (Cover = 2.99%), and waist circumference (Cover = 2.70%). CONCLUSION: Machine learning models can effectively predict risk for a sleep disorder using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors.
format Online
Article
Text
id pubmed-10096447
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-100964472023-04-13 Use of machine learning to identify risk factors for insomnia Huang, Alexander A. Huang, Samuel Y. PLoS One Research Article IMPORTANCE: Sleep is critical to a person’s physical and mental health, but there are few studies systematically assessing risk factors for sleep disorders. OBJECTIVE: The objective of this study was to identify risk factors for a sleep disorder through machine-learning and assess this methodology. DESIGN, SETTING, AND PARTICIPANTS: A retrospective, cross-sectional cohort study using the publicly available National Health and Nutrition Examination Survey (NHANES) was conducted in patients who completed the demographic, dietary, exercise, and mental health questionnaire and had laboratory and physical exam data. METHODS: A physician diagnosis of insomnia was the outcome of this study. Univariate logistic models, with insomnia as the outcome, were used to identify covariates that were associated with insomnia. Covariates that had a p<0.0001 on univariate analysis were included within the final machine-learning model. The machine learning model XGBoost was used due to its prevalence within the literature as well as its increased predictive accuracy in healthcare prediction. Model covariates were ranked according to the cover statistic to identify risk factors for insomnia. Shapely Additive Explanations (SHAP) were utilized to visualize the relationship between these potential risk factors and insomnia. RESULTS: Of the 7,929 patients that met the inclusion criteria in this study, 4,055 (51% were female, 3,874 (49%) were male. The mean age was 49.2 (SD = 18.4), with 2,885 (36%) White patients, 2,144 (27%) Black patients, 1,639 (21%) Hispanic patients, and 1,261 (16%) patients of another race. The machine learning model had 64 out of a total of 684 features that were found to be significant on univariate analysis (P<0.0001 used). These were fitted into the XGBoost model and an AUROC = 0.87, Sensitivity = 0.77, Specificity = 0.77 were observed. The top four highest ranked features by cover, a measure of the percentage contribution of the covariate to the overall model prediction, were the Patient Health Questionnaire depression survey (PHQ-9) (Cover = 31.1%), age (Cover = 7.54%), physician recommendation of exercise (Cover = 3.86%), weight (Cover = 2.99%), and waist circumference (Cover = 2.70%). CONCLUSION: Machine learning models can effectively predict risk for a sleep disorder using demographic, laboratory, physical exam, and lifestyle covariates and identify key risk factors. Public Library of Science 2023-04-12 /pmc/articles/PMC10096447/ /pubmed/37043435 http://dx.doi.org/10.1371/journal.pone.0282622 Text en © 2023 Huang, Huang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huang, Alexander A.
Huang, Samuel Y.
Use of machine learning to identify risk factors for insomnia
title Use of machine learning to identify risk factors for insomnia
title_full Use of machine learning to identify risk factors for insomnia
title_fullStr Use of machine learning to identify risk factors for insomnia
title_full_unstemmed Use of machine learning to identify risk factors for insomnia
title_short Use of machine learning to identify risk factors for insomnia
title_sort use of machine learning to identify risk factors for insomnia
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10096447/
https://www.ncbi.nlm.nih.gov/pubmed/37043435
http://dx.doi.org/10.1371/journal.pone.0282622
work_keys_str_mv AT huangalexandera useofmachinelearningtoidentifyriskfactorsforinsomnia
AT huangsamuely useofmachinelearningtoidentifyriskfactorsforinsomnia