Cargando…

Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults

BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors ba...

Descripción completa

Detalles Bibliográficos
Autores principales: Shi, Yewen, Zhang, Yitong, Cao, Zine, Ma, Lina, Yuan, Yuqi, Niu, Xiaoxin, Su, Yonglong, Xie, Yushan, Chen, Xi, Xing, Liang, Hei, Xinhong, Liu, Haiqin, Wu, Shinan, Li, Wenle, Ren, Xiaoyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10585776/
https://www.ncbi.nlm.nih.gov/pubmed/37858225
http://dx.doi.org/10.1186/s12911-023-02331-z
_version_ 1785123016821178368
author Shi, Yewen
Zhang, Yitong
Cao, Zine
Ma, Lina
Yuan, Yuqi
Niu, Xiaoxin
Su, Yonglong
Xie, Yushan
Chen, Xi
Xing, Liang
Hei, Xinhong
Liu, Haiqin
Wu, Shinan
Li, Wenle
Ren, Xiaoyong
author_facet Shi, Yewen
Zhang, Yitong
Cao, Zine
Ma, Lina
Yuan, Yuqi
Niu, Xiaoxin
Su, Yonglong
Xie, Yushan
Chen, Xi
Xing, Liang
Hei, Xinhong
Liu, Haiqin
Wu, Shinan
Li, Wenle
Ren, Xiaoyong
author_sort Shi, Yewen
collection PubMed
description BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors based on clinical characteristics and questionnaires. METHODS: This was a retrospective study comprising 1656 subjects who presented and underwent polysomnography (PSG) between 2018 and 2021. A total of 23 variables were included, and after univariate analysis, 15 variables were selected for further preprocessing. Six types of classification models were used to evaluate the ability to predict severe OSA, namely logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), bootstrapped aggregating (Bagging), and multilayer perceptron (MLP). All models used the area under the receiver operating characteristic curve (AUC) was calculated as the performance metric. We also drew SHapley Additive exPlanations (SHAP) plots to interpret predictive results and to analyze the relative importance of risk factors. An online calculator was developed to estimate the risk of severe OSA in individuals. RESULTS: Among the enrolled subjects, 61.47% (1018/1656) were diagnosed with severe OSA. Multivariate LR analysis showed that 10 of 23 variables were independent risk factors for severe OSA. The GBM model showed the best performance (AUC = 0.857, accuracy = 0.766, sensitivity = 0.798, specificity = 0.734). An online calculator was developed to estimate the risk of severe OSA based on the GBM model. Finally, waist circumference, neck circumference, the Epworth Sleepiness Scale, age, and the Berlin questionnaire were revealed by the SHAP plot as the top five critical variables contributing to the diagnosis of severe OSA. Additionally, two typical cases were analyzed to interpret the contribution of each variable to the outcome prediction in a single patient. CONCLUSIONS: We established six risk prediction models for severe OSA using ML algorithms. Among them, the GBM model performed best. The model facilitates individualized assessment and further clinical strategies for patients with suspected severe OSA. This will help to identify patients with severe OSA as early as possible and ensure their timely treatment. TRIAL REGISTRATION: Retrospectively registered.
format Online
Article
Text
id pubmed-10585776
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105857762023-10-20 Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults Shi, Yewen Zhang, Yitong Cao, Zine Ma, Lina Yuan, Yuqi Niu, Xiaoxin Su, Yonglong Xie, Yushan Chen, Xi Xing, Liang Hei, Xinhong Liu, Haiqin Wu, Shinan Li, Wenle Ren, Xiaoyong BMC Med Inform Decis Mak Research BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors based on clinical characteristics and questionnaires. METHODS: This was a retrospective study comprising 1656 subjects who presented and underwent polysomnography (PSG) between 2018 and 2021. A total of 23 variables were included, and after univariate analysis, 15 variables were selected for further preprocessing. Six types of classification models were used to evaluate the ability to predict severe OSA, namely logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), bootstrapped aggregating (Bagging), and multilayer perceptron (MLP). All models used the area under the receiver operating characteristic curve (AUC) was calculated as the performance metric. We also drew SHapley Additive exPlanations (SHAP) plots to interpret predictive results and to analyze the relative importance of risk factors. An online calculator was developed to estimate the risk of severe OSA in individuals. RESULTS: Among the enrolled subjects, 61.47% (1018/1656) were diagnosed with severe OSA. Multivariate LR analysis showed that 10 of 23 variables were independent risk factors for severe OSA. The GBM model showed the best performance (AUC = 0.857, accuracy = 0.766, sensitivity = 0.798, specificity = 0.734). An online calculator was developed to estimate the risk of severe OSA based on the GBM model. Finally, waist circumference, neck circumference, the Epworth Sleepiness Scale, age, and the Berlin questionnaire were revealed by the SHAP plot as the top five critical variables contributing to the diagnosis of severe OSA. Additionally, two typical cases were analyzed to interpret the contribution of each variable to the outcome prediction in a single patient. CONCLUSIONS: We established six risk prediction models for severe OSA using ML algorithms. Among them, the GBM model performed best. The model facilitates individualized assessment and further clinical strategies for patients with suspected severe OSA. This will help to identify patients with severe OSA as early as possible and ensure their timely treatment. TRIAL REGISTRATION: Retrospectively registered. BioMed Central 2023-10-19 /pmc/articles/PMC10585776/ /pubmed/37858225 http://dx.doi.org/10.1186/s12911-023-02331-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shi, Yewen
Zhang, Yitong
Cao, Zine
Ma, Lina
Yuan, Yuqi
Niu, Xiaoxin
Su, Yonglong
Xie, Yushan
Chen, Xi
Xing, Liang
Hei, Xinhong
Liu, Haiqin
Wu, Shinan
Li, Wenle
Ren, Xiaoyong
Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title_full Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title_fullStr Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title_full_unstemmed Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title_short Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
title_sort application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10585776/
https://www.ncbi.nlm.nih.gov/pubmed/37858225
http://dx.doi.org/10.1186/s12911-023-02331-z
work_keys_str_mv AT shiyewen applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT zhangyitong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT caozine applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT malina applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT yuanyuqi applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT niuxiaoxin applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT suyonglong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT xieyushan applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT chenxi applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT xingliang applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT heixinhong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT liuhaiqin applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT wushinan applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT liwenle applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults
AT renxiaoyong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults