Cargando…
Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults
BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors ba...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10585776/ https://www.ncbi.nlm.nih.gov/pubmed/37858225 http://dx.doi.org/10.1186/s12911-023-02331-z |
_version_ | 1785123016821178368 |
---|---|
author | Shi, Yewen Zhang, Yitong Cao, Zine Ma, Lina Yuan, Yuqi Niu, Xiaoxin Su, Yonglong Xie, Yushan Chen, Xi Xing, Liang Hei, Xinhong Liu, Haiqin Wu, Shinan Li, Wenle Ren, Xiaoyong |
author_facet | Shi, Yewen Zhang, Yitong Cao, Zine Ma, Lina Yuan, Yuqi Niu, Xiaoxin Su, Yonglong Xie, Yushan Chen, Xi Xing, Liang Hei, Xinhong Liu, Haiqin Wu, Shinan Li, Wenle Ren, Xiaoyong |
author_sort | Shi, Yewen |
collection | PubMed |
description | BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors based on clinical characteristics and questionnaires. METHODS: This was a retrospective study comprising 1656 subjects who presented and underwent polysomnography (PSG) between 2018 and 2021. A total of 23 variables were included, and after univariate analysis, 15 variables were selected for further preprocessing. Six types of classification models were used to evaluate the ability to predict severe OSA, namely logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), bootstrapped aggregating (Bagging), and multilayer perceptron (MLP). All models used the area under the receiver operating characteristic curve (AUC) was calculated as the performance metric. We also drew SHapley Additive exPlanations (SHAP) plots to interpret predictive results and to analyze the relative importance of risk factors. An online calculator was developed to estimate the risk of severe OSA in individuals. RESULTS: Among the enrolled subjects, 61.47% (1018/1656) were diagnosed with severe OSA. Multivariate LR analysis showed that 10 of 23 variables were independent risk factors for severe OSA. The GBM model showed the best performance (AUC = 0.857, accuracy = 0.766, sensitivity = 0.798, specificity = 0.734). An online calculator was developed to estimate the risk of severe OSA based on the GBM model. Finally, waist circumference, neck circumference, the Epworth Sleepiness Scale, age, and the Berlin questionnaire were revealed by the SHAP plot as the top five critical variables contributing to the diagnosis of severe OSA. Additionally, two typical cases were analyzed to interpret the contribution of each variable to the outcome prediction in a single patient. CONCLUSIONS: We established six risk prediction models for severe OSA using ML algorithms. Among them, the GBM model performed best. The model facilitates individualized assessment and further clinical strategies for patients with suspected severe OSA. This will help to identify patients with severe OSA as early as possible and ensure their timely treatment. TRIAL REGISTRATION: Retrospectively registered. |
format | Online Article Text |
id | pubmed-10585776 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105857762023-10-20 Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults Shi, Yewen Zhang, Yitong Cao, Zine Ma, Lina Yuan, Yuqi Niu, Xiaoxin Su, Yonglong Xie, Yushan Chen, Xi Xing, Liang Hei, Xinhong Liu, Haiqin Wu, Shinan Li, Wenle Ren, Xiaoyong BMC Med Inform Decis Mak Research BACKGROUND: Obstructive sleep apnea (OSA) is a globally prevalent disease with a complex diagnostic method. Severe OSA is associated with multi-system dysfunction. We aimed to develop an interpretable machine learning (ML) model for predicting the risk of severe OSA and analyzing the risk factors based on clinical characteristics and questionnaires. METHODS: This was a retrospective study comprising 1656 subjects who presented and underwent polysomnography (PSG) between 2018 and 2021. A total of 23 variables were included, and after univariate analysis, 15 variables were selected for further preprocessing. Six types of classification models were used to evaluate the ability to predict severe OSA, namely logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGBoost), adaptive boosting (AdaBoost), bootstrapped aggregating (Bagging), and multilayer perceptron (MLP). All models used the area under the receiver operating characteristic curve (AUC) was calculated as the performance metric. We also drew SHapley Additive exPlanations (SHAP) plots to interpret predictive results and to analyze the relative importance of risk factors. An online calculator was developed to estimate the risk of severe OSA in individuals. RESULTS: Among the enrolled subjects, 61.47% (1018/1656) were diagnosed with severe OSA. Multivariate LR analysis showed that 10 of 23 variables were independent risk factors for severe OSA. The GBM model showed the best performance (AUC = 0.857, accuracy = 0.766, sensitivity = 0.798, specificity = 0.734). An online calculator was developed to estimate the risk of severe OSA based on the GBM model. Finally, waist circumference, neck circumference, the Epworth Sleepiness Scale, age, and the Berlin questionnaire were revealed by the SHAP plot as the top five critical variables contributing to the diagnosis of severe OSA. Additionally, two typical cases were analyzed to interpret the contribution of each variable to the outcome prediction in a single patient. CONCLUSIONS: We established six risk prediction models for severe OSA using ML algorithms. Among them, the GBM model performed best. The model facilitates individualized assessment and further clinical strategies for patients with suspected severe OSA. This will help to identify patients with severe OSA as early as possible and ensure their timely treatment. TRIAL REGISTRATION: Retrospectively registered. BioMed Central 2023-10-19 /pmc/articles/PMC10585776/ /pubmed/37858225 http://dx.doi.org/10.1186/s12911-023-02331-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Shi, Yewen Zhang, Yitong Cao, Zine Ma, Lina Yuan, Yuqi Niu, Xiaoxin Su, Yonglong Xie, Yushan Chen, Xi Xing, Liang Hei, Xinhong Liu, Haiqin Wu, Shinan Li, Wenle Ren, Xiaoyong Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title | Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title_full | Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title_fullStr | Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title_full_unstemmed | Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title_short | Application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
title_sort | application and interpretation of machine learning models in predicting the risk of severe obstructive sleep apnea in adults |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10585776/ https://www.ncbi.nlm.nih.gov/pubmed/37858225 http://dx.doi.org/10.1186/s12911-023-02331-z |
work_keys_str_mv | AT shiyewen applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT zhangyitong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT caozine applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT malina applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT yuanyuqi applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT niuxiaoxin applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT suyonglong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT xieyushan applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT chenxi applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT xingliang applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT heixinhong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT liuhaiqin applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT wushinan applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT liwenle applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults AT renxiaoyong applicationandinterpretationofmachinelearningmodelsinpredictingtheriskofsevereobstructivesleepapneainadults |