Cargando…

Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population

BACKGROUND: Combined pulmonary fibrosis and emphysema (CPFE) is a novel clinical entity with a poor prognosis. This study aimed to develop a clinical nomogram model to predict the 1-, 2- and 3-year mortality of patients with CPFE by using the machine learning approach, and to validate the predictive...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Qing, Sun, Di, Wang, Yu, Li, Pengfei, Jiang, Tianci, Dai, Lingling, Duo, Mengjie, Wu, Ruhao, Cheng, Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9422147/
https://www.ncbi.nlm.nih.gov/pubmed/36038872
http://dx.doi.org/10.1186/s12890-022-02124-6
_version_ 1784777750363504640
author Liu, Qing
Sun, Di
Wang, Yu
Li, Pengfei
Jiang, Tianci
Dai, Lingling
Duo, Mengjie
Wu, Ruhao
Cheng, Zhe
author_facet Liu, Qing
Sun, Di
Wang, Yu
Li, Pengfei
Jiang, Tianci
Dai, Lingling
Duo, Mengjie
Wu, Ruhao
Cheng, Zhe
author_sort Liu, Qing
collection PubMed
description BACKGROUND: Combined pulmonary fibrosis and emphysema (CPFE) is a novel clinical entity with a poor prognosis. This study aimed to develop a clinical nomogram model to predict the 1-, 2- and 3-year mortality of patients with CPFE by using the machine learning approach, and to validate the predictive ability of the interstitial lung disease-gender-age-lung physiology (ILD-GAP) model in CPFE. METHODS: The data of CPFE patients from January 2015 to October 2021 who met the inclusion criteria were retrospectively collected. We utilized LASSO regression and multivariable Cox regression analysis to identify the variables associated with the prognosis of CPFE and generate a nomogram. The Harrell's C index, the calibration curve and the area under the receiver operating characteristic (ROC) curve (AUC) were used to evaluate the performance of the nomogram. Then, we performed likelihood ratio test, net reclassification improvement (NRI), integrated discrimination improvement (IDI) and decision curve analysis (DCA) to compare the performance of the nomogram with that of the ILD-GAP model. RESULTS: A total of 184 patients with CPFE were enrolled. During the follow-up, 90 patients died. After screening out, diffusing lung capacity for carbon monoxide (DLCO), right ventricular diameter (RVD), C-reactive protein (CRP), and globulin were found to be associated with the prognosis of CPFE. The nomogram was then developed by incorporating the above five variables, and it showed a good performance, with a Harrell's C index of 0.757 and an AUC of 0.800 (95% CI 0.736–0.863). Moreover, the calibration plot of the nomogram showed good concordance between the prediction probabilities and the actual observations. The nomogram also improved the discrimination ability of the ILD-GAP model compared to that of the ILD-GAP model alone, and this was substantiated by the likelihood ratio test, NRI and IDI. The significant clinical utility of the nomogram was demonstrated by DCA. CONCLUSION: Age, DLCO, RVD, CRP and globulin were identified as being significantly associated with the prognosis of CPFE in our cohort. The nomogram incorporating the 5 variables showed good performance in predicting the mortality of CPFE. In addition, although the nomogram was superior to the ILD-GAP model in the present cohort, further validation is needed to determine the clinical utility of the nomogram.
format Online
Article
Text
id pubmed-9422147
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94221472022-08-30 Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population Liu, Qing Sun, Di Wang, Yu Li, Pengfei Jiang, Tianci Dai, Lingling Duo, Mengjie Wu, Ruhao Cheng, Zhe BMC Pulm Med Research BACKGROUND: Combined pulmonary fibrosis and emphysema (CPFE) is a novel clinical entity with a poor prognosis. This study aimed to develop a clinical nomogram model to predict the 1-, 2- and 3-year mortality of patients with CPFE by using the machine learning approach, and to validate the predictive ability of the interstitial lung disease-gender-age-lung physiology (ILD-GAP) model in CPFE. METHODS: The data of CPFE patients from January 2015 to October 2021 who met the inclusion criteria were retrospectively collected. We utilized LASSO regression and multivariable Cox regression analysis to identify the variables associated with the prognosis of CPFE and generate a nomogram. The Harrell's C index, the calibration curve and the area under the receiver operating characteristic (ROC) curve (AUC) were used to evaluate the performance of the nomogram. Then, we performed likelihood ratio test, net reclassification improvement (NRI), integrated discrimination improvement (IDI) and decision curve analysis (DCA) to compare the performance of the nomogram with that of the ILD-GAP model. RESULTS: A total of 184 patients with CPFE were enrolled. During the follow-up, 90 patients died. After screening out, diffusing lung capacity for carbon monoxide (DLCO), right ventricular diameter (RVD), C-reactive protein (CRP), and globulin were found to be associated with the prognosis of CPFE. The nomogram was then developed by incorporating the above five variables, and it showed a good performance, with a Harrell's C index of 0.757 and an AUC of 0.800 (95% CI 0.736–0.863). Moreover, the calibration plot of the nomogram showed good concordance between the prediction probabilities and the actual observations. The nomogram also improved the discrimination ability of the ILD-GAP model compared to that of the ILD-GAP model alone, and this was substantiated by the likelihood ratio test, NRI and IDI. The significant clinical utility of the nomogram was demonstrated by DCA. CONCLUSION: Age, DLCO, RVD, CRP and globulin were identified as being significantly associated with the prognosis of CPFE in our cohort. The nomogram incorporating the 5 variables showed good performance in predicting the mortality of CPFE. In addition, although the nomogram was superior to the ILD-GAP model in the present cohort, further validation is needed to determine the clinical utility of the nomogram. BioMed Central 2022-08-29 /pmc/articles/PMC9422147/ /pubmed/36038872 http://dx.doi.org/10.1186/s12890-022-02124-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liu, Qing
Sun, Di
Wang, Yu
Li, Pengfei
Jiang, Tianci
Dai, Lingling
Duo, Mengjie
Wu, Ruhao
Cheng, Zhe
Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title_full Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title_fullStr Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title_full_unstemmed Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title_short Use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a Chinese population
title_sort use of machine learning models to predict prognosis of combined pulmonary fibrosis and emphysema in a chinese population
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9422147/
https://www.ncbi.nlm.nih.gov/pubmed/36038872
http://dx.doi.org/10.1186/s12890-022-02124-6
work_keys_str_mv AT liuqing useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT sundi useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT wangyu useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT lipengfei useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT jiangtianci useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT dailingling useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT duomengjie useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT wuruhao useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation
AT chengzhe useofmachinelearningmodelstopredictprognosisofcombinedpulmonaryfibrosisandemphysemainachinesepopulation