Cargando…

Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data

BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Ting-Ting, Zhang, Jia-Feng, Cheng, Zhuo, Jiang, Lei, Li, Jiang-Yan, Zhou, Lin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637149/
https://www.ncbi.nlm.nih.gov/pubmed/37941347
http://dx.doi.org/10.1177/17534666231208632
_version_ 1785133345719451648
author Wei, Ting-Ting
Zhang, Jia-Feng
Cheng, Zhuo
Jiang, Lei
Li, Jiang-Yan
Zhou, Lin
author_facet Wei, Ting-Ting
Zhang, Jia-Feng
Cheng, Zhuo
Jiang, Lei
Li, Jiang-Yan
Zhou, Lin
author_sort Wei, Ting-Ting
collection PubMed
description BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and validate a diagnostic model for early differentiation between MPE and BPE based on routine laboratory data. DESIGN: This was a retrospective observational cohort study. METHODS: A total of 2352 newly diagnosed patients with pleural effusion (PE), between January 2008 and March 2021, were eventually enrolled. Among them, 1435, 466, and 451 participants were randomly assigned to the training, validation, and testing cohorts in a ratio of 3:1:1. Clinical parameters, including age, sex, and laboratory parameters of PE patients, were abstracted for analysis. Based on 81 candidate laboratory variables, five machine learning models, namely extreme gradient boosting (XGBoost) model, logistic regression (LR) model, random forest (RF) model, support vector machine (SVM) model, and multilayer perceptron (MLP) model were developed. Their respective diagnostic performances for MPE were evaluated by receiver operating characteristic (ROC) curves. RESULTS: Among the five models, the XGBoost model exhibited the best diagnostic performance for MPE (area under the curve (AUC): 0.903, 0.918, and 0.886 in the training, validation, and testing cohorts, respectively). Additionally, the XGBoost model outperformed carcinoembryonic antigen (CEA) levels in pleural fluid (PF), serum, and the PF/serum ratio (AUC: 0.726, 0.699, and 0.692 in the training cohort; 0.763, 0.695, and 0.731 in the validation cohort; and 0.722, 0.729, and 0.693 in the testing cohort, respectively). Furthermore, compared with CEA, the XGBoost model demonstrated greater diagnostic power and sensitivity in diagnosing lung cancer-induced MPE. CONCLUSION: The development of a machine learning model utilizing routine laboratory biomarkers significantly enhances the diagnostic capability for distinguishing between MPE and BPE. The XGBoost model emerges as a valuable tool for the diagnosis of MPE.
format Online
Article
Text
id pubmed-10637149
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-106371492023-11-11 Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data Wei, Ting-Ting Zhang, Jia-Feng Cheng, Zhuo Jiang, Lei Li, Jiang-Yan Zhou, Lin Ther Adv Respir Dis Original Research BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and validate a diagnostic model for early differentiation between MPE and BPE based on routine laboratory data. DESIGN: This was a retrospective observational cohort study. METHODS: A total of 2352 newly diagnosed patients with pleural effusion (PE), between January 2008 and March 2021, were eventually enrolled. Among them, 1435, 466, and 451 participants were randomly assigned to the training, validation, and testing cohorts in a ratio of 3:1:1. Clinical parameters, including age, sex, and laboratory parameters of PE patients, were abstracted for analysis. Based on 81 candidate laboratory variables, five machine learning models, namely extreme gradient boosting (XGBoost) model, logistic regression (LR) model, random forest (RF) model, support vector machine (SVM) model, and multilayer perceptron (MLP) model were developed. Their respective diagnostic performances for MPE were evaluated by receiver operating characteristic (ROC) curves. RESULTS: Among the five models, the XGBoost model exhibited the best diagnostic performance for MPE (area under the curve (AUC): 0.903, 0.918, and 0.886 in the training, validation, and testing cohorts, respectively). Additionally, the XGBoost model outperformed carcinoembryonic antigen (CEA) levels in pleural fluid (PF), serum, and the PF/serum ratio (AUC: 0.726, 0.699, and 0.692 in the training cohort; 0.763, 0.695, and 0.731 in the validation cohort; and 0.722, 0.729, and 0.693 in the testing cohort, respectively). Furthermore, compared with CEA, the XGBoost model demonstrated greater diagnostic power and sensitivity in diagnosing lung cancer-induced MPE. CONCLUSION: The development of a machine learning model utilizing routine laboratory biomarkers significantly enhances the diagnostic capability for distinguishing between MPE and BPE. The XGBoost model emerges as a valuable tool for the diagnosis of MPE. SAGE Publications 2023-11-08 /pmc/articles/PMC10637149/ /pubmed/37941347 http://dx.doi.org/10.1177/17534666231208632 Text en © The Author(s), 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Wei, Ting-Ting
Zhang, Jia-Feng
Cheng, Zhuo
Jiang, Lei
Li, Jiang-Yan
Zhou, Lin
Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title_full Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title_fullStr Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title_full_unstemmed Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title_short Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
title_sort development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637149/
https://www.ncbi.nlm.nih.gov/pubmed/37941347
http://dx.doi.org/10.1177/17534666231208632
work_keys_str_mv AT weitingting developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata
AT zhangjiafeng developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata
AT chengzhuo developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata
AT jianglei developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata
AT lijiangyan developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata
AT zhoulin developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata