Cargando…
Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data
BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637149/ https://www.ncbi.nlm.nih.gov/pubmed/37941347 http://dx.doi.org/10.1177/17534666231208632 |
_version_ | 1785133345719451648 |
---|---|
author | Wei, Ting-Ting Zhang, Jia-Feng Cheng, Zhuo Jiang, Lei Li, Jiang-Yan Zhou, Lin |
author_facet | Wei, Ting-Ting Zhang, Jia-Feng Cheng, Zhuo Jiang, Lei Li, Jiang-Yan Zhou, Lin |
author_sort | Wei, Ting-Ting |
collection | PubMed |
description | BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and validate a diagnostic model for early differentiation between MPE and BPE based on routine laboratory data. DESIGN: This was a retrospective observational cohort study. METHODS: A total of 2352 newly diagnosed patients with pleural effusion (PE), between January 2008 and March 2021, were eventually enrolled. Among them, 1435, 466, and 451 participants were randomly assigned to the training, validation, and testing cohorts in a ratio of 3:1:1. Clinical parameters, including age, sex, and laboratory parameters of PE patients, were abstracted for analysis. Based on 81 candidate laboratory variables, five machine learning models, namely extreme gradient boosting (XGBoost) model, logistic regression (LR) model, random forest (RF) model, support vector machine (SVM) model, and multilayer perceptron (MLP) model were developed. Their respective diagnostic performances for MPE were evaluated by receiver operating characteristic (ROC) curves. RESULTS: Among the five models, the XGBoost model exhibited the best diagnostic performance for MPE (area under the curve (AUC): 0.903, 0.918, and 0.886 in the training, validation, and testing cohorts, respectively). Additionally, the XGBoost model outperformed carcinoembryonic antigen (CEA) levels in pleural fluid (PF), serum, and the PF/serum ratio (AUC: 0.726, 0.699, and 0.692 in the training cohort; 0.763, 0.695, and 0.731 in the validation cohort; and 0.722, 0.729, and 0.693 in the testing cohort, respectively). Furthermore, compared with CEA, the XGBoost model demonstrated greater diagnostic power and sensitivity in diagnosing lung cancer-induced MPE. CONCLUSION: The development of a machine learning model utilizing routine laboratory biomarkers significantly enhances the diagnostic capability for distinguishing between MPE and BPE. The XGBoost model emerges as a valuable tool for the diagnosis of MPE. |
format | Online Article Text |
id | pubmed-10637149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-106371492023-11-11 Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data Wei, Ting-Ting Zhang, Jia-Feng Cheng, Zhuo Jiang, Lei Li, Jiang-Yan Zhou, Lin Ther Adv Respir Dis Original Research BACKGROUND: The differential diagnosis of malignant pleural effusion (MPE) and benign pleural effusion (BPE) presents a clinical challenge. In recent years, the use of artificial intelligence (AI) machine learning models for disease diagnosis has increased. OBJECTIVE: This study aimed to develop and validate a diagnostic model for early differentiation between MPE and BPE based on routine laboratory data. DESIGN: This was a retrospective observational cohort study. METHODS: A total of 2352 newly diagnosed patients with pleural effusion (PE), between January 2008 and March 2021, were eventually enrolled. Among them, 1435, 466, and 451 participants were randomly assigned to the training, validation, and testing cohorts in a ratio of 3:1:1. Clinical parameters, including age, sex, and laboratory parameters of PE patients, were abstracted for analysis. Based on 81 candidate laboratory variables, five machine learning models, namely extreme gradient boosting (XGBoost) model, logistic regression (LR) model, random forest (RF) model, support vector machine (SVM) model, and multilayer perceptron (MLP) model were developed. Their respective diagnostic performances for MPE were evaluated by receiver operating characteristic (ROC) curves. RESULTS: Among the five models, the XGBoost model exhibited the best diagnostic performance for MPE (area under the curve (AUC): 0.903, 0.918, and 0.886 in the training, validation, and testing cohorts, respectively). Additionally, the XGBoost model outperformed carcinoembryonic antigen (CEA) levels in pleural fluid (PF), serum, and the PF/serum ratio (AUC: 0.726, 0.699, and 0.692 in the training cohort; 0.763, 0.695, and 0.731 in the validation cohort; and 0.722, 0.729, and 0.693 in the testing cohort, respectively). Furthermore, compared with CEA, the XGBoost model demonstrated greater diagnostic power and sensitivity in diagnosing lung cancer-induced MPE. CONCLUSION: The development of a machine learning model utilizing routine laboratory biomarkers significantly enhances the diagnostic capability for distinguishing between MPE and BPE. The XGBoost model emerges as a valuable tool for the diagnosis of MPE. SAGE Publications 2023-11-08 /pmc/articles/PMC10637149/ /pubmed/37941347 http://dx.doi.org/10.1177/17534666231208632 Text en © The Author(s), 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Wei, Ting-Ting Zhang, Jia-Feng Cheng, Zhuo Jiang, Lei Li, Jiang-Yan Zhou, Lin Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title | Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title_full | Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title_fullStr | Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title_full_unstemmed | Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title_short | Development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
title_sort | development and validation of a machine learning model for differential diagnosis of malignant pleural effusion using routine laboratory data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10637149/ https://www.ncbi.nlm.nih.gov/pubmed/37941347 http://dx.doi.org/10.1177/17534666231208632 |
work_keys_str_mv | AT weitingting developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata AT zhangjiafeng developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata AT chengzhuo developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata AT jianglei developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata AT lijiangyan developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata AT zhoulin developmentandvalidationofamachinelearningmodelfordifferentialdiagnosisofmalignantpleuraleffusionusingroutinelaboratorydata |