Cargando…

Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath

OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jing, Zhang, Yuwei, Chen, Qing, Pan, Zhenhua, Chen, Jun, Sun, Meixiu, Wang, Junfeng, Li, Yingxin, Ye, Qing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/
https://www.ncbi.nlm.nih.gov/pubmed/36203414
http://dx.doi.org/10.3389/fonc.2022.975563
_version_ 1784801868900204544
author Li, Jing
Zhang, Yuwei
Chen, Qing
Pan, Zhenhua
Chen, Jun
Sun, Meixiu
Wang, Junfeng
Li, Yingxin
Ye, Qing
author_facet Li, Jing
Zhang, Yuwei
Chen, Qing
Pan, Zhenhua
Chen, Jun
Sun, Meixiu
Wang, Junfeng
Li, Yingxin
Ye, Qing
author_sort Li, Jing
collection PubMed
description OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting.
format Online
Article
Text
id pubmed-9531270
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95312702022-10-05 Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing Front Oncol Oncology OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting. Frontiers Media S.A. 2022-09-20 /pmc/articles/PMC9531270/ /pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563 Text en Copyright © 2022 Li, Zhang, Chen, Pan, Chen, Sun, Wang, Li and Ye https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Li, Jing
Zhang, Yuwei
Chen, Qing
Pan, Zhenhua
Chen, Jun
Sun, Meixiu
Wang, Junfeng
Li, Yingxin
Ye, Qing
Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_full Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_fullStr Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_full_unstemmed Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_short Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_sort development and validation of a screening model for lung cancer using machine learning: a large-scale, multi-center study of biomarkers in breath
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/
https://www.ncbi.nlm.nih.gov/pubmed/36203414
http://dx.doi.org/10.3389/fonc.2022.975563
work_keys_str_mv AT lijing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT zhangyuwei developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT chenqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT panzhenhua developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT chenjun developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT sunmeixiu developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT wangjunfeng developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT liyingxin developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath
AT yeqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath