Cargando…
Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/ https://www.ncbi.nlm.nih.gov/pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563 |
_version_ | 1784801868900204544 |
---|---|
author | Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing |
author_facet | Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing |
author_sort | Li, Jing |
collection | PubMed |
description | OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting. |
format | Online Article Text |
id | pubmed-9531270 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95312702022-10-05 Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing Front Oncol Oncology OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting. Frontiers Media S.A. 2022-09-20 /pmc/articles/PMC9531270/ /pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563 Text en Copyright © 2022 Li, Zhang, Chen, Pan, Chen, Sun, Wang, Li and Ye https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title | Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title_full | Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title_fullStr | Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title_full_unstemmed | Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title_short | Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath |
title_sort | development and validation of a screening model for lung cancer using machine learning: a large-scale, multi-center study of biomarkers in breath |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/ https://www.ncbi.nlm.nih.gov/pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563 |
work_keys_str_mv | AT lijing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT zhangyuwei developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT chenqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT panzhenhua developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT chenjun developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT sunmeixiu developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT wangjunfeng developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT liyingxin developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT yeqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath |