Cargando…

Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath

OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Jing, Zhang, Yuwei, Chen, Qing, Pan, Zhenhua, Chen, Jun, Sun, Meixiu, Wang, Junfeng, Li, Yingxin, Ye, Qing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Oncology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/ https://www.ncbi.nlm.nih.gov/pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563

_version_	1784801868900204544
author	Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing
author_facet	Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing
author_sort	Li, Jing
collection	PubMed
description	OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting.
format	Online Article Text
id	pubmed-9531270
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-95312702022-10-05 Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing Front Oncol Oncology OBJECTIVES: Lung cancer (LC) is the largest single cause of death from cancer worldwide, and the lack of effective screening methods for early detection currently results in unsatisfactory curative treatments. We herein aimed to use breath analysis, a noninvasive and very simple method, to identify and validate biomarkers in breath for the screening of lung cancer. MATERIALS AND METHODS: We enrolled a total of 2308 participants from two centers for online breath analyses using proton transfer reaction time-of-flight mass spectrometry (PTR-TOF-MS). The derivation cohort included 1007 patients with primary LC and 1036 healthy controls, and the external validation cohort included 158 LC patients and 107 healthy controls. We used eXtreme Gradient Boosting (XGBoost) to create a panel of predictive features and derived a prediction model to identify LC. The optimal number of features was determined by the greatest area under the receiver‐operating characteristic (ROC) curve (AUC). RESULTS: Six features were defined as a breath-biomarkers panel for the detection of LC. In the training dataset, the model had an AUC of 0.963 (95% CI, 0.941–0.982), and a sensitivity of 87.1% and specificity of 93.5% at a positivity threshold of 0.5. Our model was tested on the independent validation dataset and achieved an AUC of 0.771 (0.718–0.823), and sensitivity of 67.7% and specificity of 73.0%. CONCLUSION: Our results suggested that breath analysis may serve as a valid method in screening lung cancer in a borderline population prior to hospital visits. Although our breath-biomarker panel is noninvasive, quick, and simple to use, it will require further calibration and validation in a prospective study within a primary care setting. Frontiers Media S.A. 2022-09-20 /pmc/articles/PMC9531270/ /pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563 Text en Copyright © 2022 Li, Zhang, Chen, Pan, Chen, Sun, Wang, Li and Ye https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Oncology Li, Jing Zhang, Yuwei Chen, Qing Pan, Zhenhua Chen, Jun Sun, Meixiu Wang, Junfeng Li, Yingxin Ye, Qing Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title	Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_full	Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_fullStr	Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_full_unstemmed	Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_short	Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath
title_sort	development and validation of a screening model for lung cancer using machine learning: a large-scale, multi-center study of biomarkers in breath
topic	Oncology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9531270/ https://www.ncbi.nlm.nih.gov/pubmed/36203414 http://dx.doi.org/10.3389/fonc.2022.975563
work_keys_str_mv	AT lijing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT zhangyuwei developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT chenqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT panzhenhua developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT chenjun developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT sunmeixiu developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT wangjunfeng developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT liyingxin developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath AT yeqing developmentandvalidationofascreeningmodelforlungcancerusingmachinelearningalargescalemulticenterstudyofbiomarkersinbreath

Development and validation of a screening model for lung cancer using machine learning: A large-scale, multi-center study of biomarkers in breath

Ejemplares similares