Cargando…

Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival

SIMPLE SUMMARY: Previous survival-prediction studies have had several limitations, such as a lack of comprehensive clinical data types, testing only in limited machine-learning algorithms, or a lack of a sufficient external testing set. This lung-cancer-survival-prediction model is based on multiple...

Descripción completa

Detalles Bibliográficos
Autores principales: Hsu, Jason C., Nguyen, Phung-Anh, Phuc, Phan Thanh, Lo, Tsai-Chih, Hsu, Min-Huei, Hsieh, Min-Shu, Le, Nguyen Quoc Khanh, Cheng, Chi-Tsun, Chang, Tzu-Hao, Chen, Cheng-Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9688689/
https://www.ncbi.nlm.nih.gov/pubmed/36428655
http://dx.doi.org/10.3390/cancers14225562
_version_ 1784836332850249728
author Hsu, Jason C.
Nguyen, Phung-Anh
Phuc, Phan Thanh
Lo, Tsai-Chih
Hsu, Min-Huei
Hsieh, Min-Shu
Le, Nguyen Quoc Khanh
Cheng, Chi-Tsun
Chang, Tzu-Hao
Chen, Cheng-Yu
author_facet Hsu, Jason C.
Nguyen, Phung-Anh
Phuc, Phan Thanh
Lo, Tsai-Chih
Hsu, Min-Huei
Hsieh, Min-Shu
Le, Nguyen Quoc Khanh
Cheng, Chi-Tsun
Chang, Tzu-Hao
Chen, Cheng-Yu
author_sort Hsu, Jason C.
collection PubMed
description SIMPLE SUMMARY: Previous survival-prediction studies have had several limitations, such as a lack of comprehensive clinical data types, testing only in limited machine-learning algorithms, or a lack of a sufficient external testing set. This lung-cancer-survival-prediction model is based on multiple data types, multiple novel machine-learning algorithms, and external testing. This predicted model demonstrated a higher performance (ANN, AUC, 0.89; accuracy, 0.82; precision, 0.91) than previous similar studies. ABSTRACT: A well-established lung-cancer-survival-prediction model that relies on multiple data types, multiple novel machine-learning algorithms, and external testing is absent in the literature. This study aims to address this gap and determine the critical factors of lung cancer survival. We selected non-small-cell lung cancer patients from a retrospective dataset of the Taipei Medical University Clinical Research Database and Taiwan Cancer Registry between January 2008 and December 2018. All patients were monitored from the index date of cancer diagnosis until the event of death. Variables, including demographics, comorbidities, medications, laboratories, and patient gene tests, were used. Nine machine-learning algorithms with various modes were used. The performance of the algorithms was measured by the area under the receiver operating characteristic curve (AUC). In total, 3714 patients were included. The best performance of the artificial neural network (ANN) model was achieved when integrating all variables with the AUC, accuracy, precision, recall, and F1-score of 0.89, 0.82, 0.91, 0.75, and 0.65, respectively. The most important features were cancer stage, cancer size, age of diagnosis, smoking, drinking status, EGFR gene, and body mass index. Overall, the ANN model improved predictive performance when integrating different data types.
format Online
Article
Text
id pubmed-9688689
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96886892022-11-25 Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival Hsu, Jason C. Nguyen, Phung-Anh Phuc, Phan Thanh Lo, Tsai-Chih Hsu, Min-Huei Hsieh, Min-Shu Le, Nguyen Quoc Khanh Cheng, Chi-Tsun Chang, Tzu-Hao Chen, Cheng-Yu Cancers (Basel) Article SIMPLE SUMMARY: Previous survival-prediction studies have had several limitations, such as a lack of comprehensive clinical data types, testing only in limited machine-learning algorithms, or a lack of a sufficient external testing set. This lung-cancer-survival-prediction model is based on multiple data types, multiple novel machine-learning algorithms, and external testing. This predicted model demonstrated a higher performance (ANN, AUC, 0.89; accuracy, 0.82; precision, 0.91) than previous similar studies. ABSTRACT: A well-established lung-cancer-survival-prediction model that relies on multiple data types, multiple novel machine-learning algorithms, and external testing is absent in the literature. This study aims to address this gap and determine the critical factors of lung cancer survival. We selected non-small-cell lung cancer patients from a retrospective dataset of the Taipei Medical University Clinical Research Database and Taiwan Cancer Registry between January 2008 and December 2018. All patients were monitored from the index date of cancer diagnosis until the event of death. Variables, including demographics, comorbidities, medications, laboratories, and patient gene tests, were used. Nine machine-learning algorithms with various modes were used. The performance of the algorithms was measured by the area under the receiver operating characteristic curve (AUC). In total, 3714 patients were included. The best performance of the artificial neural network (ANN) model was achieved when integrating all variables with the AUC, accuracy, precision, recall, and F1-score of 0.89, 0.82, 0.91, 0.75, and 0.65, respectively. The most important features were cancer stage, cancer size, age of diagnosis, smoking, drinking status, EGFR gene, and body mass index. Overall, the ANN model improved predictive performance when integrating different data types. MDPI 2022-11-12 /pmc/articles/PMC9688689/ /pubmed/36428655 http://dx.doi.org/10.3390/cancers14225562 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hsu, Jason C.
Nguyen, Phung-Anh
Phuc, Phan Thanh
Lo, Tsai-Chih
Hsu, Min-Huei
Hsieh, Min-Shu
Le, Nguyen Quoc Khanh
Cheng, Chi-Tsun
Chang, Tzu-Hao
Chen, Cheng-Yu
Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title_full Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title_fullStr Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title_full_unstemmed Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title_short Development and Validation of Novel Deep-Learning Models Using Multiple Data Types for Lung Cancer Survival
title_sort development and validation of novel deep-learning models using multiple data types for lung cancer survival
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9688689/
https://www.ncbi.nlm.nih.gov/pubmed/36428655
http://dx.doi.org/10.3390/cancers14225562
work_keys_str_mv AT hsujasonc developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT nguyenphunganh developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT phucphanthanh developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT lotsaichih developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT hsuminhuei developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT hsiehminshu developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT lenguyenquockhanh developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT chengchitsun developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT changtzuhao developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival
AT chenchengyu developmentandvalidationofnoveldeeplearningmodelsusingmultipledatatypesforlungcancersurvival