Cargando…
Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients
BACKGROUND: Recently, extensive cancer genomic studies have revealed mutational and clinical data of large cohorts of cancer patients. For example, the Pan-Lung Cancer 2016 dataset (part of The Cancer Genome Atlas project), summarises the mutational and clinical profiles of different subtypes of Lun...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9389677/ https://www.ncbi.nlm.nih.gov/pubmed/35982500 http://dx.doi.org/10.1186/s12967-022-03572-8 |
_version_ | 1784770509909524480 |
---|---|
author | Oróstica, Karen Y. Saez-Hidalgo, Juan de Santiago, Pamela R. Rivas, Solange Contreras, Sebastian Navarro, Gonzalo Asenjo, Juan A. Olivera-Nappa, Álvaro Armisén, Ricardo |
author_facet | Oróstica, Karen Y. Saez-Hidalgo, Juan de Santiago, Pamela R. Rivas, Solange Contreras, Sebastian Navarro, Gonzalo Asenjo, Juan A. Olivera-Nappa, Álvaro Armisén, Ricardo |
author_sort | Oróstica, Karen Y. |
collection | PubMed |
description | BACKGROUND: Recently, extensive cancer genomic studies have revealed mutational and clinical data of large cohorts of cancer patients. For example, the Pan-Lung Cancer 2016 dataset (part of The Cancer Genome Atlas project), summarises the mutational and clinical profiles of different subtypes of Lung Cancer (LC). Mutational and clinical signatures have been used independently for tumour typification and prediction of metastasis in LC patients. Is it then possible to achieve better typifications and predictions when combining both data streams? METHODS: In a cohort of 1144 Lung Adenocarcinoma (LUAD) and Lung Squamous Cell Carcinoma (LSCC) patients, we studied the number of missense mutations (hereafter, the Total Mutational Load TML) and distribution of clinical variables, for different classes of patients. Using the TML and different sets of clinical variables (tumour stage, age, sex, smoking status, and packs of cigarettes smoked per year), we built Random Forest classification models that calculate the likelihood of developing metastasis. RESULTS: We found that LC patients different in age, smoking status, and tumour type had significantly different mean TMLs. Although TML was an informative feature, its effect was secondary to the "tumour stage" feature. However, its contribution to the classification is not redundant with the latter; models trained using both TML and tumour stage performed better than models trained using only one of these variables. We found that models trained in the entire dataset (i.e., without using dimensionality reduction techniques) and without resampling achieved the highest performance, with an F1 score of 0.64 (95%CrI [0.62, 0.66]). CONCLUSIONS: Clinical variables and TML should be considered together when assessing the likelihood of LC patients progressing to metastatic states, as the information these encode is not redundant. Altogether, we provide new evidence of the need for comprehensive diagnostic tools for metastasis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-022-03572-8. |
format | Online Article Text |
id | pubmed-9389677 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-93896772022-08-20 Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients Oróstica, Karen Y. Saez-Hidalgo, Juan de Santiago, Pamela R. Rivas, Solange Contreras, Sebastian Navarro, Gonzalo Asenjo, Juan A. Olivera-Nappa, Álvaro Armisén, Ricardo J Transl Med Research BACKGROUND: Recently, extensive cancer genomic studies have revealed mutational and clinical data of large cohorts of cancer patients. For example, the Pan-Lung Cancer 2016 dataset (part of The Cancer Genome Atlas project), summarises the mutational and clinical profiles of different subtypes of Lung Cancer (LC). Mutational and clinical signatures have been used independently for tumour typification and prediction of metastasis in LC patients. Is it then possible to achieve better typifications and predictions when combining both data streams? METHODS: In a cohort of 1144 Lung Adenocarcinoma (LUAD) and Lung Squamous Cell Carcinoma (LSCC) patients, we studied the number of missense mutations (hereafter, the Total Mutational Load TML) and distribution of clinical variables, for different classes of patients. Using the TML and different sets of clinical variables (tumour stage, age, sex, smoking status, and packs of cigarettes smoked per year), we built Random Forest classification models that calculate the likelihood of developing metastasis. RESULTS: We found that LC patients different in age, smoking status, and tumour type had significantly different mean TMLs. Although TML was an informative feature, its effect was secondary to the "tumour stage" feature. However, its contribution to the classification is not redundant with the latter; models trained using both TML and tumour stage performed better than models trained using only one of these variables. We found that models trained in the entire dataset (i.e., without using dimensionality reduction techniques) and without resampling achieved the highest performance, with an F1 score of 0.64 (95%CrI [0.62, 0.66]). CONCLUSIONS: Clinical variables and TML should be considered together when assessing the likelihood of LC patients progressing to metastatic states, as the information these encode is not redundant. Altogether, we provide new evidence of the need for comprehensive diagnostic tools for metastasis. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12967-022-03572-8. BioMed Central 2022-08-18 /pmc/articles/PMC9389677/ /pubmed/35982500 http://dx.doi.org/10.1186/s12967-022-03572-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Oróstica, Karen Y. Saez-Hidalgo, Juan de Santiago, Pamela R. Rivas, Solange Contreras, Sebastian Navarro, Gonzalo Asenjo, Juan A. Olivera-Nappa, Álvaro Armisén, Ricardo Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title | Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title_full | Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title_fullStr | Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title_full_unstemmed | Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title_short | Total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
title_sort | total mutational load and clinical features as predictors of the metastatic status in lung adenocarcinoma and squamous cell carcinoma patients |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9389677/ https://www.ncbi.nlm.nih.gov/pubmed/35982500 http://dx.doi.org/10.1186/s12967-022-03572-8 |
work_keys_str_mv | AT orosticakareny totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT saezhidalgojuan totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT desantiagopamelar totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT rivassolange totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT contrerassebastian totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT navarrogonzalo totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT asenjojuana totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT oliveranappaalvaro totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients AT armisenricardo totalmutationalloadandclinicalfeaturesaspredictorsofthemetastaticstatusinlungadenocarcinomaandsquamouscellcarcinomapatients |