Cargando…
Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330450/ https://www.ncbi.nlm.nih.gov/pubmed/34376966 http://dx.doi.org/10.1177/11769351211035137 |
_version_ | 1783732718934163456 |
---|---|
author | Zhao, Daniel Kim, Daniel Y Chen, Peter Yu, Patrick Ho, Sophia Cheng, Stephanie W Zhao, Cindy Guo, Jimmy A Li, Yun R |
author_facet | Zhao, Daniel Kim, Daniel Y Chen, Peter Yu, Patrick Ho, Sophia Cheng, Stephanie W Zhao, Cindy Guo, Jimmy A Li, Yun R |
author_sort | Zhao, Daniel |
collection | PubMed |
description | Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor samples across 16 cancer types from The Cancer Genome Atlas and generated distinct survival classifiers for each using clinical and histopathological data accessible to standard oncology workflows. For cancers that had poor model performance, we deployed a random-forest-embedded sequential forward selection approach that began with an initial subset of the 15 most predictive clinicopathological features before sequentially appending the next most informative gene as an additional feature. With classifiers derived from clinical and histopathological features alone, we observed cancer-type-dependent model performance and an area under the receiver operating curve (AUROC) range of 0.65 to 0.91 across all 16 cancer types for 1- and 3-year survival prediction, with some classifiers consistently outperforming those for others. As such, for cancers that had poor model performance, we posited that the addition of more complex biomolecular features could enhance our ability to prognose patients where clinicopathological features were insufficient. With the inclusion of gene expression data, model performance for 3 select cancers (glioblastoma, stomach/gastric adenocarcinoma, ovarian serous carcinoma) markedly increased from initial AUROC scores of 0.66, 0.69, and 0.67 to 0.76, 0.77, and 0.77, respectively. As a whole, this study provides a thorough examination of the relative contributions of clinical, pathological, and gene expression data in predicting overall survival and reveals cancer types for which clinical features are already strong predictors and those where additional biomolecular information is needed. |
format | Online Article Text |
id | pubmed-8330450 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-83304502021-08-09 Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features Zhao, Daniel Kim, Daniel Y Chen, Peter Yu, Patrick Ho, Sophia Cheng, Stephanie W Zhao, Cindy Guo, Jimmy A Li, Yun R Cancer Inform Original Research Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor samples across 16 cancer types from The Cancer Genome Atlas and generated distinct survival classifiers for each using clinical and histopathological data accessible to standard oncology workflows. For cancers that had poor model performance, we deployed a random-forest-embedded sequential forward selection approach that began with an initial subset of the 15 most predictive clinicopathological features before sequentially appending the next most informative gene as an additional feature. With classifiers derived from clinical and histopathological features alone, we observed cancer-type-dependent model performance and an area under the receiver operating curve (AUROC) range of 0.65 to 0.91 across all 16 cancer types for 1- and 3-year survival prediction, with some classifiers consistently outperforming those for others. As such, for cancers that had poor model performance, we posited that the addition of more complex biomolecular features could enhance our ability to prognose patients where clinicopathological features were insufficient. With the inclusion of gene expression data, model performance for 3 select cancers (glioblastoma, stomach/gastric adenocarcinoma, ovarian serous carcinoma) markedly increased from initial AUROC scores of 0.66, 0.69, and 0.67 to 0.76, 0.77, and 0.77, respectively. As a whole, this study provides a thorough examination of the relative contributions of clinical, pathological, and gene expression data in predicting overall survival and reveals cancer types for which clinical features are already strong predictors and those where additional biomolecular information is needed. SAGE Publications 2021-07-28 /pmc/articles/PMC8330450/ /pubmed/34376966 http://dx.doi.org/10.1177/11769351211035137 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Zhao, Daniel Kim, Daniel Y Chen, Peter Yu, Patrick Ho, Sophia Cheng, Stephanie W Zhao, Cindy Guo, Jimmy A Li, Yun R Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title | Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title_full | Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title_fullStr | Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title_full_unstemmed | Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title_short | Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features |
title_sort | pan-cancer survival classification with clinicopathological and targeted gene expression features |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330450/ https://www.ncbi.nlm.nih.gov/pubmed/34376966 http://dx.doi.org/10.1177/11769351211035137 |
work_keys_str_mv | AT zhaodaniel pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT kimdaniely pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT chenpeter pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT yupatrick pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT hosophia pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT chengstephaniew pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT zhaocindy pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT guojimmya pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures AT liyunr pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures |