Cargando…

Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features

Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Daniel, Kim, Daniel Y, Chen, Peter, Yu, Patrick, Ho, Sophia, Cheng, Stephanie W, Zhao, Cindy, Guo, Jimmy A, Li, Yun R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330450/
https://www.ncbi.nlm.nih.gov/pubmed/34376966
http://dx.doi.org/10.1177/11769351211035137
_version_ 1783732718934163456
author Zhao, Daniel
Kim, Daniel Y
Chen, Peter
Yu, Patrick
Ho, Sophia
Cheng, Stephanie W
Zhao, Cindy
Guo, Jimmy A
Li, Yun R
author_facet Zhao, Daniel
Kim, Daniel Y
Chen, Peter
Yu, Patrick
Ho, Sophia
Cheng, Stephanie W
Zhao, Cindy
Guo, Jimmy A
Li, Yun R
author_sort Zhao, Daniel
collection PubMed
description Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor samples across 16 cancer types from The Cancer Genome Atlas and generated distinct survival classifiers for each using clinical and histopathological data accessible to standard oncology workflows. For cancers that had poor model performance, we deployed a random-forest-embedded sequential forward selection approach that began with an initial subset of the 15 most predictive clinicopathological features before sequentially appending the next most informative gene as an additional feature. With classifiers derived from clinical and histopathological features alone, we observed cancer-type-dependent model performance and an area under the receiver operating curve (AUROC) range of 0.65 to 0.91 across all 16 cancer types for 1- and 3-year survival prediction, with some classifiers consistently outperforming those for others. As such, for cancers that had poor model performance, we posited that the addition of more complex biomolecular features could enhance our ability to prognose patients where clinicopathological features were insufficient. With the inclusion of gene expression data, model performance for 3 select cancers (glioblastoma, stomach/gastric adenocarcinoma, ovarian serous carcinoma) markedly increased from initial AUROC scores of 0.66, 0.69, and 0.67 to 0.76, 0.77, and 0.77, respectively. As a whole, this study provides a thorough examination of the relative contributions of clinical, pathological, and gene expression data in predicting overall survival and reveals cancer types for which clinical features are already strong predictors and those where additional biomolecular information is needed.
format Online
Article
Text
id pubmed-8330450
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-83304502021-08-09 Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features Zhao, Daniel Kim, Daniel Y Chen, Peter Yu, Patrick Ho, Sophia Cheng, Stephanie W Zhao, Cindy Guo, Jimmy A Li, Yun R Cancer Inform Original Research Prognostication for patients with cancer is important for clinical planning and management, but remains challenging given the large number of factors that can influence outcomes. As such, there is a need to identify features that can robustly predict patient outcomes. We evaluated 8608 patient tumor samples across 16 cancer types from The Cancer Genome Atlas and generated distinct survival classifiers for each using clinical and histopathological data accessible to standard oncology workflows. For cancers that had poor model performance, we deployed a random-forest-embedded sequential forward selection approach that began with an initial subset of the 15 most predictive clinicopathological features before sequentially appending the next most informative gene as an additional feature. With classifiers derived from clinical and histopathological features alone, we observed cancer-type-dependent model performance and an area under the receiver operating curve (AUROC) range of 0.65 to 0.91 across all 16 cancer types for 1- and 3-year survival prediction, with some classifiers consistently outperforming those for others. As such, for cancers that had poor model performance, we posited that the addition of more complex biomolecular features could enhance our ability to prognose patients where clinicopathological features were insufficient. With the inclusion of gene expression data, model performance for 3 select cancers (glioblastoma, stomach/gastric adenocarcinoma, ovarian serous carcinoma) markedly increased from initial AUROC scores of 0.66, 0.69, and 0.67 to 0.76, 0.77, and 0.77, respectively. As a whole, this study provides a thorough examination of the relative contributions of clinical, pathological, and gene expression data in predicting overall survival and reveals cancer types for which clinical features are already strong predictors and those where additional biomolecular information is needed. SAGE Publications 2021-07-28 /pmc/articles/PMC8330450/ /pubmed/34376966 http://dx.doi.org/10.1177/11769351211035137 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Zhao, Daniel
Kim, Daniel Y
Chen, Peter
Yu, Patrick
Ho, Sophia
Cheng, Stephanie W
Zhao, Cindy
Guo, Jimmy A
Li, Yun R
Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title_full Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title_fullStr Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title_full_unstemmed Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title_short Pan-Cancer Survival Classification With Clinicopathological and Targeted Gene Expression Features
title_sort pan-cancer survival classification with clinicopathological and targeted gene expression features
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330450/
https://www.ncbi.nlm.nih.gov/pubmed/34376966
http://dx.doi.org/10.1177/11769351211035137
work_keys_str_mv AT zhaodaniel pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT kimdaniely pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT chenpeter pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT yupatrick pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT hosophia pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT chengstephaniew pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT zhaocindy pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT guojimmya pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures
AT liyunr pancancersurvivalclassificationwithclinicopathologicalandtargetedgeneexpressionfeatures