Cargando…

Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction

BACKGROUND: Over the past decades, approaches for diagnosing and treating cancer have seen significant improvement. However, the variability of patient and tumor characteristics has limited progress on methods for prognosis prediction. The development of high-throughput omics technologies now provid...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Xingyu, Amos, Christopher I., Frost, H. Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467202/
https://www.ncbi.nlm.nih.gov/pubmed/34563154
http://dx.doi.org/10.1186/s12885-021-08796-3
_version_ 1784573337001787392
author Zheng, Xingyu
Amos, Christopher I.
Frost, H. Robert
author_facet Zheng, Xingyu
Amos, Christopher I.
Frost, H. Robert
author_sort Zheng, Xingyu
collection PubMed
description BACKGROUND: Over the past decades, approaches for diagnosing and treating cancer have seen significant improvement. However, the variability of patient and tumor characteristics has limited progress on methods for prognosis prediction. The development of high-throughput omics technologies now provides multiple approaches for characterizing tumors. Although a large number of published studies have focused on integration of multi-omics data and use of pathway-level models for cancer prognosis prediction, there still exists a gap of knowledge regarding the prognostic landscape across multi-omics data for multiple cancer types using both gene-level and pathway-level predictors. METHODS: In this study, we systematically evaluated three often available types of omics data (gene expression, copy number variation and somatic point mutation) covering both DNA-level and RNA-level features. We evaluated the landscape of predictive performance of these three omics modalities for 33 cancer types in the TCGA using a Lasso or Group Lasso-penalized Cox model and either gene or pathway level predictors. RESULTS: We constructed the prognostic landscape using three types of omics data for 33 cancer types on both the gene and pathway levels. Based on this landscape, we found that predictive performance is cancer type dependent and we also highlighted the cancer types and omics modalities that support the most accurate prognostic models. In general, models estimated on gene expression data provide the best predictive performance on either gene or pathway level and adding copy number variation or somatic point mutation data to gene expression data does not improve predictive performance, with some exceptional cohorts including low grade glioma and thyroid cancer. In general, pathway-level models have better interpretative performance, higher stability and smaller model size across multiple cancer types and omics data types relative to gene-level models. CONCLUSIONS: Based on this landscape and comprehensively comparison, models estimated on gene expression data provide the best predictive performance on either gene or pathway level. Pathway-level models have better interpretative performance, higher stability and smaller model size relative to gene-level models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-021-08796-3.
format Online
Article
Text
id pubmed-8467202
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84672022021-09-28 Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction Zheng, Xingyu Amos, Christopher I. Frost, H. Robert BMC Cancer Research BACKGROUND: Over the past decades, approaches for diagnosing and treating cancer have seen significant improvement. However, the variability of patient and tumor characteristics has limited progress on methods for prognosis prediction. The development of high-throughput omics technologies now provides multiple approaches for characterizing tumors. Although a large number of published studies have focused on integration of multi-omics data and use of pathway-level models for cancer prognosis prediction, there still exists a gap of knowledge regarding the prognostic landscape across multi-omics data for multiple cancer types using both gene-level and pathway-level predictors. METHODS: In this study, we systematically evaluated three often available types of omics data (gene expression, copy number variation and somatic point mutation) covering both DNA-level and RNA-level features. We evaluated the landscape of predictive performance of these three omics modalities for 33 cancer types in the TCGA using a Lasso or Group Lasso-penalized Cox model and either gene or pathway level predictors. RESULTS: We constructed the prognostic landscape using three types of omics data for 33 cancer types on both the gene and pathway levels. Based on this landscape, we found that predictive performance is cancer type dependent and we also highlighted the cancer types and omics modalities that support the most accurate prognostic models. In general, models estimated on gene expression data provide the best predictive performance on either gene or pathway level and adding copy number variation or somatic point mutation data to gene expression data does not improve predictive performance, with some exceptional cohorts including low grade glioma and thyroid cancer. In general, pathway-level models have better interpretative performance, higher stability and smaller model size across multiple cancer types and omics data types relative to gene-level models. CONCLUSIONS: Based on this landscape and comprehensively comparison, models estimated on gene expression data provide the best predictive performance on either gene or pathway level. Pathway-level models have better interpretative performance, higher stability and smaller model size relative to gene-level models. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-021-08796-3. BioMed Central 2021-09-25 /pmc/articles/PMC8467202/ /pubmed/34563154 http://dx.doi.org/10.1186/s12885-021-08796-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zheng, Xingyu
Amos, Christopher I.
Frost, H. Robert
Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title_full Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title_fullStr Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title_full_unstemmed Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title_short Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
title_sort pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467202/
https://www.ncbi.nlm.nih.gov/pubmed/34563154
http://dx.doi.org/10.1186/s12885-021-08796-3
work_keys_str_mv AT zhengxingyu pancancerevaluationofgeneexpressionandsomaticalterationdataforcancerprognosisprediction
AT amoschristopheri pancancerevaluationofgeneexpressionandsomaticalterationdataforcancerprognosisprediction
AT frosthrobert pancancerevaluationofgeneexpressionandsomaticalterationdataforcancerprognosisprediction