Cargando…

Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis

BACKGROUND: Breast cancer is the most common type of invasive cancer in woman. It accounts for approximately 18% of all cancer deaths worldwide. It is well known that somatic mutation plays an essential role in cancer development. Hence, we propose that a prognostic prediction model that integrates...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yifan, Yang, William, Li, Dan, Yang, Jack Y, Guan, Renchu, Yang, Mary Qu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245494/
https://www.ncbi.nlm.nih.gov/pubmed/30454048
http://dx.doi.org/10.1186/s12920-018-0419-x
_version_ 1783372253693476864
author Zhang, Yifan
Yang, William
Li, Dan
Yang, Jack Y
Guan, Renchu
Yang, Mary Qu
author_facet Zhang, Yifan
Yang, William
Li, Dan
Yang, Jack Y
Guan, Renchu
Yang, Mary Qu
author_sort Zhang, Yifan
collection PubMed
description BACKGROUND: Breast cancer is the most common type of invasive cancer in woman. It accounts for approximately 18% of all cancer deaths worldwide. It is well known that somatic mutation plays an essential role in cancer development. Hence, we propose that a prognostic prediction model that integrates somatic mutations with gene expression can improve survival prediction for cancer patients and also be able to reveal the genetic mutations associated with survival. METHOD: Differential expression analysis was used to identify breast cancer related genes. Genetic algorithm (GA) and univariate Cox regression analysis were applied to filter out survival related genes. DAVID was used for enrichment analysis on somatic mutated gene set. The performance of survival predictors were assessed by Cox regression model and concordance index(C-index). RESULTS: We investigated the genome-wide gene expression profile and somatic mutations of 1091 breast invasive carcinoma cases from The Cancer Genome Atlas (TCGA). We identified 118 genes with high hazard ratios as breast cancer survival risk gene candidates (log rank p <  0.0001 and c-index = 0.636). Multiple breast cancer survival related genes were found in this gene set, including FOXR2, FOXD1, MTNR1B and SDC1. Further genetic algorithm (GA) revealed an optimal gene set consisted of 88 genes with higher c-index (log rank p <  0.0001 and c-index = 0.656). We validated this gene set on an independent breast cancer data set and achieved a similar performance (log rank p <  0.0001 and c-index = 0.614). Moreover, we revealed 25 functional annotations, 15 gene ontology terms and 14 pathways that were significantly enriched in the genes that showed distinct mutation patterns in the different survival risk groups. These functional gene sets were used as new features for the survival prediction model. In particular, our results suggested that the Fanconi anemia pathway had an important role in breast cancer prognosis. CONCLUSIONS: Our study indicated that the expression levels of the gene signatures remain the effective indicators for breast cancer survival prediction. Combining the gene expression information with other types of features derived from somatic mutations can further improve the performance of survival prediction. The pathways that were associated with survival risk suggested by our study can be further investigated for improving cancer patient survival. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0419-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6245494
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62454942018-11-26 Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis Zhang, Yifan Yang, William Li, Dan Yang, Jack Y Guan, Renchu Yang, Mary Qu BMC Med Genomics Research BACKGROUND: Breast cancer is the most common type of invasive cancer in woman. It accounts for approximately 18% of all cancer deaths worldwide. It is well known that somatic mutation plays an essential role in cancer development. Hence, we propose that a prognostic prediction model that integrates somatic mutations with gene expression can improve survival prediction for cancer patients and also be able to reveal the genetic mutations associated with survival. METHOD: Differential expression analysis was used to identify breast cancer related genes. Genetic algorithm (GA) and univariate Cox regression analysis were applied to filter out survival related genes. DAVID was used for enrichment analysis on somatic mutated gene set. The performance of survival predictors were assessed by Cox regression model and concordance index(C-index). RESULTS: We investigated the genome-wide gene expression profile and somatic mutations of 1091 breast invasive carcinoma cases from The Cancer Genome Atlas (TCGA). We identified 118 genes with high hazard ratios as breast cancer survival risk gene candidates (log rank p <  0.0001 and c-index = 0.636). Multiple breast cancer survival related genes were found in this gene set, including FOXR2, FOXD1, MTNR1B and SDC1. Further genetic algorithm (GA) revealed an optimal gene set consisted of 88 genes with higher c-index (log rank p <  0.0001 and c-index = 0.656). We validated this gene set on an independent breast cancer data set and achieved a similar performance (log rank p <  0.0001 and c-index = 0.614). Moreover, we revealed 25 functional annotations, 15 gene ontology terms and 14 pathways that were significantly enriched in the genes that showed distinct mutation patterns in the different survival risk groups. These functional gene sets were used as new features for the survival prediction model. In particular, our results suggested that the Fanconi anemia pathway had an important role in breast cancer prognosis. CONCLUSIONS: Our study indicated that the expression levels of the gene signatures remain the effective indicators for breast cancer survival prediction. Combining the gene expression information with other types of features derived from somatic mutations can further improve the performance of survival prediction. The pathways that were associated with survival risk suggested by our study can be further investigated for improving cancer patient survival. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12920-018-0419-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-11-20 /pmc/articles/PMC6245494/ /pubmed/30454048 http://dx.doi.org/10.1186/s12920-018-0419-x Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhang, Yifan
Yang, William
Li, Dan
Yang, Jack Y
Guan, Renchu
Yang, Mary Qu
Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title_full Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title_fullStr Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title_full_unstemmed Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title_short Toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
title_sort toward the precision breast cancer survival prediction utilizing combined whole genome-wide expression and somatic mutation analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245494/
https://www.ncbi.nlm.nih.gov/pubmed/30454048
http://dx.doi.org/10.1186/s12920-018-0419-x
work_keys_str_mv AT zhangyifan towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis
AT yangwilliam towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis
AT lidan towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis
AT yangjacky towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis
AT guanrenchu towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis
AT yangmaryqu towardtheprecisionbreastcancersurvivalpredictionutilizingcombinedwholegenomewideexpressionandsomaticmutationanalysis