Cargando…

Network based stratification of major cancers by integrating somatic mutation and gene expression data

The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constru...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Zongzhen, Zhang, Junying, Yuan, Xiguo, Liu, Zhaowen, Liu, Baobao, Tuo, Shouheng, Liu, Yajun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433734/
https://www.ncbi.nlm.nih.gov/pubmed/28520777
http://dx.doi.org/10.1371/journal.pone.0177662
_version_ 1783236911055241216
author He, Zongzhen
Zhang, Junying
Yuan, Xiguo
Liu, Zhaowen
Liu, Baobao
Tuo, Shouheng
Liu, Yajun
author_facet He, Zongzhen
Zhang, Junying
Yuan, Xiguo
Liu, Zhaowen
Liu, Baobao
Tuo, Shouheng
Liu, Yajun
author_sort He, Zongzhen
collection PubMed
description The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constructed cancer-type-specific significant co-expression networks (SCNs) rather than using a fixed gene network across all cancers, such as the network-based stratification (NBS) method, which ignores cancer heterogeneity. For each type of cancer, the gene expression data were used to construct the SCN network, while the gene somatic mutation data were mapped onto the network, propagated, and used for further clustering. For the clustering, we adopted an improved network-regularized non-negative matrix factorization (netNMF) (netNMF_HC) for a more precise classification. We applied our method to various datasets, including ovarian cancer (OV), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC) cohorts derived from the TCGA (The Cancer Genome Atlas) project. Based on the results, we evaluated the performance of our method to identify survival-relevant subtypes and further compared it to the NBS method, which adopts priori networks and netNMF algorithm. The proposed algorithm outperformed the NBS method in identifying informative cancer subtypes that were significantly associated with clinical outcomes in most cancer types we studied. In particular, our method identified survival-associated UCEC subtypes that were not identified by the NBS method. Our analysis indicated valid subtyping of patient could be applied by mutation data with cancer-type-specific SCNs and netNMF_HC for individual cancers because of specific cancer co-expression patterns and more precise clustering.
format Online
Article
Text
id pubmed-5433734
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54337342017-05-26 Network based stratification of major cancers by integrating somatic mutation and gene expression data He, Zongzhen Zhang, Junying Yuan, Xiguo Liu, Zhaowen Liu, Baobao Tuo, Shouheng Liu, Yajun PLoS One Research Article The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constructed cancer-type-specific significant co-expression networks (SCNs) rather than using a fixed gene network across all cancers, such as the network-based stratification (NBS) method, which ignores cancer heterogeneity. For each type of cancer, the gene expression data were used to construct the SCN network, while the gene somatic mutation data were mapped onto the network, propagated, and used for further clustering. For the clustering, we adopted an improved network-regularized non-negative matrix factorization (netNMF) (netNMF_HC) for a more precise classification. We applied our method to various datasets, including ovarian cancer (OV), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC) cohorts derived from the TCGA (The Cancer Genome Atlas) project. Based on the results, we evaluated the performance of our method to identify survival-relevant subtypes and further compared it to the NBS method, which adopts priori networks and netNMF algorithm. The proposed algorithm outperformed the NBS method in identifying informative cancer subtypes that were significantly associated with clinical outcomes in most cancer types we studied. In particular, our method identified survival-associated UCEC subtypes that were not identified by the NBS method. Our analysis indicated valid subtyping of patient could be applied by mutation data with cancer-type-specific SCNs and netNMF_HC for individual cancers because of specific cancer co-expression patterns and more precise clustering. Public Library of Science 2017-05-16 /pmc/articles/PMC5433734/ /pubmed/28520777 http://dx.doi.org/10.1371/journal.pone.0177662 Text en © 2017 He et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
He, Zongzhen
Zhang, Junying
Yuan, Xiguo
Liu, Zhaowen
Liu, Baobao
Tuo, Shouheng
Liu, Yajun
Network based stratification of major cancers by integrating somatic mutation and gene expression data
title Network based stratification of major cancers by integrating somatic mutation and gene expression data
title_full Network based stratification of major cancers by integrating somatic mutation and gene expression data
title_fullStr Network based stratification of major cancers by integrating somatic mutation and gene expression data
title_full_unstemmed Network based stratification of major cancers by integrating somatic mutation and gene expression data
title_short Network based stratification of major cancers by integrating somatic mutation and gene expression data
title_sort network based stratification of major cancers by integrating somatic mutation and gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433734/
https://www.ncbi.nlm.nih.gov/pubmed/28520777
http://dx.doi.org/10.1371/journal.pone.0177662
work_keys_str_mv AT hezongzhen networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT zhangjunying networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT yuanxiguo networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT liuzhaowen networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT liubaobao networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT tuoshouheng networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata
AT liuyajun networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata