Cargando…
Network based stratification of major cancers by integrating somatic mutation and gene expression data
The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constru...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433734/ https://www.ncbi.nlm.nih.gov/pubmed/28520777 http://dx.doi.org/10.1371/journal.pone.0177662 |
_version_ | 1783236911055241216 |
---|---|
author | He, Zongzhen Zhang, Junying Yuan, Xiguo Liu, Zhaowen Liu, Baobao Tuo, Shouheng Liu, Yajun |
author_facet | He, Zongzhen Zhang, Junying Yuan, Xiguo Liu, Zhaowen Liu, Baobao Tuo, Shouheng Liu, Yajun |
author_sort | He, Zongzhen |
collection | PubMed |
description | The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constructed cancer-type-specific significant co-expression networks (SCNs) rather than using a fixed gene network across all cancers, such as the network-based stratification (NBS) method, which ignores cancer heterogeneity. For each type of cancer, the gene expression data were used to construct the SCN network, while the gene somatic mutation data were mapped onto the network, propagated, and used for further clustering. For the clustering, we adopted an improved network-regularized non-negative matrix factorization (netNMF) (netNMF_HC) for a more precise classification. We applied our method to various datasets, including ovarian cancer (OV), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC) cohorts derived from the TCGA (The Cancer Genome Atlas) project. Based on the results, we evaluated the performance of our method to identify survival-relevant subtypes and further compared it to the NBS method, which adopts priori networks and netNMF algorithm. The proposed algorithm outperformed the NBS method in identifying informative cancer subtypes that were significantly associated with clinical outcomes in most cancer types we studied. In particular, our method identified survival-associated UCEC subtypes that were not identified by the NBS method. Our analysis indicated valid subtyping of patient could be applied by mutation data with cancer-type-specific SCNs and netNMF_HC for individual cancers because of specific cancer co-expression patterns and more precise clustering. |
format | Online Article Text |
id | pubmed-5433734 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-54337342017-05-26 Network based stratification of major cancers by integrating somatic mutation and gene expression data He, Zongzhen Zhang, Junying Yuan, Xiguo Liu, Zhaowen Liu, Baobao Tuo, Shouheng Liu, Yajun PLoS One Research Article The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constructed cancer-type-specific significant co-expression networks (SCNs) rather than using a fixed gene network across all cancers, such as the network-based stratification (NBS) method, which ignores cancer heterogeneity. For each type of cancer, the gene expression data were used to construct the SCN network, while the gene somatic mutation data were mapped onto the network, propagated, and used for further clustering. For the clustering, we adopted an improved network-regularized non-negative matrix factorization (netNMF) (netNMF_HC) for a more precise classification. We applied our method to various datasets, including ovarian cancer (OV), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC) cohorts derived from the TCGA (The Cancer Genome Atlas) project. Based on the results, we evaluated the performance of our method to identify survival-relevant subtypes and further compared it to the NBS method, which adopts priori networks and netNMF algorithm. The proposed algorithm outperformed the NBS method in identifying informative cancer subtypes that were significantly associated with clinical outcomes in most cancer types we studied. In particular, our method identified survival-associated UCEC subtypes that were not identified by the NBS method. Our analysis indicated valid subtyping of patient could be applied by mutation data with cancer-type-specific SCNs and netNMF_HC for individual cancers because of specific cancer co-expression patterns and more precise clustering. Public Library of Science 2017-05-16 /pmc/articles/PMC5433734/ /pubmed/28520777 http://dx.doi.org/10.1371/journal.pone.0177662 Text en © 2017 He et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article He, Zongzhen Zhang, Junying Yuan, Xiguo Liu, Zhaowen Liu, Baobao Tuo, Shouheng Liu, Yajun Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title | Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title_full | Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title_fullStr | Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title_full_unstemmed | Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title_short | Network based stratification of major cancers by integrating somatic mutation and gene expression data |
title_sort | network based stratification of major cancers by integrating somatic mutation and gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5433734/ https://www.ncbi.nlm.nih.gov/pubmed/28520777 http://dx.doi.org/10.1371/journal.pone.0177662 |
work_keys_str_mv | AT hezongzhen networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT zhangjunying networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT yuanxiguo networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT liuzhaowen networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT liubaobao networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT tuoshouheng networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata AT liuyajun networkbasedstratificationofmajorcancersbyintegratingsomaticmutationandgeneexpressiondata |