Cargando…
Integrative network analysis of TCGA data for ovarian cancer
BACKGROUND: Over the past years, tremendous efforts have been made to elucidate the molecular basis of the initiation and progression of ovarian cancer. However, most existing studies have been focused on individual genes or a single type of data, which may lack the power to detect the complex mecha...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4331442/ https://www.ncbi.nlm.nih.gov/pubmed/25551281 http://dx.doi.org/10.1186/s12918-014-0136-9 |
_version_ | 1782357716215267328 |
---|---|
author | Zhang, Qingyang Burdette, Joanna E Wang, Ji-Ping |
author_facet | Zhang, Qingyang Burdette, Joanna E Wang, Ji-Ping |
author_sort | Zhang, Qingyang |
collection | PubMed |
description | BACKGROUND: Over the past years, tremendous efforts have been made to elucidate the molecular basis of the initiation and progression of ovarian cancer. However, most existing studies have been focused on individual genes or a single type of data, which may lack the power to detect the complex mechanisms of cancer formation by overlooking the interactions of different genetic and epigenetic factors. RESULTS: We propose an integrative framework to identify genetic and epigenetic features related to ovarian cancer and to quantify the causal relationships among these features using a probabilistic graphical model based on the Cancer Genome Atlas (TCGA) data. In the feature selection, we first defined a set of seed genes by including 48 candidate tumor suppressors or oncogenes and an additional 20 ovarian cancer related genes reported in the literature. The seed genes were then fed into a stepwise correlation-based selector to identify 271 additional features including 177 genes, 82 copy number variation sites, 11 methylation sites and 1 somatic mutation (at gene TP53). We built a Bayesian network model with a logit link function to quantify the causal relationships among these features and discovered a set of 13 hub genes including ARID1A, C19orf53, CSKN2A1 and COL5A2. The directed graph revealed many potential genetic pathways, some of which confirmed the existing results in the literature. Clustering analysis further suggested four gene clusters, three of which correspond to well-defined cellular processes including cell division, tumor invasion and mitochondrial system. In addition, two genes related to glycoprotein synthesis, PSG11 and GALNT10, were found highly predictive for the overall survival time of ovarian cancer patients. CONCLUSIONS: The proposed framework is effective in identifying possible important genetic and epigenetic features that are related to complex cancer diseases. The constructed Bayesian network has identified some new genetic/epigenetic pathways, which may shed new light into the molecular mechanisms of ovarian cancer. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-014-0136-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4331442 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43314422015-02-19 Integrative network analysis of TCGA data for ovarian cancer Zhang, Qingyang Burdette, Joanna E Wang, Ji-Ping BMC Syst Biol Methodology Article BACKGROUND: Over the past years, tremendous efforts have been made to elucidate the molecular basis of the initiation and progression of ovarian cancer. However, most existing studies have been focused on individual genes or a single type of data, which may lack the power to detect the complex mechanisms of cancer formation by overlooking the interactions of different genetic and epigenetic factors. RESULTS: We propose an integrative framework to identify genetic and epigenetic features related to ovarian cancer and to quantify the causal relationships among these features using a probabilistic graphical model based on the Cancer Genome Atlas (TCGA) data. In the feature selection, we first defined a set of seed genes by including 48 candidate tumor suppressors or oncogenes and an additional 20 ovarian cancer related genes reported in the literature. The seed genes were then fed into a stepwise correlation-based selector to identify 271 additional features including 177 genes, 82 copy number variation sites, 11 methylation sites and 1 somatic mutation (at gene TP53). We built a Bayesian network model with a logit link function to quantify the causal relationships among these features and discovered a set of 13 hub genes including ARID1A, C19orf53, CSKN2A1 and COL5A2. The directed graph revealed many potential genetic pathways, some of which confirmed the existing results in the literature. Clustering analysis further suggested four gene clusters, three of which correspond to well-defined cellular processes including cell division, tumor invasion and mitochondrial system. In addition, two genes related to glycoprotein synthesis, PSG11 and GALNT10, were found highly predictive for the overall survival time of ovarian cancer patients. CONCLUSIONS: The proposed framework is effective in identifying possible important genetic and epigenetic features that are related to complex cancer diseases. The constructed Bayesian network has identified some new genetic/epigenetic pathways, which may shed new light into the molecular mechanisms of ovarian cancer. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-014-0136-9) contains supplementary material, which is available to authorized users. BioMed Central 2014-12-31 /pmc/articles/PMC4331442/ /pubmed/25551281 http://dx.doi.org/10.1186/s12918-014-0136-9 Text en © Zhang et al.; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Zhang, Qingyang Burdette, Joanna E Wang, Ji-Ping Integrative network analysis of TCGA data for ovarian cancer |
title | Integrative network analysis of TCGA data for ovarian cancer |
title_full | Integrative network analysis of TCGA data for ovarian cancer |
title_fullStr | Integrative network analysis of TCGA data for ovarian cancer |
title_full_unstemmed | Integrative network analysis of TCGA data for ovarian cancer |
title_short | Integrative network analysis of TCGA data for ovarian cancer |
title_sort | integrative network analysis of tcga data for ovarian cancer |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4331442/ https://www.ncbi.nlm.nih.gov/pubmed/25551281 http://dx.doi.org/10.1186/s12918-014-0136-9 |
work_keys_str_mv | AT zhangqingyang integrativenetworkanalysisoftcgadataforovariancancer AT burdettejoannae integrativenetworkanalysisoftcgadataforovariancancer AT wangjiping integrativenetworkanalysisoftcgadataforovariancancer |