Cargando…
Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734239/ https://www.ncbi.nlm.nih.gov/pubmed/23940583 http://dx.doi.org/10.1371/journal.pone.0070498 |
_version_ | 1782279504123658240 |
---|---|
author | Azad, A. K. M. Lee, Hyunju |
author_facet | Azad, A. K. M. Lee, Hyunju |
author_sort | Azad, A. K. M. |
collection | PubMed |
description | Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment [Image: see text]-values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights. |
format | Online Article Text |
id | pubmed-3734239 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37342392013-08-12 Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties Azad, A. K. M. Lee, Hyunju PLoS One Research Article Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment [Image: see text]-values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights. Public Library of Science 2013-08-05 /pmc/articles/PMC3734239/ /pubmed/23940583 http://dx.doi.org/10.1371/journal.pone.0070498 Text en © 2013 Azad, Lee http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Azad, A. K. M. Lee, Hyunju Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title | Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title_full | Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title_fullStr | Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title_full_unstemmed | Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title_short | Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties |
title_sort | voting-based cancer module identification by combining topological and data-driven properties |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734239/ https://www.ncbi.nlm.nih.gov/pubmed/23940583 http://dx.doi.org/10.1371/journal.pone.0070498 |
work_keys_str_mv | AT azadakm votingbasedcancermoduleidentificationbycombiningtopologicalanddatadrivenproperties AT leehyunju votingbasedcancermoduleidentificationbycombiningtopologicalanddatadrivenproperties |