Cargando…

Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties

Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related...

Descripción completa

Detalles Bibliográficos
Autores principales: Azad, A. K. M., Lee, Hyunju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734239/
https://www.ncbi.nlm.nih.gov/pubmed/23940583
http://dx.doi.org/10.1371/journal.pone.0070498
_version_ 1782279504123658240
author Azad, A. K. M.
Lee, Hyunju
author_facet Azad, A. K. M.
Lee, Hyunju
author_sort Azad, A. K. M.
collection PubMed
description Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment [Image: see text]-values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights.
format Online
Article
Text
id pubmed-3734239
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37342392013-08-12 Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties Azad, A. K. M. Lee, Hyunju PLoS One Research Article Recently, computational approaches integrating copy number aberrations (CNAs) and gene expression (GE) have been extensively studied to identify cancer-related genes and pathways. In this work, we integrate these two data sets with protein-protein interaction (PPI) information to find cancer-related functional modules. To integrate CNA and GE data, we first built a gene-gene relationship network from a set of seed genes by enumerating all types of pairwise correlations, e.g. GE-GE, CNA-GE, and CNA-CNA, over multiple patients. Next, we propose a voting-based cancer module identification algorithm by combining topological and data-driven properties (VToD algorithm) by using the gene-gene relationship network as a source of data-driven information, and the PPI data as topological information. We applied the VToD algorithm to 266 glioblastoma multiforme (GBM) and 96 ovarian carcinoma (OVC) samples that have both expression and copy number measurements, and identified 22 GBM modules and 23 OVC modules. Among 22 GBM modules, 15, 12, and 20 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Among 23 OVC modules, 19, 18, and 23 modules were significantly enriched with cancer-related KEGG, BioCarta pathways, and GO terms, respectively. Similarly, we also observed that 9 and 2 GBM modules and 15 and 18 OVC modules were enriched with cancer gene census (CGC) and specific cancer driver genes, respectively. Our proposed module-detection algorithm significantly outperformed other existing methods in terms of both functional and cancer gene set enrichments. Most of the cancer-related pathways from both cancer data sets found in our algorithm contained more than two types of gene-gene relationships, showing strong positive correlations between the number of different types of relationship and CGC enrichment [Image: see text]-values (0.64 for GBM and 0.49 for OVC). This study suggests that identified modules containing both expression changes and CNAs can explain cancer-related activities with greater insights. Public Library of Science 2013-08-05 /pmc/articles/PMC3734239/ /pubmed/23940583 http://dx.doi.org/10.1371/journal.pone.0070498 Text en © 2013 Azad, Lee http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Azad, A. K. M.
Lee, Hyunju
Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title_full Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title_fullStr Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title_full_unstemmed Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title_short Voting-Based Cancer Module Identification by Combining Topological and Data-Driven Properties
title_sort voting-based cancer module identification by combining topological and data-driven properties
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3734239/
https://www.ncbi.nlm.nih.gov/pubmed/23940583
http://dx.doi.org/10.1371/journal.pone.0070498
work_keys_str_mv AT azadakm votingbasedcancermoduleidentificationbycombiningtopologicalanddatadrivenproperties
AT leehyunju votingbasedcancermoduleidentificationbycombiningtopologicalanddatadrivenproperties