Cargando…

Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes

BACKGROUND: Biological pathways are important for understanding biological mechanisms. Thus, finding important pathways that underlie biological problems helps researchers to focus on the most relevant sets of genes. Pathways resemble networks with complicated structures, but most of the existing pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Gu, Zuguang, Liu, Jialin, Cao, Kunming, Zhang, Junfeng, Wang, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3443660/
https://www.ncbi.nlm.nih.gov/pubmed/22672776
http://dx.doi.org/10.1186/1752-0509-6-56
_version_ 1782243590255149056
author Gu, Zuguang
Liu, Jialin
Cao, Kunming
Zhang, Junfeng
Wang, Jin
author_facet Gu, Zuguang
Liu, Jialin
Cao, Kunming
Zhang, Junfeng
Wang, Jin
author_sort Gu, Zuguang
collection PubMed
description BACKGROUND: Biological pathways are important for understanding biological mechanisms. Thus, finding important pathways that underlie biological problems helps researchers to focus on the most relevant sets of genes. Pathways resemble networks with complicated structures, but most of the existing pathway enrichment tools ignore topological information embedded within pathways, which limits their applicability. RESULTS: A systematic and extensible pathway enrichment method in which nodes are weighted by network centrality was proposed. We demonstrate how choice of pathway structure and centrality measurement, as well as the presence of key genes, affects pathway significance. We emphasize two improvements of our method over current methods. First, allowing for the diversity of genes’ characters and the difficulty of covering gene importance from all aspects, we set centrality as an optional parameter in the model. Second, nodes rather than genes form the basic unit of pathways, such that one node can be composed of several genes and one gene may reside in different nodes. By comparing our methodology to the original enrichment method using both simulation data and real-world data, we demonstrate the efficacy of our method in finding new pathways from biological perspective. CONCLUSIONS: Our method can benefit the systematic analysis of biological pathways and help to extract more meaningful information from gene expression data. The algorithm has been implemented as an R package CePa, and also a web-based version of CePa is provided.
format Online
Article
Text
id pubmed-3443660
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34436602012-09-18 Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes Gu, Zuguang Liu, Jialin Cao, Kunming Zhang, Junfeng Wang, Jin BMC Syst Biol Methodology Article BACKGROUND: Biological pathways are important for understanding biological mechanisms. Thus, finding important pathways that underlie biological problems helps researchers to focus on the most relevant sets of genes. Pathways resemble networks with complicated structures, but most of the existing pathway enrichment tools ignore topological information embedded within pathways, which limits their applicability. RESULTS: A systematic and extensible pathway enrichment method in which nodes are weighted by network centrality was proposed. We demonstrate how choice of pathway structure and centrality measurement, as well as the presence of key genes, affects pathway significance. We emphasize two improvements of our method over current methods. First, allowing for the diversity of genes’ characters and the difficulty of covering gene importance from all aspects, we set centrality as an optional parameter in the model. Second, nodes rather than genes form the basic unit of pathways, such that one node can be composed of several genes and one gene may reside in different nodes. By comparing our methodology to the original enrichment method using both simulation data and real-world data, we demonstrate the efficacy of our method in finding new pathways from biological perspective. CONCLUSIONS: Our method can benefit the systematic analysis of biological pathways and help to extract more meaningful information from gene expression data. The algorithm has been implemented as an R package CePa, and also a web-based version of CePa is provided. BioMed Central 2012-06-06 /pmc/articles/PMC3443660/ /pubmed/22672776 http://dx.doi.org/10.1186/1752-0509-6-56 Text en Copyright ©2012 Gu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Gu, Zuguang
Liu, Jialin
Cao, Kunming
Zhang, Junfeng
Wang, Jin
Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title_full Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title_fullStr Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title_full_unstemmed Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title_short Centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
title_sort centrality-based pathway enrichment: a systematic approach for finding significant pathways dominated by key genes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3443660/
https://www.ncbi.nlm.nih.gov/pubmed/22672776
http://dx.doi.org/10.1186/1752-0509-6-56
work_keys_str_mv AT guzuguang centralitybasedpathwayenrichmentasystematicapproachforfindingsignificantpathwaysdominatedbykeygenes
AT liujialin centralitybasedpathwayenrichmentasystematicapproachforfindingsignificantpathwaysdominatedbykeygenes
AT caokunming centralitybasedpathwayenrichmentasystematicapproachforfindingsignificantpathwaysdominatedbykeygenes
AT zhangjunfeng centralitybasedpathwayenrichmentasystematicapproachforfindingsignificantpathwaysdominatedbykeygenes
AT wangjin centralitybasedpathwayenrichmentasystematicapproachforfindingsignificantpathwaysdominatedbykeygenes