Cargando…

A comparative study of topology-based pathway enrichment analysis methods

BACKGROUND: Pathway enrichment extensively used in the analysis of Omics data for gaining biological insights into the functional roles of pre-defined subsets of genes, proteins and metabolites. A large number of methods have been proposed in the literature for this task. The vast majority of these...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Jing, Shojaie, Ali, Michailidis, George
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6829999/
https://www.ncbi.nlm.nih.gov/pubmed/31684881
http://dx.doi.org/10.1186/s12859-019-3146-1
_version_ 1783465689656328192
author Ma, Jing
Shojaie, Ali
Michailidis, George
author_facet Ma, Jing
Shojaie, Ali
Michailidis, George
author_sort Ma, Jing
collection PubMed
description BACKGROUND: Pathway enrichment extensively used in the analysis of Omics data for gaining biological insights into the functional roles of pre-defined subsets of genes, proteins and metabolites. A large number of methods have been proposed in the literature for this task. The vast majority of these methods use as input expression levels of the biomolecules under study together with their membership in pathways of interest. The latest generation of pathway enrichment methods also leverages information on the topology of the underlying pathways, which as evidence from their evaluation reveals, lead to improved sensitivity and specificity. Nevertheless, a systematic empirical comparison of such methods is still lacking, making selection of the most suitable method for a specific experimental setting challenging. This comparative study of nine network-based methods for pathway enrichment analysis aims to provide a systematic evaluation of their performance based on three real data sets with different number of features (genes/metabolites) and number of samples. RESULTS: The findings highlight both methodological and empirical differences across the nine methods. In particular, certain methods assess pathway enrichment due to differences both across expression levels and in the strength of the interconnectedness of the members of the pathway, while others only leverage differential expression levels. In the more challenging setting involving a metabolomics data set, the results show that methods that utilize both pieces of information (with NetGSA being a prototypical one) exhibit superior statistical power in detecting pathway enrichment. CONCLUSION: The analysis reveals that a number of methods perform equally well when testing large size pathways, which is the case with genomic data. On the other hand, NetGSA that takes into consideration both differential expression of the biomolecules in the pathway, as well as changes in the topology exhibits a superior performance when testing small size pathways, which is usually the case for metabolomics data.
format Online
Article
Text
id pubmed-6829999
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68299992019-11-08 A comparative study of topology-based pathway enrichment analysis methods Ma, Jing Shojaie, Ali Michailidis, George BMC Bioinformatics Research Article BACKGROUND: Pathway enrichment extensively used in the analysis of Omics data for gaining biological insights into the functional roles of pre-defined subsets of genes, proteins and metabolites. A large number of methods have been proposed in the literature for this task. The vast majority of these methods use as input expression levels of the biomolecules under study together with their membership in pathways of interest. The latest generation of pathway enrichment methods also leverages information on the topology of the underlying pathways, which as evidence from their evaluation reveals, lead to improved sensitivity and specificity. Nevertheless, a systematic empirical comparison of such methods is still lacking, making selection of the most suitable method for a specific experimental setting challenging. This comparative study of nine network-based methods for pathway enrichment analysis aims to provide a systematic evaluation of their performance based on three real data sets with different number of features (genes/metabolites) and number of samples. RESULTS: The findings highlight both methodological and empirical differences across the nine methods. In particular, certain methods assess pathway enrichment due to differences both across expression levels and in the strength of the interconnectedness of the members of the pathway, while others only leverage differential expression levels. In the more challenging setting involving a metabolomics data set, the results show that methods that utilize both pieces of information (with NetGSA being a prototypical one) exhibit superior statistical power in detecting pathway enrichment. CONCLUSION: The analysis reveals that a number of methods perform equally well when testing large size pathways, which is the case with genomic data. On the other hand, NetGSA that takes into consideration both differential expression of the biomolecules in the pathway, as well as changes in the topology exhibits a superior performance when testing small size pathways, which is usually the case for metabolomics data. BioMed Central 2019-11-04 /pmc/articles/PMC6829999/ /pubmed/31684881 http://dx.doi.org/10.1186/s12859-019-3146-1 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ma, Jing
Shojaie, Ali
Michailidis, George
A comparative study of topology-based pathway enrichment analysis methods
title A comparative study of topology-based pathway enrichment analysis methods
title_full A comparative study of topology-based pathway enrichment analysis methods
title_fullStr A comparative study of topology-based pathway enrichment analysis methods
title_full_unstemmed A comparative study of topology-based pathway enrichment analysis methods
title_short A comparative study of topology-based pathway enrichment analysis methods
title_sort comparative study of topology-based pathway enrichment analysis methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6829999/
https://www.ncbi.nlm.nih.gov/pubmed/31684881
http://dx.doi.org/10.1186/s12859-019-3146-1
work_keys_str_mv AT majing acomparativestudyoftopologybasedpathwayenrichmentanalysismethods
AT shojaieali acomparativestudyoftopologybasedpathwayenrichmentanalysismethods
AT michailidisgeorge acomparativestudyoftopologybasedpathwayenrichmentanalysismethods
AT majing comparativestudyoftopologybasedpathwayenrichmentanalysismethods
AT shojaieali comparativestudyoftopologybasedpathwayenrichmentanalysismethods
AT michailidisgeorge comparativestudyoftopologybasedpathwayenrichmentanalysismethods