Cargando…

NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis

BACKGROUND: The statistical evaluation of pathway enrichment, i.e. of gene profiles' confluence to the pathway level, allows exploring molecular landscapes using functionally annotated gene sets. However, pathway scores can also be used as predictive features in machine learning. That requires,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jeggari, Ashwini, Alexeyenko, Andrey
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374688/ https://www.ncbi.nlm.nih.gov/pubmed/28361684 http://dx.doi.org/10.1186/s12859-017-1534-y

_version_	1782518943604277248
author	Jeggari, Ashwini Alexeyenko, Andrey
author_facet	Jeggari, Ashwini Alexeyenko, Andrey
author_sort	Jeggari, Ashwini
collection	PubMed
description	BACKGROUND: The statistical evaluation of pathway enrichment, i.e. of gene profiles' confluence to the pathway level, allows exploring molecular landscapes using functionally annotated gene sets. However, pathway scores can also be used as predictive features in machine learning. That requires, firstly, increasing statistical power and biological relevance via a network enrichment analysis (NEA) and, secondly, a fast and convenient procedure for rendering the original data into a space of pathway scores. However, previous implementations of NEA involved multiple runs of network randomization and were therefore slow. RESULTS: Here, we present a new R package NEArender which can transform raw 'omics' features of experimental or clinical samples into matrices describing the same samples with many fewer NEA-based pathway scores. This is done via a parametric estimation of the null binomial distribution and is thus much faster and less biased than randomization procedures. Further, we compare estimates from these two alternative procedures and demonstrate that the summarization of individual genes to pathways increases the statistical power compared to both the default differential expression analysis on individual genes and the state-of-the-art gene set enrichment analysis. The package also contains functions for preparing input, modeling null distributions, and evaluating alternative versions of the global network. CONCLUSIONS: Beyond the state-of-the-art exploration of molecular data through pathway enrichment, score matrices produced by NEArender can be used in larger bioinformatics pipelines as input for phenotype modeling, predicting disease outcomes etc. This approach is often more sensitive and robust than using the original data. The package NEArender is complementary to the online NEA tool EviNet (https://www.evinet.org) and, unlike of the latter, enables high performance of computations off-line. The R package NEArender version 1.4 is available at CRAN repository https://cran.r-project.org/web/packages/NEArender/ ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1534-y) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5374688
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-53746882017-04-03 NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis Jeggari, Ashwini Alexeyenko, Andrey BMC Bioinformatics Research BACKGROUND: The statistical evaluation of pathway enrichment, i.e. of gene profiles' confluence to the pathway level, allows exploring molecular landscapes using functionally annotated gene sets. However, pathway scores can also be used as predictive features in machine learning. That requires, firstly, increasing statistical power and biological relevance via a network enrichment analysis (NEA) and, secondly, a fast and convenient procedure for rendering the original data into a space of pathway scores. However, previous implementations of NEA involved multiple runs of network randomization and were therefore slow. RESULTS: Here, we present a new R package NEArender which can transform raw 'omics' features of experimental or clinical samples into matrices describing the same samples with many fewer NEA-based pathway scores. This is done via a parametric estimation of the null binomial distribution and is thus much faster and less biased than randomization procedures. Further, we compare estimates from these two alternative procedures and demonstrate that the summarization of individual genes to pathways increases the statistical power compared to both the default differential expression analysis on individual genes and the state-of-the-art gene set enrichment analysis. The package also contains functions for preparing input, modeling null distributions, and evaluating alternative versions of the global network. CONCLUSIONS: Beyond the state-of-the-art exploration of molecular data through pathway enrichment, score matrices produced by NEArender can be used in larger bioinformatics pipelines as input for phenotype modeling, predicting disease outcomes etc. This approach is often more sensitive and robust than using the original data. The package NEArender is complementary to the online NEA tool EviNet (https://www.evinet.org) and, unlike of the latter, enables high performance of computations off-line. The R package NEArender version 1.4 is available at CRAN repository https://cran.r-project.org/web/packages/NEArender/ ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1534-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-23 /pmc/articles/PMC5374688/ /pubmed/28361684 http://dx.doi.org/10.1186/s12859-017-1534-y Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Jeggari, Ashwini Alexeyenko, Andrey NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title	NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title_full	NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title_fullStr	NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title_full_unstemmed	NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title_short	NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis
title_sort	nearender: an r package for functional interpretation of ‘omics’ data via network enrichment analysis
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5374688/ https://www.ncbi.nlm.nih.gov/pubmed/28361684 http://dx.doi.org/10.1186/s12859-017-1534-y
work_keys_str_mv	AT jeggariashwini nearenderanrpackageforfunctionalinterpretationofomicsdatavianetworkenrichmentanalysis AT alexeyenkoandrey nearenderanrpackageforfunctionalinterpretationofomicsdatavianetworkenrichmentanalysis

NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis

Ejemplares similares