Cargando…

A simple null model for inferences from network enrichment analysis

A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are know...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeuken, Gustavo S., Käll, Lukas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226187/
https://www.ncbi.nlm.nih.gov/pubmed/30412619
http://dx.doi.org/10.1371/journal.pone.0206864
_version_ 1783369918111023104
author Jeuken, Gustavo S.
Käll, Lukas
author_facet Jeuken, Gustavo S.
Käll, Lukas
author_sort Jeuken, Gustavo S.
collection PubMed
description A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret.
format Online
Article
Text
id pubmed-6226187
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-62261872018-11-19 A simple null model for inferences from network enrichment analysis Jeuken, Gustavo S. Käll, Lukas PLoS One Research Article A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret. Public Library of Science 2018-11-09 /pmc/articles/PMC6226187/ /pubmed/30412619 http://dx.doi.org/10.1371/journal.pone.0206864 Text en © 2018 Jeuken, Käll http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Jeuken, Gustavo S.
Käll, Lukas
A simple null model for inferences from network enrichment analysis
title A simple null model for inferences from network enrichment analysis
title_full A simple null model for inferences from network enrichment analysis
title_fullStr A simple null model for inferences from network enrichment analysis
title_full_unstemmed A simple null model for inferences from network enrichment analysis
title_short A simple null model for inferences from network enrichment analysis
title_sort simple null model for inferences from network enrichment analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226187/
https://www.ncbi.nlm.nih.gov/pubmed/30412619
http://dx.doi.org/10.1371/journal.pone.0206864
work_keys_str_mv AT jeukengustavos asimplenullmodelforinferencesfromnetworkenrichmentanalysis
AT kalllukas asimplenullmodelforinferencesfromnetworkenrichmentanalysis
AT jeukengustavos simplenullmodelforinferencesfromnetworkenrichmentanalysis
AT kalllukas simplenullmodelforinferencesfromnetworkenrichmentanalysis