Cargando…
A simple null model for inferences from network enrichment analysis
A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are know...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226187/ https://www.ncbi.nlm.nih.gov/pubmed/30412619 http://dx.doi.org/10.1371/journal.pone.0206864 |
_version_ | 1783369918111023104 |
---|---|
author | Jeuken, Gustavo S. Käll, Lukas |
author_facet | Jeuken, Gustavo S. Käll, Lukas |
author_sort | Jeuken, Gustavo S. |
collection | PubMed |
description | A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret. |
format | Online Article Text |
id | pubmed-6226187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-62261872018-11-19 A simple null model for inferences from network enrichment analysis Jeuken, Gustavo S. Käll, Lukas PLoS One Research Article A prevailing technique to infer function from lists of identifications, from molecular biological high-throughput experiments, is over-representation analysis, where the identifications are compared to predefined sets of related genes often referred to as pathways. As at least some pathways are known to be incomplete in their annotation, algorithmic efforts have been made to complement them with information from functional association networks. While the terminology varies in the literature, we will here refer to such methods as Network Enrichment Analysis (NEA). Traditionally, the significance of inferences from NEA has been assigned using a null model constructed from randomizations of the network. Here we instead argue for a null model that more directly relates to the set of genes being studied, and have designed one dynamic programming algorithm that calculates the score distribution of NEA scores that makes it possible to assign unbiased mid p values to inferences. We also implemented a random sampling method, carrying out the same task. We demonstrate that our method obtains a superior statistical calibration as compared to the popular NEA inference engine, BinoX, while also providing statistics that are easier to interpret. Public Library of Science 2018-11-09 /pmc/articles/PMC6226187/ /pubmed/30412619 http://dx.doi.org/10.1371/journal.pone.0206864 Text en © 2018 Jeuken, Käll http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Jeuken, Gustavo S. Käll, Lukas A simple null model for inferences from network enrichment analysis |
title | A simple null model for inferences from network enrichment analysis |
title_full | A simple null model for inferences from network enrichment analysis |
title_fullStr | A simple null model for inferences from network enrichment analysis |
title_full_unstemmed | A simple null model for inferences from network enrichment analysis |
title_short | A simple null model for inferences from network enrichment analysis |
title_sort | simple null model for inferences from network enrichment analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6226187/ https://www.ncbi.nlm.nih.gov/pubmed/30412619 http://dx.doi.org/10.1371/journal.pone.0206864 |
work_keys_str_mv | AT jeukengustavos asimplenullmodelforinferencesfromnetworkenrichmentanalysis AT kalllukas asimplenullmodelforinferencesfromnetworkenrichmentanalysis AT jeukengustavos simplenullmodelforinferencesfromnetworkenrichmentanalysis AT kalllukas simplenullmodelforinferencesfromnetworkenrichmentanalysis |