Cargando…
A shortcut for multiple testing on the directed acyclic graph of gene ontology
BACKGROUND: Gene set testing has become an important analysis technique in high throughput microarray and next generation sequencing studies for uncovering patterns of differential expression of various biological processes. Often, the large number of gene sets that are tested simultaneously require...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4232707/ https://www.ncbi.nlm.nih.gov/pubmed/25366961 http://dx.doi.org/10.1186/s12859-014-0349-3 |
_version_ | 1782344620624052224 |
---|---|
author | Saunders, Garrett Stevens, John R Isom, S Clay |
author_facet | Saunders, Garrett Stevens, John R Isom, S Clay |
author_sort | Saunders, Garrett |
collection | PubMed |
description | BACKGROUND: Gene set testing has become an important analysis technique in high throughput microarray and next generation sequencing studies for uncovering patterns of differential expression of various biological processes. Often, the large number of gene sets that are tested simultaneously require some sort of multiplicity correction to account for the multiplicity effect. This work provides a substantial computational improvement to an existing familywise error rate controlling multiplicity approach (the Focus Level method) for gene set testing in high throughput microarray and next generation sequencing studies using Gene Ontology graphs, which we call the Short Focus Level. RESULTS: The Short Focus Level procedure, which performs a shortcut of the full Focus Level procedure, is achieved by extending the reach of graphical weighted Bonferroni testing to closed testing situations where restricted hypotheses are present, such as in the Gene Ontology graphs. The Short Focus Level multiplicity adjustment can perform the full top-down approach of the original Focus Level procedure, overcoming a significant disadvantage of the otherwise powerful Focus Level multiplicity adjustment. The computational and power differences of the Short Focus Level procedure as compared to the original Focus Level procedure are demonstrated both through simulation and using real data. CONCLUSIONS: The Short Focus Level procedure shows a significant increase in computation speed over the original Focus Level procedure (as much as ∼15,000 times faster). The Short Focus Level should be used in place of the Focus Level procedure whenever the logical assumptions of the Gene Ontology graph structure are appropriate for the study objectives and when either no a priori focus level of interest can be specified or the focus level is selected at a higher level of the graph, where the Focus Level procedure is computationally intractable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0349-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4232707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42327072014-11-17 A shortcut for multiple testing on the directed acyclic graph of gene ontology Saunders, Garrett Stevens, John R Isom, S Clay BMC Bioinformatics Methodology Article BACKGROUND: Gene set testing has become an important analysis technique in high throughput microarray and next generation sequencing studies for uncovering patterns of differential expression of various biological processes. Often, the large number of gene sets that are tested simultaneously require some sort of multiplicity correction to account for the multiplicity effect. This work provides a substantial computational improvement to an existing familywise error rate controlling multiplicity approach (the Focus Level method) for gene set testing in high throughput microarray and next generation sequencing studies using Gene Ontology graphs, which we call the Short Focus Level. RESULTS: The Short Focus Level procedure, which performs a shortcut of the full Focus Level procedure, is achieved by extending the reach of graphical weighted Bonferroni testing to closed testing situations where restricted hypotheses are present, such as in the Gene Ontology graphs. The Short Focus Level multiplicity adjustment can perform the full top-down approach of the original Focus Level procedure, overcoming a significant disadvantage of the otherwise powerful Focus Level multiplicity adjustment. The computational and power differences of the Short Focus Level procedure as compared to the original Focus Level procedure are demonstrated both through simulation and using real data. CONCLUSIONS: The Short Focus Level procedure shows a significant increase in computation speed over the original Focus Level procedure (as much as ∼15,000 times faster). The Short Focus Level should be used in place of the Focus Level procedure whenever the logical assumptions of the Gene Ontology graph structure are appropriate for the study objectives and when either no a priori focus level of interest can be specified or the focus level is selected at a higher level of the graph, where the Focus Level procedure is computationally intractable. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-014-0349-3) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-01 /pmc/articles/PMC4232707/ /pubmed/25366961 http://dx.doi.org/10.1186/s12859-014-0349-3 Text en © Saunders et al.; licensee BioMed Central Ltd. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Saunders, Garrett Stevens, John R Isom, S Clay A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title | A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title_full | A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title_fullStr | A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title_full_unstemmed | A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title_short | A shortcut for multiple testing on the directed acyclic graph of gene ontology |
title_sort | shortcut for multiple testing on the directed acyclic graph of gene ontology |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4232707/ https://www.ncbi.nlm.nih.gov/pubmed/25366961 http://dx.doi.org/10.1186/s12859-014-0349-3 |
work_keys_str_mv | AT saundersgarrett ashortcutformultipletestingonthedirectedacyclicgraphofgeneontology AT stevensjohnr ashortcutformultipletestingonthedirectedacyclicgraphofgeneontology AT isomsclay ashortcutformultipletestingonthedirectedacyclicgraphofgeneontology AT saundersgarrett shortcutformultipletestingonthedirectedacyclicgraphofgeneontology AT stevensjohnr shortcutformultipletestingonthedirectedacyclicgraphofgeneontology AT isomsclay shortcutformultipletestingonthedirectedacyclicgraphofgeneontology |