Cargando…
Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373179/ https://www.ncbi.nlm.nih.gov/pubmed/31297505 http://dx.doi.org/10.1093/bib/bbz074 |
_version_ | 1783561453491453952 |
---|---|
author | Ebrahimpoor, Mitra Spitali, Pietro Hettne, Kristina Tsonaka, Roula Goeman, Jelle |
author_facet | Ebrahimpoor, Mitra Spitali, Pietro Hettne, Kristina Tsonaka, Roula Goeman, Jelle |
author_sort | Ebrahimpoor, Mitra |
collection | PubMed |
description | Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained and competitive, both of which have their advantages and disadvantages, but neither offers the final solution. We introduce simultaneous enrichment analysis (SEA), a new approach for analysis of feature sets in genomics and other omics based on a new unified null hypothesis, which includes the self-contained and competitive null hypotheses as special cases. We employ closed testing using Simes tests to test this new hypothesis. For every feature set, the proportion of active features is estimated, and a confidence bound is provided. Also, for every unified null hypotheses, a [Formula: see text]-value is calculated, which is adjusted for family-wise error rate. SEA does not need to assume that the features are independent. Moreover, users are allowed to choose the feature set(s) of interest after observing the data. We develop a novel pipeline and apply it on RNA-seq data of dystrophin-deficient mdx mice, showcasing the flexibility of the method. Finally, the power properties of the method are evaluated through simulation studies. |
format | Online Article Text |
id | pubmed-7373179 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73731792020-07-23 Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods Ebrahimpoor, Mitra Spitali, Pietro Hettne, Kristina Tsonaka, Roula Goeman, Jelle Brief Bioinform Review Articles Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained and competitive, both of which have their advantages and disadvantages, but neither offers the final solution. We introduce simultaneous enrichment analysis (SEA), a new approach for analysis of feature sets in genomics and other omics based on a new unified null hypothesis, which includes the self-contained and competitive null hypotheses as special cases. We employ closed testing using Simes tests to test this new hypothesis. For every feature set, the proportion of active features is estimated, and a confidence bound is provided. Also, for every unified null hypotheses, a [Formula: see text]-value is calculated, which is adjusted for family-wise error rate. SEA does not need to assume that the features are independent. Moreover, users are allowed to choose the feature set(s) of interest after observing the data. We develop a novel pipeline and apply it on RNA-seq data of dystrophin-deficient mdx mice, showcasing the flexibility of the method. Finally, the power properties of the method are evaluated through simulation studies. Oxford University Press 2019-07-09 /pmc/articles/PMC7373179/ /pubmed/31297505 http://dx.doi.org/10.1093/bib/bbz074 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Review Articles Ebrahimpoor, Mitra Spitali, Pietro Hettne, Kristina Tsonaka, Roula Goeman, Jelle Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title | Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title_full | Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title_fullStr | Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title_full_unstemmed | Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title_short | Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods |
title_sort | simultaneous enrichment analysis of all possible gene-sets: unifying self-contained and competitive methods |
topic | Review Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373179/ https://www.ncbi.nlm.nih.gov/pubmed/31297505 http://dx.doi.org/10.1093/bib/bbz074 |
work_keys_str_mv | AT ebrahimpoormitra simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods AT spitalipietro simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods AT hettnekristina simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods AT tsonakaroula simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods AT goemanjelle simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods |