Cargando…

Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods

Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ebrahimpoor, Mitra, Spitali, Pietro, Hettne, Kristina, Tsonaka, Roula, Goeman, Jelle
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373179/
https://www.ncbi.nlm.nih.gov/pubmed/31297505
http://dx.doi.org/10.1093/bib/bbz074
_version_ 1783561453491453952
author Ebrahimpoor, Mitra
Spitali, Pietro
Hettne, Kristina
Tsonaka, Roula
Goeman, Jelle
author_facet Ebrahimpoor, Mitra
Spitali, Pietro
Hettne, Kristina
Tsonaka, Roula
Goeman, Jelle
author_sort Ebrahimpoor, Mitra
collection PubMed
description Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained and competitive, both of which have their advantages and disadvantages, but neither offers the final solution. We introduce simultaneous enrichment analysis (SEA), a new approach for analysis of feature sets in genomics and other omics based on a new unified null hypothesis, which includes the self-contained and competitive null hypotheses as special cases. We employ closed testing using Simes tests to test this new hypothesis. For every feature set, the proportion of active features is estimated, and a confidence bound is provided. Also, for every unified null hypotheses, a [Formula: see text]-value is calculated, which is adjusted for family-wise error rate. SEA does not need to assume that the features are independent. Moreover, users are allowed to choose the feature set(s) of interest after observing the data. We develop a novel pipeline and apply it on RNA-seq data of dystrophin-deficient mdx mice, showcasing the flexibility of the method. Finally, the power properties of the method are evaluated through simulation studies.
format Online
Article
Text
id pubmed-7373179
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73731792020-07-23 Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods Ebrahimpoor, Mitra Spitali, Pietro Hettne, Kristina Tsonaka, Roula Goeman, Jelle Brief Bioinform Review Articles Studying sets of genomic features is increasingly popular in genomics, proteomics and metabolomics since analyzing at set level not only creates a natural connection to biological knowledge but also offers more statistical power. Currently, there are two gene-set testing approaches, self-contained and competitive, both of which have their advantages and disadvantages, but neither offers the final solution. We introduce simultaneous enrichment analysis (SEA), a new approach for analysis of feature sets in genomics and other omics based on a new unified null hypothesis, which includes the self-contained and competitive null hypotheses as special cases. We employ closed testing using Simes tests to test this new hypothesis. For every feature set, the proportion of active features is estimated, and a confidence bound is provided. Also, for every unified null hypotheses, a [Formula: see text]-value is calculated, which is adjusted for family-wise error rate. SEA does not need to assume that the features are independent. Moreover, users are allowed to choose the feature set(s) of interest after observing the data. We develop a novel pipeline and apply it on RNA-seq data of dystrophin-deficient mdx mice, showcasing the flexibility of the method. Finally, the power properties of the method are evaluated through simulation studies. Oxford University Press 2019-07-09 /pmc/articles/PMC7373179/ /pubmed/31297505 http://dx.doi.org/10.1093/bib/bbz074 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review Articles
Ebrahimpoor, Mitra
Spitali, Pietro
Hettne, Kristina
Tsonaka, Roula
Goeman, Jelle
Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title_full Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title_fullStr Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title_full_unstemmed Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title_short Simultaneous Enrichment Analysis of all Possible Gene-sets: Unifying Self-Contained and Competitive Methods
title_sort simultaneous enrichment analysis of all possible gene-sets: unifying self-contained and competitive methods
topic Review Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7373179/
https://www.ncbi.nlm.nih.gov/pubmed/31297505
http://dx.doi.org/10.1093/bib/bbz074
work_keys_str_mv AT ebrahimpoormitra simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods
AT spitalipietro simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods
AT hettnekristina simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods
AT tsonakaroula simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods
AT goemanjelle simultaneousenrichmentanalysisofallpossiblegenesetsunifyingselfcontainedandcompetitivemethods