Cargando…
FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5939227/ https://www.ncbi.nlm.nih.gov/pubmed/28334094 http://dx.doi.org/10.1093/bioinformatics/btx104 |
_version_ | 1783320921774227456 |
---|---|
author | Zhang, Yun Topham, David J Thakar, Juilee Qiu, Xing |
author_facet | Zhang, Yun Topham, David J Thakar, Juilee Qiu, Xing |
author_sort | Zhang, Yun |
collection | PubMed |
description | MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing. RESULTS: We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches. AVAILABILITY AND IMPLEMENTATION: The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-5939227 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59392272018-05-10 FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis Zhang, Yun Topham, David J Thakar, Juilee Qiu, Xing Bioinformatics Original Papers MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing. RESULTS: We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches. AVAILABILITY AND IMPLEMENTATION: The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-07-01 2017-02-21 /pmc/articles/PMC5939227/ /pubmed/28334094 http://dx.doi.org/10.1093/bioinformatics/btx104 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Zhang, Yun Topham, David J Thakar, Juilee Qiu, Xing FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title | FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title_full | FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title_fullStr | FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title_full_unstemmed | FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title_short | FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis |
title_sort | funnel-gsea: functional elastic-net regression in time-course gene set enrichment analysis |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5939227/ https://www.ncbi.nlm.nih.gov/pubmed/28334094 http://dx.doi.org/10.1093/bioinformatics/btx104 |
work_keys_str_mv | AT zhangyun funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis AT tophamdavidj funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis AT thakarjuilee funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis AT qiuxing funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis |