Cargando…

FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis

MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yun, Topham, David J, Thakar, Juilee, Qiu, Xing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5939227/
https://www.ncbi.nlm.nih.gov/pubmed/28334094
http://dx.doi.org/10.1093/bioinformatics/btx104
_version_ 1783320921774227456
author Zhang, Yun
Topham, David J
Thakar, Juilee
Qiu, Xing
author_facet Zhang, Yun
Topham, David J
Thakar, Juilee
Qiu, Xing
author_sort Zhang, Yun
collection PubMed
description MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing. RESULTS: We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches. AVAILABILITY AND IMPLEMENTATION: The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5939227
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-59392272018-05-10 FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis Zhang, Yun Topham, David J Thakar, Juilee Qiu, Xing Bioinformatics Original Papers MOTIVATION: Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing. RESULTS: We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches. AVAILABILITY AND IMPLEMENTATION: The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-07-01 2017-02-21 /pmc/articles/PMC5939227/ /pubmed/28334094 http://dx.doi.org/10.1093/bioinformatics/btx104 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Zhang, Yun
Topham, David J
Thakar, Juilee
Qiu, Xing
FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title_full FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title_fullStr FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title_full_unstemmed FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title_short FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis
title_sort funnel-gsea: functional elastic-net regression in time-course gene set enrichment analysis
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5939227/
https://www.ncbi.nlm.nih.gov/pubmed/28334094
http://dx.doi.org/10.1093/bioinformatics/btx104
work_keys_str_mv AT zhangyun funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis
AT tophamdavidj funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis
AT thakarjuilee funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis
AT qiuxing funnelgseafunctionalelasticnetregressionintimecoursegenesetenrichmentanalysis