Cargando…

Gene set enrichment analysis for multiple continuous phenotypes

BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiaoming, Pyne, Saumyadipta, Dinu, Irina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4129103/
https://www.ncbi.nlm.nih.gov/pubmed/25086605
http://dx.doi.org/10.1186/1471-2105-15-260
_version_ 1782330193298325504
author Wang, Xiaoming
Pyne, Saumyadipta
Dinu, Irina
author_facet Wang, Xiaoming
Pyne, Saumyadipta
Dinu, Irina
author_sort Wang, Xiaoming
collection PubMed
description BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method to accommodate correlated multiple continuous phenotypes. RESULT: We propose here an extension of the linear combination test (LCT) to its new version for multiple continuous phenotypes, incorporating correlations among gene expressions of functionally related gene sets, as well as correlations among multiple phenotypes. Further, we extend our new method to its nonlinear version, referred as nonlinear combination test (NLCT), to test potential nonlinear association of gene sets with multiple phenotypes. Simulation study and a real microarray example demonstrate the practical aspects of the proposed methods. CONCLUSION: The proposed approaches are effective in controlling type I errors and powerful in testing associations between gene-sets and multiple continuous phenotypes. They are both computationally effective. Naively (univariately) analyzing a group of multiple correlated phenotypes could be dangerous. R-codes to perform LCT and NLCT for multiple continuous phenotypes are available at http://www.ualberta.ca/~yyasui/homepage.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-260) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4129103
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41291032014-08-13 Gene set enrichment analysis for multiple continuous phenotypes Wang, Xiaoming Pyne, Saumyadipta Dinu, Irina BMC Bioinformatics Methodology Article BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method to accommodate correlated multiple continuous phenotypes. RESULT: We propose here an extension of the linear combination test (LCT) to its new version for multiple continuous phenotypes, incorporating correlations among gene expressions of functionally related gene sets, as well as correlations among multiple phenotypes. Further, we extend our new method to its nonlinear version, referred as nonlinear combination test (NLCT), to test potential nonlinear association of gene sets with multiple phenotypes. Simulation study and a real microarray example demonstrate the practical aspects of the proposed methods. CONCLUSION: The proposed approaches are effective in controlling type I errors and powerful in testing associations between gene-sets and multiple continuous phenotypes. They are both computationally effective. Naively (univariately) analyzing a group of multiple correlated phenotypes could be dangerous. R-codes to perform LCT and NLCT for multiple continuous phenotypes are available at http://www.ualberta.ca/~yyasui/homepage.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-260) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-03 /pmc/articles/PMC4129103/ /pubmed/25086605 http://dx.doi.org/10.1186/1471-2105-15-260 Text en © Wang et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Xiaoming
Pyne, Saumyadipta
Dinu, Irina
Gene set enrichment analysis for multiple continuous phenotypes
title Gene set enrichment analysis for multiple continuous phenotypes
title_full Gene set enrichment analysis for multiple continuous phenotypes
title_fullStr Gene set enrichment analysis for multiple continuous phenotypes
title_full_unstemmed Gene set enrichment analysis for multiple continuous phenotypes
title_short Gene set enrichment analysis for multiple continuous phenotypes
title_sort gene set enrichment analysis for multiple continuous phenotypes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4129103/
https://www.ncbi.nlm.nih.gov/pubmed/25086605
http://dx.doi.org/10.1186/1471-2105-15-260
work_keys_str_mv AT wangxiaoming genesetenrichmentanalysisformultiplecontinuousphenotypes
AT pynesaumyadipta genesetenrichmentanalysisformultiplecontinuousphenotypes
AT dinuirina genesetenrichmentanalysisformultiplecontinuousphenotypes