Cargando…
Gene set enrichment analysis for multiple continuous phenotypes
BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4129103/ https://www.ncbi.nlm.nih.gov/pubmed/25086605 http://dx.doi.org/10.1186/1471-2105-15-260 |
_version_ | 1782330193298325504 |
---|---|
author | Wang, Xiaoming Pyne, Saumyadipta Dinu, Irina |
author_facet | Wang, Xiaoming Pyne, Saumyadipta Dinu, Irina |
author_sort | Wang, Xiaoming |
collection | PubMed |
description | BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method to accommodate correlated multiple continuous phenotypes. RESULT: We propose here an extension of the linear combination test (LCT) to its new version for multiple continuous phenotypes, incorporating correlations among gene expressions of functionally related gene sets, as well as correlations among multiple phenotypes. Further, we extend our new method to its nonlinear version, referred as nonlinear combination test (NLCT), to test potential nonlinear association of gene sets with multiple phenotypes. Simulation study and a real microarray example demonstrate the practical aspects of the proposed methods. CONCLUSION: The proposed approaches are effective in controlling type I errors and powerful in testing associations between gene-sets and multiple continuous phenotypes. They are both computationally effective. Naively (univariately) analyzing a group of multiple correlated phenotypes could be dangerous. R-codes to perform LCT and NLCT for multiple continuous phenotypes are available at http://www.ualberta.ca/~yyasui/homepage.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-260) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4129103 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41291032014-08-13 Gene set enrichment analysis for multiple continuous phenotypes Wang, Xiaoming Pyne, Saumyadipta Dinu, Irina BMC Bioinformatics Methodology Article BACKGROUND: Gene set analysis (GSA) methods test the association of sets of genes with phenotypes in gene expression microarray studies. While GSA methods on a single binary or categorical phenotype abounds, little attention has been paid to the case of a continuous phenotype, and there is no method to accommodate correlated multiple continuous phenotypes. RESULT: We propose here an extension of the linear combination test (LCT) to its new version for multiple continuous phenotypes, incorporating correlations among gene expressions of functionally related gene sets, as well as correlations among multiple phenotypes. Further, we extend our new method to its nonlinear version, referred as nonlinear combination test (NLCT), to test potential nonlinear association of gene sets with multiple phenotypes. Simulation study and a real microarray example demonstrate the practical aspects of the proposed methods. CONCLUSION: The proposed approaches are effective in controlling type I errors and powerful in testing associations between gene-sets and multiple continuous phenotypes. They are both computationally effective. Naively (univariately) analyzing a group of multiple correlated phenotypes could be dangerous. R-codes to perform LCT and NLCT for multiple continuous phenotypes are available at http://www.ualberta.ca/~yyasui/homepage.html. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-260) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-03 /pmc/articles/PMC4129103/ /pubmed/25086605 http://dx.doi.org/10.1186/1471-2105-15-260 Text en © Wang et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Wang, Xiaoming Pyne, Saumyadipta Dinu, Irina Gene set enrichment analysis for multiple continuous phenotypes |
title | Gene set enrichment analysis for multiple continuous phenotypes |
title_full | Gene set enrichment analysis for multiple continuous phenotypes |
title_fullStr | Gene set enrichment analysis for multiple continuous phenotypes |
title_full_unstemmed | Gene set enrichment analysis for multiple continuous phenotypes |
title_short | Gene set enrichment analysis for multiple continuous phenotypes |
title_sort | gene set enrichment analysis for multiple continuous phenotypes |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4129103/ https://www.ncbi.nlm.nih.gov/pubmed/25086605 http://dx.doi.org/10.1186/1471-2105-15-260 |
work_keys_str_mv | AT wangxiaoming genesetenrichmentanalysisformultiplecontinuousphenotypes AT pynesaumyadipta genesetenrichmentanalysisformultiplecontinuousphenotypes AT dinuirina genesetenrichmentanalysisformultiplecontinuousphenotypes |