Cargando…

Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets

Motivation: To date, gene set analysis approaches primarily focus on identifying differentially expressed gene sets (pathways). Methods for identifying differentially coexpressed pathways also exist but are mostly based on aggregated pairwise correlations or other pairwise measures of coexpression....

Descripción completa

Detalles Bibliográficos
Autores principales: Rahmatallah, Yasir, Emmert-Streib, Frank, Glazko, Galina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023302/
https://www.ncbi.nlm.nih.gov/pubmed/24292935
http://dx.doi.org/10.1093/bioinformatics/btt687
_version_ 1782316536167399424
author Rahmatallah, Yasir
Emmert-Streib, Frank
Glazko, Galina
author_facet Rahmatallah, Yasir
Emmert-Streib, Frank
Glazko, Galina
author_sort Rahmatallah, Yasir
collection PubMed
description Motivation: To date, gene set analysis approaches primarily focus on identifying differentially expressed gene sets (pathways). Methods for identifying differentially coexpressed pathways also exist but are mostly based on aggregated pairwise correlations or other pairwise measures of coexpression. Instead, we propose Gene Sets Net Correlations Analysis (GSNCA), a multivariate differential coexpression test that accounts for the complete correlation structure between genes. Results: In GSNCA, weight factors are assigned to genes in proportion to the genes’ cross-correlations (intergene correlations). The problem of finding the weight vectors is formulated as an eigenvector problem with a unique solution. GSNCA tests the null hypothesis that for a gene set there is no difference in the weight vectors of the genes between two conditions. In simulation studies and the analyses of experimental data, we demonstrate that GSNCA captures changes in the structure of genes’ cross-correlations rather than differences in the averaged pairwise correlations. Thus, GSNCA infers differences in coexpression networks, however, bypassing method-dependent steps of network inference. As an additional result from GSNCA, we define hub genes as genes with the largest weights and show that these genes correspond frequently to major and specific pathway regulators, as well as to genes that are most affected by the biological difference between two conditions. In summary, GSNCA is a new approach for the analysis of differentially coexpressed pathways that also evaluates the importance of the genes in the pathways, thus providing unique information that may result in the generation of novel biological hypotheses. Availability and implementation: Implementation of the GSNCA test in R is available upon request from the authors. Contact: YRahmatallah@uams.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4023302
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40233022014-06-18 Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets Rahmatallah, Yasir Emmert-Streib, Frank Glazko, Galina Bioinformatics Original Papers Motivation: To date, gene set analysis approaches primarily focus on identifying differentially expressed gene sets (pathways). Methods for identifying differentially coexpressed pathways also exist but are mostly based on aggregated pairwise correlations or other pairwise measures of coexpression. Instead, we propose Gene Sets Net Correlations Analysis (GSNCA), a multivariate differential coexpression test that accounts for the complete correlation structure between genes. Results: In GSNCA, weight factors are assigned to genes in proportion to the genes’ cross-correlations (intergene correlations). The problem of finding the weight vectors is formulated as an eigenvector problem with a unique solution. GSNCA tests the null hypothesis that for a gene set there is no difference in the weight vectors of the genes between two conditions. In simulation studies and the analyses of experimental data, we demonstrate that GSNCA captures changes in the structure of genes’ cross-correlations rather than differences in the averaged pairwise correlations. Thus, GSNCA infers differences in coexpression networks, however, bypassing method-dependent steps of network inference. As an additional result from GSNCA, we define hub genes as genes with the largest weights and show that these genes correspond frequently to major and specific pathway regulators, as well as to genes that are most affected by the biological difference between two conditions. In summary, GSNCA is a new approach for the analysis of differentially coexpressed pathways that also evaluates the importance of the genes in the pathways, thus providing unique information that may result in the generation of novel biological hypotheses. Availability and implementation: Implementation of the GSNCA test in R is available upon request from the authors. Contact: YRahmatallah@uams.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2014-02-01 2013-11-30 /pmc/articles/PMC4023302/ /pubmed/24292935 http://dx.doi.org/10.1093/bioinformatics/btt687 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Rahmatallah, Yasir
Emmert-Streib, Frank
Glazko, Galina
Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title_full Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title_fullStr Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title_full_unstemmed Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title_short Gene Sets Net Correlations Analysis (GSNCA): a multivariate differential coexpression test for gene sets
title_sort gene sets net correlations analysis (gsnca): a multivariate differential coexpression test for gene sets
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4023302/
https://www.ncbi.nlm.nih.gov/pubmed/24292935
http://dx.doi.org/10.1093/bioinformatics/btt687
work_keys_str_mv AT rahmatallahyasir genesetsnetcorrelationsanalysisgsncaamultivariatedifferentialcoexpressiontestforgenesets
AT emmertstreibfrank genesetsnetcorrelationsanalysisgsncaamultivariatedifferentialcoexpressiontestforgenesets
AT glazkogalina genesetsnetcorrelationsanalysisgsncaamultivariatedifferentialcoexpressiontestforgenesets