Cargando…

Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity

Motivation: The analysis of gene coexpression is at the core of many types of genetic analysis. The coexpression between two genes can be calculated by using a traditional Pearson's correlation coefficient. However, unobserved confounding effects may cause inflation of the Pearson's correl...

Descripción completa

Detalles Bibliográficos
Autores principales: Furlotte, Nicholas A., Kang, Hyun Min, Ye, Chun, Eskin, Eleazar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117390/
https://www.ncbi.nlm.nih.gov/pubmed/21685083
http://dx.doi.org/10.1093/bioinformatics/btr221
_version_ 1782206327953555456
author Furlotte, Nicholas A.
Kang, Hyun Min
Ye, Chun
Eskin, Eleazar
author_facet Furlotte, Nicholas A.
Kang, Hyun Min
Ye, Chun
Eskin, Eleazar
author_sort Furlotte, Nicholas A.
collection PubMed
description Motivation: The analysis of gene coexpression is at the core of many types of genetic analysis. The coexpression between two genes can be calculated by using a traditional Pearson's correlation coefficient. However, unobserved confounding effects may cause inflation of the Pearson's correlation so that uncorrelated genes appear correlated. Many general methods have been suggested, which aim to remove the effects of confounding from gene expression data. However, the residual confounding which is not accounted for by these generic correction procedures has the potential to induce correlation between genes. Therefore, a method that specifically aims to calculate gene coexpression between gene expression arrays, while accounting for confounding effects, is desirable. Results: In this article, we present a statistical model for calculating gene coexpression called mixed model coexpression (MMC), which models coexpression within a mixed model framework. Confounding effects are expected to be encoded in the matrix representing the correlation between arrays, the inter-sample correlation matrix. By conditioning on the information in the inter-sample correlation matrix, MMC is able to produce gene coexpressions that are not influenced by global confounding effects and thus significantly reduce the number of spurious coexpressions observed. We applied MMC to both human and yeast datasets and show it is better able to effectively prioritize strong coexpressions when compared to a traditional Pearson's correlation and a Pearson's correlation applied to data corrected with surrogate variable analysis (SVA). Availability: The method is implemented in the R programming language and may be found at http://genetics.cs.ucla.edu/mmc. Contact: nfurlott@cs.ucla.edu; eeskin@cs.ucla.edu
format Online
Article
Text
id pubmed-3117390
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31173902011-06-17 Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity Furlotte, Nicholas A. Kang, Hyun Min Ye, Chun Eskin, Eleazar Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Motivation: The analysis of gene coexpression is at the core of many types of genetic analysis. The coexpression between two genes can be calculated by using a traditional Pearson's correlation coefficient. However, unobserved confounding effects may cause inflation of the Pearson's correlation so that uncorrelated genes appear correlated. Many general methods have been suggested, which aim to remove the effects of confounding from gene expression data. However, the residual confounding which is not accounted for by these generic correction procedures has the potential to induce correlation between genes. Therefore, a method that specifically aims to calculate gene coexpression between gene expression arrays, while accounting for confounding effects, is desirable. Results: In this article, we present a statistical model for calculating gene coexpression called mixed model coexpression (MMC), which models coexpression within a mixed model framework. Confounding effects are expected to be encoded in the matrix representing the correlation between arrays, the inter-sample correlation matrix. By conditioning on the information in the inter-sample correlation matrix, MMC is able to produce gene coexpressions that are not influenced by global confounding effects and thus significantly reduce the number of spurious coexpressions observed. We applied MMC to both human and yeast datasets and show it is better able to effectively prioritize strong coexpressions when compared to a traditional Pearson's correlation and a Pearson's correlation applied to data corrected with surrogate variable analysis (SVA). Availability: The method is implemented in the R programming language and may be found at http://genetics.cs.ucla.edu/mmc. Contact: nfurlott@cs.ucla.edu; eeskin@cs.ucla.edu Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117390/ /pubmed/21685083 http://dx.doi.org/10.1093/bioinformatics/btr221 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
Furlotte, Nicholas A.
Kang, Hyun Min
Ye, Chun
Eskin, Eleazar
Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title_full Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title_fullStr Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title_full_unstemmed Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title_short Mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
title_sort mixed-model coexpression: calculating gene coexpression while accounting for expression heterogeneity
topic Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117390/
https://www.ncbi.nlm.nih.gov/pubmed/21685083
http://dx.doi.org/10.1093/bioinformatics/btr221
work_keys_str_mv AT furlottenicholasa mixedmodelcoexpressioncalculatinggenecoexpressionwhileaccountingforexpressionheterogeneity
AT kanghyunmin mixedmodelcoexpressioncalculatinggenecoexpressionwhileaccountingforexpressionheterogeneity
AT yechun mixedmodelcoexpressioncalculatinggenecoexpressionwhileaccountingforexpressionheterogeneity
AT eskineleazar mixedmodelcoexpressioncalculatinggenecoexpressionwhileaccountingforexpressionheterogeneity