Cargando…

Untangling the effects of cellular composition on coexpression analysis

Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Farahbod, Marjan, Pavlidis, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370889/
https://www.ncbi.nlm.nih.gov/pubmed/32580998
http://dx.doi.org/10.1101/gr.256735.119
_version_ 1783561053039230976
author Farahbod, Marjan
Pavlidis, Paul
author_facet Farahbod, Marjan
Pavlidis, Paul
author_sort Farahbod, Marjan
collection PubMed
description Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition present a potential confound. However, the impact of composition on coexpression analysis has not been studied in detail. Here, we examine this issue for the case of human RNA analysis. Focusing on brain tissue, we found that, for most genes, differences in expression levels across cell types account for a large fraction of the variance of their measured RNA levels (median R(2) = 0.68). We then show that genes that have similar expression patterns across cell types will have correlated RNA levels in bulk tissue, due to the effect of variation in cellular composition. We demonstrate that much of the coexpression and the formation of coexpression clusters can be attributed to this effect for both brain and blood transcriptomes. For brain, we further show how this composition-induced coexpression masks underlying intra-cell-type coexpression observed in single-cell data. An attempt to correct for composition yielded mixed results. Our conclusion is that the dominant coexpression signal in brain, blood, and, likely, other complex tissues can be attributed to cellular compositional effects, rather than intra-cell-type regulatory relationships. These results have implications for the relevance and interpretation of coexpression analysis.
format Online
Article
Text
id pubmed-7370889
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-73708892020-12-01 Untangling the effects of cellular composition on coexpression analysis Farahbod, Marjan Pavlidis, Paul Genome Res Research Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition present a potential confound. However, the impact of composition on coexpression analysis has not been studied in detail. Here, we examine this issue for the case of human RNA analysis. Focusing on brain tissue, we found that, for most genes, differences in expression levels across cell types account for a large fraction of the variance of their measured RNA levels (median R(2) = 0.68). We then show that genes that have similar expression patterns across cell types will have correlated RNA levels in bulk tissue, due to the effect of variation in cellular composition. We demonstrate that much of the coexpression and the formation of coexpression clusters can be attributed to this effect for both brain and blood transcriptomes. For brain, we further show how this composition-induced coexpression masks underlying intra-cell-type coexpression observed in single-cell data. An attempt to correct for composition yielded mixed results. Our conclusion is that the dominant coexpression signal in brain, blood, and, likely, other complex tissues can be attributed to cellular compositional effects, rather than intra-cell-type regulatory relationships. These results have implications for the relevance and interpretation of coexpression analysis. Cold Spring Harbor Laboratory Press 2020-06 /pmc/articles/PMC7370889/ /pubmed/32580998 http://dx.doi.org/10.1101/gr.256735.119 Text en © 2020 Farahbod and Pavlidis; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Research
Farahbod, Marjan
Pavlidis, Paul
Untangling the effects of cellular composition on coexpression analysis
title Untangling the effects of cellular composition on coexpression analysis
title_full Untangling the effects of cellular composition on coexpression analysis
title_fullStr Untangling the effects of cellular composition on coexpression analysis
title_full_unstemmed Untangling the effects of cellular composition on coexpression analysis
title_short Untangling the effects of cellular composition on coexpression analysis
title_sort untangling the effects of cellular composition on coexpression analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370889/
https://www.ncbi.nlm.nih.gov/pubmed/32580998
http://dx.doi.org/10.1101/gr.256735.119
work_keys_str_mv AT farahbodmarjan untanglingtheeffectsofcellularcompositiononcoexpressionanalysis
AT pavlidispaul untanglingtheeffectsofcellularcompositiononcoexpressionanalysis