Cargando…
Untangling the effects of cellular composition on coexpression analysis
Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition pr...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370889/ https://www.ncbi.nlm.nih.gov/pubmed/32580998 http://dx.doi.org/10.1101/gr.256735.119 |
_version_ | 1783561053039230976 |
---|---|
author | Farahbod, Marjan Pavlidis, Paul |
author_facet | Farahbod, Marjan Pavlidis, Paul |
author_sort | Farahbod, Marjan |
collection | PubMed |
description | Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition present a potential confound. However, the impact of composition on coexpression analysis has not been studied in detail. Here, we examine this issue for the case of human RNA analysis. Focusing on brain tissue, we found that, for most genes, differences in expression levels across cell types account for a large fraction of the variance of their measured RNA levels (median R(2) = 0.68). We then show that genes that have similar expression patterns across cell types will have correlated RNA levels in bulk tissue, due to the effect of variation in cellular composition. We demonstrate that much of the coexpression and the formation of coexpression clusters can be attributed to this effect for both brain and blood transcriptomes. For brain, we further show how this composition-induced coexpression masks underlying intra-cell-type coexpression observed in single-cell data. An attempt to correct for composition yielded mixed results. Our conclusion is that the dominant coexpression signal in brain, blood, and, likely, other complex tissues can be attributed to cellular compositional effects, rather than intra-cell-type regulatory relationships. These results have implications for the relevance and interpretation of coexpression analysis. |
format | Online Article Text |
id | pubmed-7370889 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73708892020-12-01 Untangling the effects of cellular composition on coexpression analysis Farahbod, Marjan Pavlidis, Paul Genome Res Research Coexpression analysis is widely used for inferring regulatory networks, predicting gene function, and interpretation of transcriptome profiling studies, based on methods such as clustering. The majority of such studies use data collected from bulk tissue, where the effects of cellular composition present a potential confound. However, the impact of composition on coexpression analysis has not been studied in detail. Here, we examine this issue for the case of human RNA analysis. Focusing on brain tissue, we found that, for most genes, differences in expression levels across cell types account for a large fraction of the variance of their measured RNA levels (median R(2) = 0.68). We then show that genes that have similar expression patterns across cell types will have correlated RNA levels in bulk tissue, due to the effect of variation in cellular composition. We demonstrate that much of the coexpression and the formation of coexpression clusters can be attributed to this effect for both brain and blood transcriptomes. For brain, we further show how this composition-induced coexpression masks underlying intra-cell-type coexpression observed in single-cell data. An attempt to correct for composition yielded mixed results. Our conclusion is that the dominant coexpression signal in brain, blood, and, likely, other complex tissues can be attributed to cellular compositional effects, rather than intra-cell-type regulatory relationships. These results have implications for the relevance and interpretation of coexpression analysis. Cold Spring Harbor Laboratory Press 2020-06 /pmc/articles/PMC7370889/ /pubmed/32580998 http://dx.doi.org/10.1101/gr.256735.119 Text en © 2020 Farahbod and Pavlidis; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Research Farahbod, Marjan Pavlidis, Paul Untangling the effects of cellular composition on coexpression analysis |
title | Untangling the effects of cellular composition on coexpression analysis |
title_full | Untangling the effects of cellular composition on coexpression analysis |
title_fullStr | Untangling the effects of cellular composition on coexpression analysis |
title_full_unstemmed | Untangling the effects of cellular composition on coexpression analysis |
title_short | Untangling the effects of cellular composition on coexpression analysis |
title_sort | untangling the effects of cellular composition on coexpression analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7370889/ https://www.ncbi.nlm.nih.gov/pubmed/32580998 http://dx.doi.org/10.1101/gr.256735.119 |
work_keys_str_mv | AT farahbodmarjan untanglingtheeffectsofcellularcompositiononcoexpressionanalysis AT pavlidispaul untanglingtheeffectsofcellularcompositiononcoexpressionanalysis |