Cargando…
Exploiting single-cell expression to characterize co-expression replicability
BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trac...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862082/ https://www.ncbi.nlm.nih.gov/pubmed/27165153 http://dx.doi.org/10.1186/s13059-016-0964-6 |
_version_ | 1782431305020997632 |
---|---|
author | Crow, Megan Paul, Anirban Ballouz, Sara Huang, Z. Josh Gillis, Jesse |
author_facet | Crow, Megan Paul, Anirban Ballouz, Sara Huang, Z. Josh Gillis, Jesse |
author_sort | Crow, Megan |
collection | PubMed |
description | BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trace to their basis in the data. Here, we use both published and novel single-cell RNA sequencing (RNA-seq) data to understand fundamental drivers of gene-gene connectivity and replicability in co-expression networks. RESULTS: We perform the first major analysis of single-cell co-expression, sampling from 31 individual studies. Using neighbor voting in cross-validation, we find that single-cell network connectivity is less likely to overlap with known functions than co-expression derived from bulk data, with functional variation within cell types strongly resembling that also occurring across cell types. To identify features and analysis practices that contribute to this connectivity, we perform our own single-cell RNA-seq experiment of 126 cortical interneurons in an experimental design targeted to co-expression. By assessing network replicability, semantic similarity and overall functional connectivity, we identify technical factors influencing co-expression and suggest how they can be controlled for. Many of the technical effects we identify are expression-level dependent, making expression level itself highly predictive of network topology. We show this occurs generally through re-analysis of the BrainSpan RNA-seq data. CONCLUSIONS: Technical properties of single-cell RNA-seq data create confounds in co-expression networks which can be identified and explicitly controlled for in any supervised analysis. This is useful both in improving co-expression performance and in characterizing single-cell data in generally applicable terms, permitting cross-laboratory comparison within a common framework. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0964-6) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4862082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48620822016-05-11 Exploiting single-cell expression to characterize co-expression replicability Crow, Megan Paul, Anirban Ballouz, Sara Huang, Z. Josh Gillis, Jesse Genome Biol Research BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trace to their basis in the data. Here, we use both published and novel single-cell RNA sequencing (RNA-seq) data to understand fundamental drivers of gene-gene connectivity and replicability in co-expression networks. RESULTS: We perform the first major analysis of single-cell co-expression, sampling from 31 individual studies. Using neighbor voting in cross-validation, we find that single-cell network connectivity is less likely to overlap with known functions than co-expression derived from bulk data, with functional variation within cell types strongly resembling that also occurring across cell types. To identify features and analysis practices that contribute to this connectivity, we perform our own single-cell RNA-seq experiment of 126 cortical interneurons in an experimental design targeted to co-expression. By assessing network replicability, semantic similarity and overall functional connectivity, we identify technical factors influencing co-expression and suggest how they can be controlled for. Many of the technical effects we identify are expression-level dependent, making expression level itself highly predictive of network topology. We show this occurs generally through re-analysis of the BrainSpan RNA-seq data. CONCLUSIONS: Technical properties of single-cell RNA-seq data create confounds in co-expression networks which can be identified and explicitly controlled for in any supervised analysis. This is useful both in improving co-expression performance and in characterizing single-cell data in generally applicable terms, permitting cross-laboratory comparison within a common framework. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0964-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-06 /pmc/articles/PMC4862082/ /pubmed/27165153 http://dx.doi.org/10.1186/s13059-016-0964-6 Text en © Crow et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Crow, Megan Paul, Anirban Ballouz, Sara Huang, Z. Josh Gillis, Jesse Exploiting single-cell expression to characterize co-expression replicability |
title | Exploiting single-cell expression to characterize co-expression replicability |
title_full | Exploiting single-cell expression to characterize co-expression replicability |
title_fullStr | Exploiting single-cell expression to characterize co-expression replicability |
title_full_unstemmed | Exploiting single-cell expression to characterize co-expression replicability |
title_short | Exploiting single-cell expression to characterize co-expression replicability |
title_sort | exploiting single-cell expression to characterize co-expression replicability |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862082/ https://www.ncbi.nlm.nih.gov/pubmed/27165153 http://dx.doi.org/10.1186/s13059-016-0964-6 |
work_keys_str_mv | AT crowmegan exploitingsinglecellexpressiontocharacterizecoexpressionreplicability AT paulanirban exploitingsinglecellexpressiontocharacterizecoexpressionreplicability AT ballouzsara exploitingsinglecellexpressiontocharacterizecoexpressionreplicability AT huangzjosh exploitingsinglecellexpressiontocharacterizecoexpressionreplicability AT gillisjesse exploitingsinglecellexpressiontocharacterizecoexpressionreplicability |