Cargando…

Exploiting single-cell expression to characterize co-expression replicability

BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trac...

Descripción completa

Detalles Bibliográficos
Autores principales: Crow, Megan, Paul, Anirban, Ballouz, Sara, Huang, Z. Josh, Gillis, Jesse
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862082/
https://www.ncbi.nlm.nih.gov/pubmed/27165153
http://dx.doi.org/10.1186/s13059-016-0964-6
_version_ 1782431305020997632
author Crow, Megan
Paul, Anirban
Ballouz, Sara
Huang, Z. Josh
Gillis, Jesse
author_facet Crow, Megan
Paul, Anirban
Ballouz, Sara
Huang, Z. Josh
Gillis, Jesse
author_sort Crow, Megan
collection PubMed
description BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trace to their basis in the data. Here, we use both published and novel single-cell RNA sequencing (RNA-seq) data to understand fundamental drivers of gene-gene connectivity and replicability in co-expression networks. RESULTS: We perform the first major analysis of single-cell co-expression, sampling from 31 individual studies. Using neighbor voting in cross-validation, we find that single-cell network connectivity is less likely to overlap with known functions than co-expression derived from bulk data, with functional variation within cell types strongly resembling that also occurring across cell types. To identify features and analysis practices that contribute to this connectivity, we perform our own single-cell RNA-seq experiment of 126 cortical interneurons in an experimental design targeted to co-expression. By assessing network replicability, semantic similarity and overall functional connectivity, we identify technical factors influencing co-expression and suggest how they can be controlled for. Many of the technical effects we identify are expression-level dependent, making expression level itself highly predictive of network topology. We show this occurs generally through re-analysis of the BrainSpan RNA-seq data. CONCLUSIONS: Technical properties of single-cell RNA-seq data create confounds in co-expression networks which can be identified and explicitly controlled for in any supervised analysis. This is useful both in improving co-expression performance and in characterizing single-cell data in generally applicable terms, permitting cross-laboratory comparison within a common framework. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0964-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4862082
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48620822016-05-11 Exploiting single-cell expression to characterize co-expression replicability Crow, Megan Paul, Anirban Ballouz, Sara Huang, Z. Josh Gillis, Jesse Genome Biol Research BACKGROUND: Co-expression networks have been a useful tool for functional genomics, providing important clues about the cellular and biochemical mechanisms that are active in normal and disease processes. However, co-expression analysis is often treated as a black box with results being hard to trace to their basis in the data. Here, we use both published and novel single-cell RNA sequencing (RNA-seq) data to understand fundamental drivers of gene-gene connectivity and replicability in co-expression networks. RESULTS: We perform the first major analysis of single-cell co-expression, sampling from 31 individual studies. Using neighbor voting in cross-validation, we find that single-cell network connectivity is less likely to overlap with known functions than co-expression derived from bulk data, with functional variation within cell types strongly resembling that also occurring across cell types. To identify features and analysis practices that contribute to this connectivity, we perform our own single-cell RNA-seq experiment of 126 cortical interneurons in an experimental design targeted to co-expression. By assessing network replicability, semantic similarity and overall functional connectivity, we identify technical factors influencing co-expression and suggest how they can be controlled for. Many of the technical effects we identify are expression-level dependent, making expression level itself highly predictive of network topology. We show this occurs generally through re-analysis of the BrainSpan RNA-seq data. CONCLUSIONS: Technical properties of single-cell RNA-seq data create confounds in co-expression networks which can be identified and explicitly controlled for in any supervised analysis. This is useful both in improving co-expression performance and in characterizing single-cell data in generally applicable terms, permitting cross-laboratory comparison within a common framework. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-016-0964-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-06 /pmc/articles/PMC4862082/ /pubmed/27165153 http://dx.doi.org/10.1186/s13059-016-0964-6 Text en © Crow et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Crow, Megan
Paul, Anirban
Ballouz, Sara
Huang, Z. Josh
Gillis, Jesse
Exploiting single-cell expression to characterize co-expression replicability
title Exploiting single-cell expression to characterize co-expression replicability
title_full Exploiting single-cell expression to characterize co-expression replicability
title_fullStr Exploiting single-cell expression to characterize co-expression replicability
title_full_unstemmed Exploiting single-cell expression to characterize co-expression replicability
title_short Exploiting single-cell expression to characterize co-expression replicability
title_sort exploiting single-cell expression to characterize co-expression replicability
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4862082/
https://www.ncbi.nlm.nih.gov/pubmed/27165153
http://dx.doi.org/10.1186/s13059-016-0964-6
work_keys_str_mv AT crowmegan exploitingsinglecellexpressiontocharacterizecoexpressionreplicability
AT paulanirban exploitingsinglecellexpressiontocharacterizecoexpressionreplicability
AT ballouzsara exploitingsinglecellexpressiontocharacterizecoexpressionreplicability
AT huangzjosh exploitingsinglecellexpressiontocharacterizecoexpressionreplicability
AT gillisjesse exploitingsinglecellexpressiontocharacterizecoexpressionreplicability