Cargando…

MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data

BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULT...

Descripción completa

Detalles Bibliográficos
Autores principales: Feigelman, Justin, Theis, Fabian J, Marr, Carsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227291/
https://www.ncbi.nlm.nih.gov/pubmed/25015590
http://dx.doi.org/10.1186/1471-2105-15-240
_version_ 1782343777431584768
author Feigelman, Justin
Theis, Fabian J
Marr, Carsten
author_facet Feigelman, Justin
Theis, Fabian J
Marr, Carsten
author_sort Feigelman, Justin
collection PubMed
description BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULTS: We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations. CONCLUSIONS: MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation.
format Online
Article
Text
id pubmed-4227291
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42272912014-11-12 MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data Feigelman, Justin Theis, Fabian J Marr, Carsten BMC Bioinformatics Research Article BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULTS: We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations. CONCLUSIONS: MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation. BioMed Central 2014-07-11 /pmc/articles/PMC4227291/ /pubmed/25015590 http://dx.doi.org/10.1186/1471-2105-15-240 Text en Copyright © 2014 Feigelman et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Feigelman, Justin
Theis, Fabian J
Marr, Carsten
MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title_full MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title_fullStr MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title_full_unstemmed MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title_short MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
title_sort mca: multiresolution correlation analysis, a graphical tool for subpopulation identification in single-cell gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227291/
https://www.ncbi.nlm.nih.gov/pubmed/25015590
http://dx.doi.org/10.1186/1471-2105-15-240
work_keys_str_mv AT feigelmanjustin mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata
AT theisfabianj mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata
AT marrcarsten mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata