Cargando…
MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data
BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULT...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227291/ https://www.ncbi.nlm.nih.gov/pubmed/25015590 http://dx.doi.org/10.1186/1471-2105-15-240 |
_version_ | 1782343777431584768 |
---|---|
author | Feigelman, Justin Theis, Fabian J Marr, Carsten |
author_facet | Feigelman, Justin Theis, Fabian J Marr, Carsten |
author_sort | Feigelman, Justin |
collection | PubMed |
description | BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULTS: We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations. CONCLUSIONS: MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation. |
format | Online Article Text |
id | pubmed-4227291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42272912014-11-12 MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data Feigelman, Justin Theis, Fabian J Marr, Carsten BMC Bioinformatics Research Article BACKGROUND: Biological data often originate from samples containing mixtures of subpopulations, corresponding e.g. to distinct cellular phenotypes. However, identification of distinct subpopulations may be difficult if biological measurements yield distributions that are not easily separable. RESULTS: We present Multiresolution Correlation Analysis (MCA), a method for visually identifying subpopulations based on the local pairwise correlation between covariates, without needing to define an a priori interaction scale. We demonstrate that MCA facilitates the identification of differentially regulated subpopulations in simulated data from a small gene regulatory network, followed by application to previously published single-cell qPCR data from mouse embryonic stem cells. We show that MCA recovers previously identified subpopulations, provides additional insight into the underlying correlation structure, reveals potentially spurious compartmentalizations, and provides insight into novel subpopulations. CONCLUSIONS: MCA is a useful method for the identification of subpopulations in low-dimensional expression data, as emerging from qPCR or FACS measurements. With MCA it is possible to investigate the robustness of covariate correlations with respect subpopulations, graphically identify outliers, and identify factors contributing to differential regulation between pairs of covariates. MCA thus provides a framework for investigation of expression correlations for genes of interests and biological hypothesis generation. BioMed Central 2014-07-11 /pmc/articles/PMC4227291/ /pubmed/25015590 http://dx.doi.org/10.1186/1471-2105-15-240 Text en Copyright © 2014 Feigelman et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Feigelman, Justin Theis, Fabian J Marr, Carsten MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title | MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title_full | MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title_fullStr | MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title_full_unstemmed | MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title_short | MCA: Multiresolution Correlation Analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
title_sort | mca: multiresolution correlation analysis, a graphical tool for subpopulation identification in single-cell gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4227291/ https://www.ncbi.nlm.nih.gov/pubmed/25015590 http://dx.doi.org/10.1186/1471-2105-15-240 |
work_keys_str_mv | AT feigelmanjustin mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata AT theisfabianj mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata AT marrcarsten mcamultiresolutioncorrelationanalysisagraphicaltoolforsubpopulationidentificationinsinglecellgeneexpressiondata |