Cargando…

pcaReduce: hierarchical clustering of single cell transcriptional profiles

BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular...

Descripción completa

Detalles Bibliográficos
Autores principales: žurauskienė, Justina, Yau, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4802652/
https://www.ncbi.nlm.nih.gov/pubmed/27005807
http://dx.doi.org/10.1186/s12859-016-0984-y
_version_ 1782422765101383680
author žurauskienė, Justina
Yau, Christopher
author_facet žurauskienė, Justina
Yau, Christopher
author_sort žurauskienė, Justina
collection PubMed
description BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular states or tissue composition. To date, classical statistical analysis tools have being routinely applied, but there is considerable scope for the development of novel statistical approaches that are better adapted to the challenges of inferring cellular hierarchies. RESULTS: We have developed a novel agglomerative clustering method that we call pcaReduce to generate a cell state hierarchy where each cluster branch is associated with a principal component of variation that can be used to differentiate two cell states. Using two real single cell datasets, we compared our approach to other commonly used statistical techniques, such as K-means and hierarchical clustering. We found that pcaReduce was able to give more consistent clustering structures when compared to broad and detailed cell type labels. CONCLUSIONS: Our novel integration of principal components analysis and hierarchical clustering establishes a connection between the representation of the expression data and the number of cell types that can be discovered. In doing so we found that pcaReduce performs better than either technique in isolation in terms of characterising putative cell states. Our methodology is complimentary to other single cell clustering techniques and adds to a growing palette of single cell bioinformatics tools for profiling heterogeneous cell populations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0984-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4802652
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48026522016-03-22 pcaReduce: hierarchical clustering of single cell transcriptional profiles žurauskienė, Justina Yau, Christopher BMC Bioinformatics Methodology Article BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular states or tissue composition. To date, classical statistical analysis tools have being routinely applied, but there is considerable scope for the development of novel statistical approaches that are better adapted to the challenges of inferring cellular hierarchies. RESULTS: We have developed a novel agglomerative clustering method that we call pcaReduce to generate a cell state hierarchy where each cluster branch is associated with a principal component of variation that can be used to differentiate two cell states. Using two real single cell datasets, we compared our approach to other commonly used statistical techniques, such as K-means and hierarchical clustering. We found that pcaReduce was able to give more consistent clustering structures when compared to broad and detailed cell type labels. CONCLUSIONS: Our novel integration of principal components analysis and hierarchical clustering establishes a connection between the representation of the expression data and the number of cell types that can be discovered. In doing so we found that pcaReduce performs better than either technique in isolation in terms of characterising putative cell states. Our methodology is complimentary to other single cell clustering techniques and adds to a growing palette of single cell bioinformatics tools for profiling heterogeneous cell populations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0984-y) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-22 /pmc/articles/PMC4802652/ /pubmed/27005807 http://dx.doi.org/10.1186/s12859-016-0984-y Text en © žurauskienė and Yau. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
žurauskienė, Justina
Yau, Christopher
pcaReduce: hierarchical clustering of single cell transcriptional profiles
title pcaReduce: hierarchical clustering of single cell transcriptional profiles
title_full pcaReduce: hierarchical clustering of single cell transcriptional profiles
title_fullStr pcaReduce: hierarchical clustering of single cell transcriptional profiles
title_full_unstemmed pcaReduce: hierarchical clustering of single cell transcriptional profiles
title_short pcaReduce: hierarchical clustering of single cell transcriptional profiles
title_sort pcareduce: hierarchical clustering of single cell transcriptional profiles
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4802652/
https://www.ncbi.nlm.nih.gov/pubmed/27005807
http://dx.doi.org/10.1186/s12859-016-0984-y
work_keys_str_mv AT zurauskienejustina pcareducehierarchicalclusteringofsinglecelltranscriptionalprofiles
AT yauchristopher pcareducehierarchicalclusteringofsinglecelltranscriptionalprofiles