Cargando…
pcaReduce: hierarchical clustering of single cell transcriptional profiles
BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4802652/ https://www.ncbi.nlm.nih.gov/pubmed/27005807 http://dx.doi.org/10.1186/s12859-016-0984-y |
_version_ | 1782422765101383680 |
---|---|
author | žurauskienė, Justina Yau, Christopher |
author_facet | žurauskienė, Justina Yau, Christopher |
author_sort | žurauskienė, Justina |
collection | PubMed |
description | BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular states or tissue composition. To date, classical statistical analysis tools have being routinely applied, but there is considerable scope for the development of novel statistical approaches that are better adapted to the challenges of inferring cellular hierarchies. RESULTS: We have developed a novel agglomerative clustering method that we call pcaReduce to generate a cell state hierarchy where each cluster branch is associated with a principal component of variation that can be used to differentiate two cell states. Using two real single cell datasets, we compared our approach to other commonly used statistical techniques, such as K-means and hierarchical clustering. We found that pcaReduce was able to give more consistent clustering structures when compared to broad and detailed cell type labels. CONCLUSIONS: Our novel integration of principal components analysis and hierarchical clustering establishes a connection between the representation of the expression data and the number of cell types that can be discovered. In doing so we found that pcaReduce performs better than either technique in isolation in terms of characterising putative cell states. Our methodology is complimentary to other single cell clustering techniques and adds to a growing palette of single cell bioinformatics tools for profiling heterogeneous cell populations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0984-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4802652 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-48026522016-03-22 pcaReduce: hierarchical clustering of single cell transcriptional profiles žurauskienė, Justina Yau, Christopher BMC Bioinformatics Methodology Article BACKGROUND: Advances in single cell genomics provide a way of routinely generating transcriptomics data at the single cell level. A frequent requirement of single cell expression analysis is the identification of novel patterns of heterogeneity across single cells that might explain complex cellular states or tissue composition. To date, classical statistical analysis tools have being routinely applied, but there is considerable scope for the development of novel statistical approaches that are better adapted to the challenges of inferring cellular hierarchies. RESULTS: We have developed a novel agglomerative clustering method that we call pcaReduce to generate a cell state hierarchy where each cluster branch is associated with a principal component of variation that can be used to differentiate two cell states. Using two real single cell datasets, we compared our approach to other commonly used statistical techniques, such as K-means and hierarchical clustering. We found that pcaReduce was able to give more consistent clustering structures when compared to broad and detailed cell type labels. CONCLUSIONS: Our novel integration of principal components analysis and hierarchical clustering establishes a connection between the representation of the expression data and the number of cell types that can be discovered. In doing so we found that pcaReduce performs better than either technique in isolation in terms of characterising putative cell states. Our methodology is complimentary to other single cell clustering techniques and adds to a growing palette of single cell bioinformatics tools for profiling heterogeneous cell populations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0984-y) contains supplementary material, which is available to authorized users. BioMed Central 2016-03-22 /pmc/articles/PMC4802652/ /pubmed/27005807 http://dx.doi.org/10.1186/s12859-016-0984-y Text en © žurauskienė and Yau. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article žurauskienė, Justina Yau, Christopher pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title | pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title_full | pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title_fullStr | pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title_full_unstemmed | pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title_short | pcaReduce: hierarchical clustering of single cell transcriptional profiles |
title_sort | pcareduce: hierarchical clustering of single cell transcriptional profiles |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4802652/ https://www.ncbi.nlm.nih.gov/pubmed/27005807 http://dx.doi.org/10.1186/s12859-016-0984-y |
work_keys_str_mv | AT zurauskienejustina pcareducehierarchicalclusteringofsinglecelltranscriptionalprofiles AT yauchristopher pcareducehierarchicalclusteringofsinglecelltranscriptionalprofiles |