Cargando…

GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution

MOTIVATION: Many methods have been developed to cluster genes on the basis of their changes in mRNA expression over time, using bulk RNA-seq or microarray data. However, single-cell data may present a particular challenge for these algorithms, since the temporal ordering of cells is not directly obs...

Descripción completa

Detalles Bibliográficos
Autores principales: Strauss, Magdalena E, Kirk, Paul D W, Reid, John E, Wernisch, Lorenz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703763/
https://www.ncbi.nlm.nih.gov/pubmed/31608923
http://dx.doi.org/10.1093/bioinformatics/btz778
_version_ 1783616690762809344
author Strauss, Magdalena E
Kirk, Paul D W
Reid, John E
Wernisch, Lorenz
author_facet Strauss, Magdalena E
Kirk, Paul D W
Reid, John E
Wernisch, Lorenz
author_sort Strauss, Magdalena E
collection PubMed
description MOTIVATION: Many methods have been developed to cluster genes on the basis of their changes in mRNA expression over time, using bulk RNA-seq or microarray data. However, single-cell data may present a particular challenge for these algorithms, since the temporal ordering of cells is not directly observed. One way to address this is to first use pseudotime methods to order the cells, and then apply clustering techniques for time course data. However, pseudotime estimates are subject to high levels of uncertainty, and failing to account for this uncertainty is liable to lead to erroneous and/or over-confident gene clusters. RESULTS: The proposed method, GPseudoClust, is a novel approach that jointly infers pseudotemporal ordering and gene clusters, and quantifies the uncertainty in both. GPseudoClust combines a recent method for pseudotime inference with non-parametric Bayesian clustering methods, efficient Markov Chain Monte Carlo sampling and novel subsampling strategies which aid computation. We consider a broad array of simulated and experimental datasets to demonstrate the effectiveness of GPseudoClust in a range of settings. AVAILABILITY AND IMPLEMENTATION: An implementation is available on GitHub: https://github.com/magStra/nonparametricSummaryPSM and https://github.com/magStra/GPseudoClust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7703763
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77037632020-12-07 GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution Strauss, Magdalena E Kirk, Paul D W Reid, John E Wernisch, Lorenz Bioinformatics Original Papers MOTIVATION: Many methods have been developed to cluster genes on the basis of their changes in mRNA expression over time, using bulk RNA-seq or microarray data. However, single-cell data may present a particular challenge for these algorithms, since the temporal ordering of cells is not directly observed. One way to address this is to first use pseudotime methods to order the cells, and then apply clustering techniques for time course data. However, pseudotime estimates are subject to high levels of uncertainty, and failing to account for this uncertainty is liable to lead to erroneous and/or over-confident gene clusters. RESULTS: The proposed method, GPseudoClust, is a novel approach that jointly infers pseudotemporal ordering and gene clusters, and quantifies the uncertainty in both. GPseudoClust combines a recent method for pseudotime inference with non-parametric Bayesian clustering methods, efficient Markov Chain Monte Carlo sampling and novel subsampling strategies which aid computation. We consider a broad array of simulated and experimental datasets to demonstrate the effectiveness of GPseudoClust in a range of settings. AVAILABILITY AND IMPLEMENTATION: An implementation is available on GitHub: https://github.com/magStra/nonparametricSummaryPSM and https://github.com/magStra/GPseudoClust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-03 2019-10-14 /pmc/articles/PMC7703763/ /pubmed/31608923 http://dx.doi.org/10.1093/bioinformatics/btz778 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Strauss, Magdalena E
Kirk, Paul D W
Reid, John E
Wernisch, Lorenz
GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title_full GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title_fullStr GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title_full_unstemmed GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title_short GPseudoClust: deconvolution of shared pseudo-profiles at single-cell resolution
title_sort gpseudoclust: deconvolution of shared pseudo-profiles at single-cell resolution
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703763/
https://www.ncbi.nlm.nih.gov/pubmed/31608923
http://dx.doi.org/10.1093/bioinformatics/btz778
work_keys_str_mv AT straussmagdalenae gpseudoclustdeconvolutionofsharedpseudoprofilesatsinglecellresolution
AT kirkpauldw gpseudoclustdeconvolutionofsharedpseudoprofilesatsinglecellresolution
AT reidjohne gpseudoclustdeconvolutionofsharedpseudoprofilesatsinglecellresolution
AT wernischlorenz gpseudoclustdeconvolutionofsharedpseudoprofilesatsinglecellresolution