Cargando…

GrandPrix: scaling up the Bayesian GPLVM for single-cell data

MOTIVATION: The Gaussian Process Latent Variable Model (GPLVM) is a popular approach for dimensionality reduction of single-cell data and has been used for pseudotime estimation with capture time information. However, current implementations are computationally intensive and will not scale up to mod...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Sumon, Rattray, Magnus, Boukouvalas, Alexis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6298059/
https://www.ncbi.nlm.nih.gov/pubmed/30561544
http://dx.doi.org/10.1093/bioinformatics/bty533
_version_ 1783381252351459328
author Ahmed, Sumon
Rattray, Magnus
Boukouvalas, Alexis
author_facet Ahmed, Sumon
Rattray, Magnus
Boukouvalas, Alexis
author_sort Ahmed, Sumon
collection PubMed
description MOTIVATION: The Gaussian Process Latent Variable Model (GPLVM) is a popular approach for dimensionality reduction of single-cell data and has been used for pseudotime estimation with capture time information. However, current implementations are computationally intensive and will not scale up to modern droplet-based single-cell datasets which routinely profile many tens of thousands of cells. RESULTS: We provide an efficient implementation which allows scaling up this approach to modern single-cell datasets. We also generalize the application of pseudotime inference to cases where there are other sources of variation such as branching dynamics. We apply our method on microarray, nCounter, RNA-seq, qPCR and droplet-based datasets from different organisms. The model converges an order of magnitude faster compared to existing methods whilst achieving similar levels of estimation accuracy. Further, we demonstrate the flexibility of our approach by extending the model to higher-dimensional latent spaces that can be used to simultaneously infer pseudotime and other structure such as branching. Thus, the model has the capability of producing meaningful biological insights about cell ordering as well as cell fate regulation. AVAILABILITY AND IMPLEMENTATION: Software available at github.com/ManchesterBioinference/GrandPrix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6298059
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62980592018-12-21 GrandPrix: scaling up the Bayesian GPLVM for single-cell data Ahmed, Sumon Rattray, Magnus Boukouvalas, Alexis Bioinformatics Original Papers MOTIVATION: The Gaussian Process Latent Variable Model (GPLVM) is a popular approach for dimensionality reduction of single-cell data and has been used for pseudotime estimation with capture time information. However, current implementations are computationally intensive and will not scale up to modern droplet-based single-cell datasets which routinely profile many tens of thousands of cells. RESULTS: We provide an efficient implementation which allows scaling up this approach to modern single-cell datasets. We also generalize the application of pseudotime inference to cases where there are other sources of variation such as branching dynamics. We apply our method on microarray, nCounter, RNA-seq, qPCR and droplet-based datasets from different organisms. The model converges an order of magnitude faster compared to existing methods whilst achieving similar levels of estimation accuracy. Further, we demonstrate the flexibility of our approach by extending the model to higher-dimensional latent spaces that can be used to simultaneously infer pseudotime and other structure such as branching. Thus, the model has the capability of producing meaningful biological insights about cell ordering as well as cell fate regulation. AVAILABILITY AND IMPLEMENTATION: Software available at github.com/ManchesterBioinference/GrandPrix. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-01-01 2018-07-02 /pmc/articles/PMC6298059/ /pubmed/30561544 http://dx.doi.org/10.1093/bioinformatics/bty533 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Ahmed, Sumon
Rattray, Magnus
Boukouvalas, Alexis
GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title_full GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title_fullStr GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title_full_unstemmed GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title_short GrandPrix: scaling up the Bayesian GPLVM for single-cell data
title_sort grandprix: scaling up the bayesian gplvm for single-cell data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6298059/
https://www.ncbi.nlm.nih.gov/pubmed/30561544
http://dx.doi.org/10.1093/bioinformatics/bty533
work_keys_str_mv AT ahmedsumon grandprixscalingupthebayesiangplvmforsinglecelldata
AT rattraymagnus grandprixscalingupthebayesiangplvmforsinglecelldata
AT boukouvalasalexis grandprixscalingupthebayesiangplvmforsinglecelldata