Cargando…

Pseudotime estimation: deconfounding single cell time series

Motivation: Repeated cross-sectional time series single cell data confound several sources of variation, with contributions from measurement noise, stochastic cell-to-cell variation and cell progression at different rates. Time series from single cell assays are particularly susceptible to confoundi...

Descripción completa

Detalles Bibliográficos
Autores principales: Reid, John E., Wernisch, Lorenz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039927/
https://www.ncbi.nlm.nih.gov/pubmed/27318198
http://dx.doi.org/10.1093/bioinformatics/btw372
_version_ 1782456150725230592
author Reid, John E.
Wernisch, Lorenz
author_facet Reid, John E.
Wernisch, Lorenz
author_sort Reid, John E.
collection PubMed
description Motivation: Repeated cross-sectional time series single cell data confound several sources of variation, with contributions from measurement noise, stochastic cell-to-cell variation and cell progression at different rates. Time series from single cell assays are particularly susceptible to confounding as the measurements are not averaged over populations of cells. When several genes are assayed in parallel these effects can be estimated and corrected for under certain smoothness assumptions on cell progression. Results: We present a principled probabilistic model with a Bayesian inference scheme to analyse such data. We demonstrate our method’s utility on public microarray, nCounter and RNA-seq datasets from three organisms. Our method almost perfectly recovers withheld capture times in an Arabidopsis dataset, it accurately estimates cell cycle peak times in a human prostate cancer cell line and it correctly identifies two precocious cells in a study of paracrine signalling in mouse dendritic cells. Furthermore, our method compares favourably with Monocle, a state-of-the-art technique. We also show using held-out data that uncertainty in the temporal dimension is a common confounder and should be accounted for in analyses of repeated cross-sectional time series. Availability and Implementation: Our method is available on CRAN in the DeLorean package. Contact: john.reid@mrc-bsu.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5039927
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-50399272016-09-29 Pseudotime estimation: deconfounding single cell time series Reid, John E. Wernisch, Lorenz Bioinformatics Original Papers Motivation: Repeated cross-sectional time series single cell data confound several sources of variation, with contributions from measurement noise, stochastic cell-to-cell variation and cell progression at different rates. Time series from single cell assays are particularly susceptible to confounding as the measurements are not averaged over populations of cells. When several genes are assayed in parallel these effects can be estimated and corrected for under certain smoothness assumptions on cell progression. Results: We present a principled probabilistic model with a Bayesian inference scheme to analyse such data. We demonstrate our method’s utility on public microarray, nCounter and RNA-seq datasets from three organisms. Our method almost perfectly recovers withheld capture times in an Arabidopsis dataset, it accurately estimates cell cycle peak times in a human prostate cancer cell line and it correctly identifies two precocious cells in a study of paracrine signalling in mouse dendritic cells. Furthermore, our method compares favourably with Monocle, a state-of-the-art technique. We also show using held-out data that uncertainty in the temporal dimension is a common confounder and should be accounted for in analyses of repeated cross-sectional time series. Availability and Implementation: Our method is available on CRAN in the DeLorean package. Contact: john.reid@mrc-bsu.cam.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-10-01 2016-06-17 /pmc/articles/PMC5039927/ /pubmed/27318198 http://dx.doi.org/10.1093/bioinformatics/btw372 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Reid, John E.
Wernisch, Lorenz
Pseudotime estimation: deconfounding single cell time series
title Pseudotime estimation: deconfounding single cell time series
title_full Pseudotime estimation: deconfounding single cell time series
title_fullStr Pseudotime estimation: deconfounding single cell time series
title_full_unstemmed Pseudotime estimation: deconfounding single cell time series
title_short Pseudotime estimation: deconfounding single cell time series
title_sort pseudotime estimation: deconfounding single cell time series
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039927/
https://www.ncbi.nlm.nih.gov/pubmed/27318198
http://dx.doi.org/10.1093/bioinformatics/btw372
work_keys_str_mv AT reidjohne pseudotimeestimationdeconfoundingsinglecelltimeseries
AT wernischlorenz pseudotimeestimationdeconfoundingsinglecelltimeseries