Cargando…

scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data

MOTIVATION: Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the da...

Descripción completa

Detalles Bibliográficos
Autores principales: Smolander, Johannes, Junttila, Sini, Venäläinen, Mikko S, Elo, Laura L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8825760/
https://www.ncbi.nlm.nih.gov/pubmed/34888622
http://dx.doi.org/10.1093/bioinformatics/btab831
Descripción
Sumario:MOTIVATION: Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the dataset and hence there is a need to establish more accurate, better generalizable methods. RESULTS: We introduce scShaper, a new trajectory inference method that enables accurate linear trajectory inference. The ensemble approach of scShaper generates a continuous smooth pseudotime based on a set of discrete pseudotimes. We demonstrate that scShaper is able to infer accurate trajectories for a variety of trigonometric trajectories, including many for which the commonly used principal curves method fails. A comprehensive benchmarking with state-of-the-art methods revealed that scShaper achieved superior accuracy of the cell ordering and, in particular, the differentially expressed genes. Moreover, scShaper is a fast method with few hyperparameters, making it a promising alternative to the principal curves method for linear pseudotemporal ordering. AVAILABILITY AND IMPLEMENTATION: scShaper is available as an R package at https://github.com/elolab/scshaper. The test data are available at https://doi.org/10.5281/zenodo.5734488. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.