Cargando…

scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data

MOTIVATION: Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the da...

Descripción completa

Detalles Bibliográficos
Autores principales: Smolander, Johannes, Junttila, Sini, Venäläinen, Mikko S, Elo, Laura L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8825760/
https://www.ncbi.nlm.nih.gov/pubmed/34888622
http://dx.doi.org/10.1093/bioinformatics/btab831
_version_ 1784647286235594752
author Smolander, Johannes
Junttila, Sini
Venäläinen, Mikko S
Elo, Laura L
author_facet Smolander, Johannes
Junttila, Sini
Venäläinen, Mikko S
Elo, Laura L
author_sort Smolander, Johannes
collection PubMed
description MOTIVATION: Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the dataset and hence there is a need to establish more accurate, better generalizable methods. RESULTS: We introduce scShaper, a new trajectory inference method that enables accurate linear trajectory inference. The ensemble approach of scShaper generates a continuous smooth pseudotime based on a set of discrete pseudotimes. We demonstrate that scShaper is able to infer accurate trajectories for a variety of trigonometric trajectories, including many for which the commonly used principal curves method fails. A comprehensive benchmarking with state-of-the-art methods revealed that scShaper achieved superior accuracy of the cell ordering and, in particular, the differentially expressed genes. Moreover, scShaper is a fast method with few hyperparameters, making it a promising alternative to the principal curves method for linear pseudotemporal ordering. AVAILABILITY AND IMPLEMENTATION: scShaper is available as an R package at https://github.com/elolab/scshaper. The test data are available at https://doi.org/10.5281/zenodo.5734488. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8825760
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88257602022-02-09 scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data Smolander, Johannes Junttila, Sini Venäläinen, Mikko S Elo, Laura L Bioinformatics Original Papers MOTIVATION: Computational models are needed to infer a representation of the cells, i.e. a trajectory, from single-cell RNA-sequencing data that model cell differentiation during a dynamic process. Although many trajectory inference methods exist, their performance varies greatly depending on the dataset and hence there is a need to establish more accurate, better generalizable methods. RESULTS: We introduce scShaper, a new trajectory inference method that enables accurate linear trajectory inference. The ensemble approach of scShaper generates a continuous smooth pseudotime based on a set of discrete pseudotimes. We demonstrate that scShaper is able to infer accurate trajectories for a variety of trigonometric trajectories, including many for which the commonly used principal curves method fails. A comprehensive benchmarking with state-of-the-art methods revealed that scShaper achieved superior accuracy of the cell ordering and, in particular, the differentially expressed genes. Moreover, scShaper is a fast method with few hyperparameters, making it a promising alternative to the principal curves method for linear pseudotemporal ordering. AVAILABILITY AND IMPLEMENTATION: scShaper is available as an R package at https://github.com/elolab/scshaper. The test data are available at https://doi.org/10.5281/zenodo.5734488. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-12-09 /pmc/articles/PMC8825760/ /pubmed/34888622 http://dx.doi.org/10.1093/bioinformatics/btab831 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Smolander, Johannes
Junttila, Sini
Venäläinen, Mikko S
Elo, Laura L
scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title_full scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title_fullStr scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title_full_unstemmed scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title_short scShaper: an ensemble method for fast and accurate linear trajectory inference from single-cell RNA-seq data
title_sort scshaper: an ensemble method for fast and accurate linear trajectory inference from single-cell rna-seq data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8825760/
https://www.ncbi.nlm.nih.gov/pubmed/34888622
http://dx.doi.org/10.1093/bioinformatics/btab831
work_keys_str_mv AT smolanderjohannes scshaperanensemblemethodforfastandaccuratelineartrajectoryinferencefromsinglecellrnaseqdata
AT junttilasini scshaperanensemblemethodforfastandaccuratelineartrajectoryinferencefromsinglecellrnaseqdata
AT venalainenmikkos scshaperanensemblemethodforfastandaccuratelineartrajectoryinferencefromsinglecellrnaseqdata
AT elolaural scshaperanensemblemethodforfastandaccuratelineartrajectoryinferencefromsinglecellrnaseqdata