Cargando…

Preprocessing choices affect RNA velocity results for droplet scRNA-seq data

Experimental single-cell approaches are becoming widely used for many purposes, including investigation of the dynamic behaviour of developing biological systems. Consequently, a large number of computational methods for extracting dynamic information from such data have been developed. One example...

Descripción completa

Detalles Bibliográficos
Autores principales: Soneson, Charlotte, Srivastava, Avi, Patro, Rob, Stadler, Michael B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822509/
https://www.ncbi.nlm.nih.gov/pubmed/33428615
http://dx.doi.org/10.1371/journal.pcbi.1008585
_version_ 1783639653279072256
author Soneson, Charlotte
Srivastava, Avi
Patro, Rob
Stadler, Michael B.
author_facet Soneson, Charlotte
Srivastava, Avi
Patro, Rob
Stadler, Michael B.
author_sort Soneson, Charlotte
collection PubMed
description Experimental single-cell approaches are becoming widely used for many purposes, including investigation of the dynamic behaviour of developing biological systems. Consequently, a large number of computational methods for extracting dynamic information from such data have been developed. One example is RNA velocity analysis, in which spliced and unspliced RNA abundances are jointly modeled in order to infer a ‘direction of change’ and thereby a future state for each cell in the gene expression space. Naturally, the accuracy and interpretability of the inferred RNA velocities depend crucially on the correctness of the estimated abundances. Here, we systematically compare five widely used quantification tools, in total yielding thirteen different quantification approaches, in terms of their estimates of spliced and unspliced RNA abundances in five experimental droplet scRNA-seq data sets. We show that there are substantial differences between the quantifications obtained from different tools, and identify typical genes for which such discrepancies are observed. We further show that these abundance differences propagate to the downstream analysis, and can have a large effect on estimated velocities as well as the biological interpretation. Our results highlight that abundance quantification is a crucial aspect of the RNA velocity analysis workflow, and that both the definition of the genomic features of interest and the quantification algorithm itself require careful consideration.
format Online
Article
Text
id pubmed-7822509
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-78225092021-01-29 Preprocessing choices affect RNA velocity results for droplet scRNA-seq data Soneson, Charlotte Srivastava, Avi Patro, Rob Stadler, Michael B. PLoS Comput Biol Research Article Experimental single-cell approaches are becoming widely used for many purposes, including investigation of the dynamic behaviour of developing biological systems. Consequently, a large number of computational methods for extracting dynamic information from such data have been developed. One example is RNA velocity analysis, in which spliced and unspliced RNA abundances are jointly modeled in order to infer a ‘direction of change’ and thereby a future state for each cell in the gene expression space. Naturally, the accuracy and interpretability of the inferred RNA velocities depend crucially on the correctness of the estimated abundances. Here, we systematically compare five widely used quantification tools, in total yielding thirteen different quantification approaches, in terms of their estimates of spliced and unspliced RNA abundances in five experimental droplet scRNA-seq data sets. We show that there are substantial differences between the quantifications obtained from different tools, and identify typical genes for which such discrepancies are observed. We further show that these abundance differences propagate to the downstream analysis, and can have a large effect on estimated velocities as well as the biological interpretation. Our results highlight that abundance quantification is a crucial aspect of the RNA velocity analysis workflow, and that both the definition of the genomic features of interest and the quantification algorithm itself require careful consideration. Public Library of Science 2021-01-11 /pmc/articles/PMC7822509/ /pubmed/33428615 http://dx.doi.org/10.1371/journal.pcbi.1008585 Text en © 2021 Soneson et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Soneson, Charlotte
Srivastava, Avi
Patro, Rob
Stadler, Michael B.
Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title_full Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title_fullStr Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title_full_unstemmed Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title_short Preprocessing choices affect RNA velocity results for droplet scRNA-seq data
title_sort preprocessing choices affect rna velocity results for droplet scrna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7822509/
https://www.ncbi.nlm.nih.gov/pubmed/33428615
http://dx.doi.org/10.1371/journal.pcbi.1008585
work_keys_str_mv AT sonesoncharlotte preprocessingchoicesaffectrnavelocityresultsfordropletscrnaseqdata
AT srivastavaavi preprocessingchoicesaffectrnavelocityresultsfordropletscrnaseqdata
AT patrorob preprocessingchoicesaffectrnavelocityresultsfordropletscrnaseqdata
AT stadlermichaelb preprocessingchoicesaffectrnavelocityresultsfordropletscrnaseqdata