Cargando…

Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates

BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estim...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Shijie C., Stein-O’Brien, Genevieve, Boukas, Leandros, Goff, Loyal A., Hansen, Kasper D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601342/
https://www.ncbi.nlm.nih.gov/pubmed/37885016
http://dx.doi.org/10.1186/s13059-023-03065-x
_version_ 1785126180655988736
author Zheng, Shijie C.
Stein-O’Brien, Genevieve
Boukas, Leandros
Goff, Loyal A.
Hansen, Kasper D.
author_facet Zheng, Shijie C.
Stein-O’Brien, Genevieve
Boukas, Leandros
Goff, Loyal A.
Hansen, Kasper D.
author_sort Zheng, Shijie C.
collection PubMed
description BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estimates are still not well understood. RESULTS: We analyze the impact of different steps in the RNA velocity workflow on direction and speed. We consider both high-dimensional velocity estimates and low-dimensional velocity vector fields mapped onto an embedding. We conclude the transition probability method for mapping velocity estimates onto an embedding is effectively interpolating in the embedding space. Our findings reveal a significant dependence of the RNA velocity workflow on smoothing via the k-nearest-neighbors (k-NN) graph of the observed data. This reliance results in considerable estimation errors for both direction and speed in both high- and low-dimensional settings when the k-NN graph fails to accurately represent the true data structure; this is an unknown feature of real data. RNA velocity performs poorly at estimating speed in both low- and high-dimensional spaces, except in very low noise settings. We introduce a novel quality measure that can identify when RNA velocity should not be used. CONCLUSIONS: Our findings emphasize the importance of choices in the RNA velocity workflow and highlight critical limitations of data analysis. We advise against over-interpreting expression dynamics using RNA velocity, particularly in terms of speed. Finally, we emphasize that the use of RNA velocity in assessing the correctness of a low-dimensional embedding is circular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03065-x.
format Online
Article
Text
id pubmed-10601342
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-106013422023-10-27 Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates Zheng, Shijie C. Stein-O’Brien, Genevieve Boukas, Leandros Goff, Loyal A. Hansen, Kasper D. Genome Biol Research BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estimates are still not well understood. RESULTS: We analyze the impact of different steps in the RNA velocity workflow on direction and speed. We consider both high-dimensional velocity estimates and low-dimensional velocity vector fields mapped onto an embedding. We conclude the transition probability method for mapping velocity estimates onto an embedding is effectively interpolating in the embedding space. Our findings reveal a significant dependence of the RNA velocity workflow on smoothing via the k-nearest-neighbors (k-NN) graph of the observed data. This reliance results in considerable estimation errors for both direction and speed in both high- and low-dimensional settings when the k-NN graph fails to accurately represent the true data structure; this is an unknown feature of real data. RNA velocity performs poorly at estimating speed in both low- and high-dimensional spaces, except in very low noise settings. We introduce a novel quality measure that can identify when RNA velocity should not be used. CONCLUSIONS: Our findings emphasize the importance of choices in the RNA velocity workflow and highlight critical limitations of data analysis. We advise against over-interpreting expression dynamics using RNA velocity, particularly in terms of speed. Finally, we emphasize that the use of RNA velocity in assessing the correctness of a low-dimensional embedding is circular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03065-x. BioMed Central 2023-10-26 /pmc/articles/PMC10601342/ /pubmed/37885016 http://dx.doi.org/10.1186/s13059-023-03065-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zheng, Shijie C.
Stein-O’Brien, Genevieve
Boukas, Leandros
Goff, Loyal A.
Hansen, Kasper D.
Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title_full Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title_fullStr Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title_full_unstemmed Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title_short Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
title_sort pumping the brakes on rna velocity by understanding and interpreting rna velocity estimates
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601342/
https://www.ncbi.nlm.nih.gov/pubmed/37885016
http://dx.doi.org/10.1186/s13059-023-03065-x
work_keys_str_mv AT zhengshijiec pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates
AT steinobriengenevieve pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates
AT boukasleandros pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates
AT goffloyala pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates
AT hansenkasperd pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates