Cargando…
Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates
BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estim...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601342/ https://www.ncbi.nlm.nih.gov/pubmed/37885016 http://dx.doi.org/10.1186/s13059-023-03065-x |
_version_ | 1785126180655988736 |
---|---|
author | Zheng, Shijie C. Stein-O’Brien, Genevieve Boukas, Leandros Goff, Loyal A. Hansen, Kasper D. |
author_facet | Zheng, Shijie C. Stein-O’Brien, Genevieve Boukas, Leandros Goff, Loyal A. Hansen, Kasper D. |
author_sort | Zheng, Shijie C. |
collection | PubMed |
description | BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estimates are still not well understood. RESULTS: We analyze the impact of different steps in the RNA velocity workflow on direction and speed. We consider both high-dimensional velocity estimates and low-dimensional velocity vector fields mapped onto an embedding. We conclude the transition probability method for mapping velocity estimates onto an embedding is effectively interpolating in the embedding space. Our findings reveal a significant dependence of the RNA velocity workflow on smoothing via the k-nearest-neighbors (k-NN) graph of the observed data. This reliance results in considerable estimation errors for both direction and speed in both high- and low-dimensional settings when the k-NN graph fails to accurately represent the true data structure; this is an unknown feature of real data. RNA velocity performs poorly at estimating speed in both low- and high-dimensional spaces, except in very low noise settings. We introduce a novel quality measure that can identify when RNA velocity should not be used. CONCLUSIONS: Our findings emphasize the importance of choices in the RNA velocity workflow and highlight critical limitations of data analysis. We advise against over-interpreting expression dynamics using RNA velocity, particularly in terms of speed. Finally, we emphasize that the use of RNA velocity in assessing the correctness of a low-dimensional embedding is circular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03065-x. |
format | Online Article Text |
id | pubmed-10601342 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-106013422023-10-27 Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates Zheng, Shijie C. Stein-O’Brien, Genevieve Boukas, Leandros Goff, Loyal A. Hansen, Kasper D. Genome Biol Research BACKGROUND: RNA velocity analysis of single cells offers the potential to predict temporal dynamics from gene expression. In many systems, RNA velocity has been observed to produce a vector field that qualitatively reflects known features of the system. However, the limitations of RNA velocity estimates are still not well understood. RESULTS: We analyze the impact of different steps in the RNA velocity workflow on direction and speed. We consider both high-dimensional velocity estimates and low-dimensional velocity vector fields mapped onto an embedding. We conclude the transition probability method for mapping velocity estimates onto an embedding is effectively interpolating in the embedding space. Our findings reveal a significant dependence of the RNA velocity workflow on smoothing via the k-nearest-neighbors (k-NN) graph of the observed data. This reliance results in considerable estimation errors for both direction and speed in both high- and low-dimensional settings when the k-NN graph fails to accurately represent the true data structure; this is an unknown feature of real data. RNA velocity performs poorly at estimating speed in both low- and high-dimensional spaces, except in very low noise settings. We introduce a novel quality measure that can identify when RNA velocity should not be used. CONCLUSIONS: Our findings emphasize the importance of choices in the RNA velocity workflow and highlight critical limitations of data analysis. We advise against over-interpreting expression dynamics using RNA velocity, particularly in terms of speed. Finally, we emphasize that the use of RNA velocity in assessing the correctness of a low-dimensional embedding is circular. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03065-x. BioMed Central 2023-10-26 /pmc/articles/PMC10601342/ /pubmed/37885016 http://dx.doi.org/10.1186/s13059-023-03065-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zheng, Shijie C. Stein-O’Brien, Genevieve Boukas, Leandros Goff, Loyal A. Hansen, Kasper D. Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title | Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title_full | Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title_fullStr | Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title_full_unstemmed | Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title_short | Pumping the brakes on RNA velocity by understanding and interpreting RNA velocity estimates |
title_sort | pumping the brakes on rna velocity by understanding and interpreting rna velocity estimates |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10601342/ https://www.ncbi.nlm.nih.gov/pubmed/37885016 http://dx.doi.org/10.1186/s13059-023-03065-x |
work_keys_str_mv | AT zhengshijiec pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates AT steinobriengenevieve pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates AT boukasleandros pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates AT goffloyala pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates AT hansenkasperd pumpingthebrakesonrnavelocitybyunderstandingandinterpretingrnavelocityestimates |