Cargando…

Length biases in single-cell RNA sequencing of pre-mRNA

Single-cell RNA sequencing data can be modeled using Markov chains to yield genome-wide insights into transcriptional physics. However, quantitative inference with such data requires careful assessment of noise sources. We find that long pre-mRNA transcripts are over-represented in sequencing data....

Descripción completa

Detalles Bibliográficos
Autores principales: Gorin, Gennady, Pachter, Lior
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9843228/
https://www.ncbi.nlm.nih.gov/pubmed/36660179
http://dx.doi.org/10.1016/j.bpr.2022.100097
_version_ 1784870343156957184
author Gorin, Gennady
Pachter, Lior
author_facet Gorin, Gennady
Pachter, Lior
author_sort Gorin, Gennady
collection PubMed
description Single-cell RNA sequencing data can be modeled using Markov chains to yield genome-wide insights into transcriptional physics. However, quantitative inference with such data requires careful assessment of noise sources. We find that long pre-mRNA transcripts are over-represented in sequencing data. To explain this trend, we propose a length-based model of capture bias, which may produce false-positive observations. We solve this model and use it to find concordant parameter trends as well as systematic, mechanistically interpretable technical and biological differences in paired data sets.
format Online
Article
Text
id pubmed-9843228
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-98432282023-01-18 Length biases in single-cell RNA sequencing of pre-mRNA Gorin, Gennady Pachter, Lior Biophys Rep (N Y) Article Single-cell RNA sequencing data can be modeled using Markov chains to yield genome-wide insights into transcriptional physics. However, quantitative inference with such data requires careful assessment of noise sources. We find that long pre-mRNA transcripts are over-represented in sequencing data. To explain this trend, we propose a length-based model of capture bias, which may produce false-positive observations. We solve this model and use it to find concordant parameter trends as well as systematic, mechanistically interpretable technical and biological differences in paired data sets. Elsevier 2022-12-27 /pmc/articles/PMC9843228/ /pubmed/36660179 http://dx.doi.org/10.1016/j.bpr.2022.100097 Text en © 2022. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Gorin, Gennady
Pachter, Lior
Length biases in single-cell RNA sequencing of pre-mRNA
title Length biases in single-cell RNA sequencing of pre-mRNA
title_full Length biases in single-cell RNA sequencing of pre-mRNA
title_fullStr Length biases in single-cell RNA sequencing of pre-mRNA
title_full_unstemmed Length biases in single-cell RNA sequencing of pre-mRNA
title_short Length biases in single-cell RNA sequencing of pre-mRNA
title_sort length biases in single-cell rna sequencing of pre-mrna
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9843228/
https://www.ncbi.nlm.nih.gov/pubmed/36660179
http://dx.doi.org/10.1016/j.bpr.2022.100097
work_keys_str_mv AT goringennady lengthbiasesinsinglecellrnasequencingofpremrna
AT pachterlior lengthbiasesinsinglecellrnasequencingofpremrna