Cargando…

Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways

Experimental procedures for preparing RNA-seq and single-cell (sc) RNA-seq libraries are based on assumptions regarding their underlying enzymatic reactions. Here, we show that the fairness of these assumptions varies within libraries: coverage by sequencing reads along and between transcripts exhib...

Descripción completa

Detalles Bibliográficos
Autores principales: Archer, Nathan, Walsh, Mark D., Shahrezaei, Vahid, Hebenstreit, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cell Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167349/
https://www.ncbi.nlm.nih.gov/pubmed/27840077
http://dx.doi.org/10.1016/j.cels.2016.10.012
_version_ 1782483163605368832
author Archer, Nathan
Walsh, Mark D.
Shahrezaei, Vahid
Hebenstreit, Daniel
author_facet Archer, Nathan
Walsh, Mark D.
Shahrezaei, Vahid
Hebenstreit, Daniel
author_sort Archer, Nathan
collection PubMed
description Experimental procedures for preparing RNA-seq and single-cell (sc) RNA-seq libraries are based on assumptions regarding their underlying enzymatic reactions. Here, we show that the fairness of these assumptions varies within libraries: coverage by sequencing reads along and between transcripts exhibits characteristic, protocol-dependent biases. To understand the mechanistic basis of this bias, we present an integrated modeling framework that infers the relationship between enzyme reactions during library preparation and the characteristic coverage patterns observed for different protocols. Analysis of new and existing (sc)RNA-seq data from six different library preparation protocols reveals that polymerase processivity is the mechanistic origin of coverage biases. We apply our framework to demonstrate that lowering incubation temperature increases processivity, yield, and (sc)RNA-seq sensitivity in all protocols. We also provide correction factors based on our model for increasing accuracy of transcript quantification in existing samples prepared at standard temperatures. In total, our findings improve our ability to accurately reflect in vivo transcript abundances in (sc)RNA-seq libraries.
format Online
Article
Text
id pubmed-5167349
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Cell Press
record_format MEDLINE/PubMed
spelling pubmed-51673492016-12-22 Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways Archer, Nathan Walsh, Mark D. Shahrezaei, Vahid Hebenstreit, Daniel Cell Syst Article Experimental procedures for preparing RNA-seq and single-cell (sc) RNA-seq libraries are based on assumptions regarding their underlying enzymatic reactions. Here, we show that the fairness of these assumptions varies within libraries: coverage by sequencing reads along and between transcripts exhibits characteristic, protocol-dependent biases. To understand the mechanistic basis of this bias, we present an integrated modeling framework that infers the relationship between enzyme reactions during library preparation and the characteristic coverage patterns observed for different protocols. Analysis of new and existing (sc)RNA-seq data from six different library preparation protocols reveals that polymerase processivity is the mechanistic origin of coverage biases. We apply our framework to demonstrate that lowering incubation temperature increases processivity, yield, and (sc)RNA-seq sensitivity in all protocols. We also provide correction factors based on our model for increasing accuracy of transcript quantification in existing samples prepared at standard temperatures. In total, our findings improve our ability to accurately reflect in vivo transcript abundances in (sc)RNA-seq libraries. Cell Press 2016-11-23 /pmc/articles/PMC5167349/ /pubmed/27840077 http://dx.doi.org/10.1016/j.cels.2016.10.012 Text en © 2016 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Archer, Nathan
Walsh, Mark D.
Shahrezaei, Vahid
Hebenstreit, Daniel
Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title_full Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title_fullStr Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title_full_unstemmed Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title_short Modeling Enzyme Processivity Reveals that RNA-Seq Libraries Are Biased in Characteristic and Correctable Ways
title_sort modeling enzyme processivity reveals that rna-seq libraries are biased in characteristic and correctable ways
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5167349/
https://www.ncbi.nlm.nih.gov/pubmed/27840077
http://dx.doi.org/10.1016/j.cels.2016.10.012
work_keys_str_mv AT archernathan modelingenzymeprocessivityrevealsthatrnaseqlibrariesarebiasedincharacteristicandcorrectableways
AT walshmarkd modelingenzymeprocessivityrevealsthatrnaseqlibrariesarebiasedincharacteristicandcorrectableways
AT shahrezaeivahid modelingenzymeprocessivityrevealsthatrnaseqlibrariesarebiasedincharacteristicandcorrectableways
AT hebenstreitdaniel modelingenzymeprocessivityrevealsthatrnaseqlibrariesarebiasedincharacteristicandcorrectableways