Cargando…

Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries

BACKGROUND: A variety of protocols exist for producing whole genome run-on transcription datasets. However, little is known about how differences between these protocols affect the signal within the resulting libraries. RESULTS: Using run-on transcription datasets generated from the same biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Hunter, Samuel, Sigauke, Rutendo F., Stanley, Jacob T., Allen, Mary A., Dowell, Robin D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8900324/
https://www.ncbi.nlm.nih.gov/pubmed/35255806
http://dx.doi.org/10.1186/s12864-022-08352-8
_version_ 1784664089273827328
author Hunter, Samuel
Sigauke, Rutendo F.
Stanley, Jacob T.
Allen, Mary A.
Dowell, Robin D.
author_facet Hunter, Samuel
Sigauke, Rutendo F.
Stanley, Jacob T.
Allen, Mary A.
Dowell, Robin D.
author_sort Hunter, Samuel
collection PubMed
description BACKGROUND: A variety of protocols exist for producing whole genome run-on transcription datasets. However, little is known about how differences between these protocols affect the signal within the resulting libraries. RESULTS: Using run-on transcription datasets generated from the same biological system, we show that a variety of GRO- and PRO-seq preparation methods leave identifiable signatures within each library. Specifically we show that the library preparation method results in differences in quality control metrics, as well as differences in the signal distribution at the 5 (′) end of transcribed regions. These shifts lead to disparities in eRNA identification, but do not impact analyses aimed at inferring the key regulators involved in changes to transcription. CONCLUSIONS: Run-on sequencing protocol variations result in technical signatures that can be used to identify both the enrichment and library preparation method of a particular data set. These technical signatures are batch effects that limit detailed comparisons of pausing ratios and eRNAs identified across protocols. However, these batch effects have only limited impact on our ability to infer which regulators underlie the observed transcriptional changes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08352-8).
format Online
Article
Text
id pubmed-8900324
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-89003242022-03-17 Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries Hunter, Samuel Sigauke, Rutendo F. Stanley, Jacob T. Allen, Mary A. Dowell, Robin D. BMC Genomics Research BACKGROUND: A variety of protocols exist for producing whole genome run-on transcription datasets. However, little is known about how differences between these protocols affect the signal within the resulting libraries. RESULTS: Using run-on transcription datasets generated from the same biological system, we show that a variety of GRO- and PRO-seq preparation methods leave identifiable signatures within each library. Specifically we show that the library preparation method results in differences in quality control metrics, as well as differences in the signal distribution at the 5 (′) end of transcribed regions. These shifts lead to disparities in eRNA identification, but do not impact analyses aimed at inferring the key regulators involved in changes to transcription. CONCLUSIONS: Run-on sequencing protocol variations result in technical signatures that can be used to identify both the enrichment and library preparation method of a particular data set. These technical signatures are batch effects that limit detailed comparisons of pausing ratios and eRNAs identified across protocols. However, these batch effects have only limited impact on our ability to infer which regulators underlie the observed transcriptional changes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08352-8). BioMed Central 2022-03-07 /pmc/articles/PMC8900324/ /pubmed/35255806 http://dx.doi.org/10.1186/s12864-022-08352-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hunter, Samuel
Sigauke, Rutendo F.
Stanley, Jacob T.
Allen, Mary A.
Dowell, Robin D.
Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title_full Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title_fullStr Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title_full_unstemmed Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title_short Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
title_sort protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8900324/
https://www.ncbi.nlm.nih.gov/pubmed/35255806
http://dx.doi.org/10.1186/s12864-022-08352-8
work_keys_str_mv AT huntersamuel protocolvariationsinrunontranscriptiondatasetpreparationproducedetectablesignaturesinsequencinglibraries
AT sigaukerutendof protocolvariationsinrunontranscriptiondatasetpreparationproducedetectablesignaturesinsequencinglibraries
AT stanleyjacobt protocolvariationsinrunontranscriptiondatasetpreparationproducedetectablesignaturesinsequencinglibraries
AT allenmarya protocolvariationsinrunontranscriptiondatasetpreparationproducedetectablesignaturesinsequencinglibraries
AT dowellrobind protocolvariationsinrunontranscriptiondatasetpreparationproducedetectablesignaturesinsequencinglibraries