Cargando…

Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets

BACKGROUND: Methods to read out naturally occurring or experimentally introduced nucleic acid modifications are emerging as powerful tools to study dynamic cellular processes. The recovery, quantification and interpretation of such events in high-throughput sequencing datasets demands specialized bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Neumann, Tobias, Herzog, Veronika A., Muhar, Matthias, von Haeseler, Arndt, Zuber, Johannes, Ameres, Stefan L., Rescheneder, Philipp
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528199/
https://www.ncbi.nlm.nih.gov/pubmed/31109287
http://dx.doi.org/10.1186/s12859-019-2849-7
_version_ 1783420163213754368
author Neumann, Tobias
Herzog, Veronika A.
Muhar, Matthias
von Haeseler, Arndt
Zuber, Johannes
Ameres, Stefan L.
Rescheneder, Philipp
author_facet Neumann, Tobias
Herzog, Veronika A.
Muhar, Matthias
von Haeseler, Arndt
Zuber, Johannes
Ameres, Stefan L.
Rescheneder, Philipp
author_sort Neumann, Tobias
collection PubMed
description BACKGROUND: Methods to read out naturally occurring or experimentally introduced nucleic acid modifications are emerging as powerful tools to study dynamic cellular processes. The recovery, quantification and interpretation of such events in high-throughput sequencing datasets demands specialized bioinformatics approaches. RESULTS: Here, we present Digital Unmasking of Nucleotide conversions in K-mers (DUNK), a data analysis pipeline enabling the quantification of nucleotide conversions in high-throughput sequencing datasets. We demonstrate using experimentally generated and simulated datasets that DUNK allows constant mapping rates irrespective of nucleotide-conversion rates, promotes the recovery of multimapping reads and employs Single Nucleotide Polymorphism (SNP) masking to uncouple true SNPs from nucleotide conversions to facilitate a robust and sensitive quantification of nucleotide-conversions. As a first application, we implement this strategy as SLAM-DUNK for the analysis of SLAMseq profiles, in which 4-thiouridine-labeled transcripts are detected based on T > C conversions. SLAM-DUNK provides both raw counts of nucleotide-conversion containing reads as well as a base-content and read coverage normalized approach for estimating the fractions of labeled transcripts as readout. CONCLUSION: Beyond providing a readily accessible tool for analyzing SLAMseq and related time-resolved RNA sequencing methods (TimeLapse-seq, TUC-seq), DUNK establishes a broadly applicable strategy for quantifying nucleotide conversions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2849-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6528199
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65281992019-05-28 Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets Neumann, Tobias Herzog, Veronika A. Muhar, Matthias von Haeseler, Arndt Zuber, Johannes Ameres, Stefan L. Rescheneder, Philipp BMC Bioinformatics Research Article BACKGROUND: Methods to read out naturally occurring or experimentally introduced nucleic acid modifications are emerging as powerful tools to study dynamic cellular processes. The recovery, quantification and interpretation of such events in high-throughput sequencing datasets demands specialized bioinformatics approaches. RESULTS: Here, we present Digital Unmasking of Nucleotide conversions in K-mers (DUNK), a data analysis pipeline enabling the quantification of nucleotide conversions in high-throughput sequencing datasets. We demonstrate using experimentally generated and simulated datasets that DUNK allows constant mapping rates irrespective of nucleotide-conversion rates, promotes the recovery of multimapping reads and employs Single Nucleotide Polymorphism (SNP) masking to uncouple true SNPs from nucleotide conversions to facilitate a robust and sensitive quantification of nucleotide-conversions. As a first application, we implement this strategy as SLAM-DUNK for the analysis of SLAMseq profiles, in which 4-thiouridine-labeled transcripts are detected based on T > C conversions. SLAM-DUNK provides both raw counts of nucleotide-conversion containing reads as well as a base-content and read coverage normalized approach for estimating the fractions of labeled transcripts as readout. CONCLUSION: Beyond providing a readily accessible tool for analyzing SLAMseq and related time-resolved RNA sequencing methods (TimeLapse-seq, TUC-seq), DUNK establishes a broadly applicable strategy for quantifying nucleotide conversions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2849-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-05-20 /pmc/articles/PMC6528199/ /pubmed/31109287 http://dx.doi.org/10.1186/s12859-019-2849-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Neumann, Tobias
Herzog, Veronika A.
Muhar, Matthias
von Haeseler, Arndt
Zuber, Johannes
Ameres, Stefan L.
Rescheneder, Philipp
Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title_full Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title_fullStr Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title_full_unstemmed Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title_short Quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
title_sort quantification of experimentally induced nucleotide conversions in high-throughput sequencing datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528199/
https://www.ncbi.nlm.nih.gov/pubmed/31109287
http://dx.doi.org/10.1186/s12859-019-2849-7
work_keys_str_mv AT neumanntobias quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT herzogveronikaa quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT muharmatthias quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT vonhaeselerarndt quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT zuberjohannes quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT ameresstefanl quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets
AT reschenederphilipp quantificationofexperimentallyinducednucleotideconversionsinhighthroughputsequencingdatasets