Cargando…

ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data

BACKGROUND: Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enri...

Descripción completa

Detalles Bibliográficos
Autores principales: Promworn, Yuttachon, Kaewprommal, Pavita, Shaw, Philip J., Intarapanich, Apichart, Tongsima, Sissades, Piriyapongsa, Jittima
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5444824/
https://www.ncbi.nlm.nih.gov/pubmed/28542466
http://dx.doi.org/10.1371/journal.pone.0178483
_version_ 1783238774159835136
author Promworn, Yuttachon
Kaewprommal, Pavita
Shaw, Philip J.
Intarapanich, Apichart
Tongsima, Sissades
Piriyapongsa, Jittima
author_facet Promworn, Yuttachon
Kaewprommal, Pavita
Shaw, Philip J.
Intarapanich, Apichart
Tongsima, Sissades
Piriyapongsa, Jittima
author_sort Promworn, Yuttachon
collection PubMed
description BACKGROUND: Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. RESULTS: We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5′ ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5′ ends than TSSAR. In general, the transcript 5′ ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. CONCLUSION: ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5′ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER).
format Online
Article
Text
id pubmed-5444824
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54448242017-06-12 ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data Promworn, Yuttachon Kaewprommal, Pavita Shaw, Philip J. Intarapanich, Apichart Tongsima, Sissades Piriyapongsa, Jittima PLoS One Research Article BACKGROUND: Biochemical methods are available for enriching 5′ ends of RNAs in prokaryotes, which are employed in the differential RNA-seq (dRNA-seq) and the more recent Cappable-seq protocols. Computational methods are needed to locate RNA 5′ ends from these data by statistical analysis of the enrichment. Although statistical-based analysis methods have been developed for dRNA-seq, they may not be suitable for Cappable-seq data. The more efficient enrichment method employed in Cappable-seq compared with dRNA-seq could affect data distribution and thus algorithm performance. RESULTS: We present Transformation of Nucleotide Enrichment Ratios (ToNER), a tool for statistical modeling of enrichment from RNA-seq data obtained from enriched and unenriched libraries. The tool calculates nucleotide enrichment scores and determines the global transformation for fitting to the normal distribution using the Box-Cox procedure. From the transformed distribution, sites of significant enrichment are identified. To increase power of detection, meta-analysis across experimental replicates is offered. We tested the tool on Cappable-seq and dRNA-seq data for identifying Escherichia coli transcript 5′ ends and compared the results with those from the TSSAR tool, which is designed for analyzing dRNA-seq data. When combining results across Cappable-seq replicates, ToNER detects more known transcript 5′ ends than TSSAR. In general, the transcript 5′ ends detected by ToNER but not TSSAR occur in regions which cannot be locally modeled by TSSAR. CONCLUSION: ToNER uses a simple yet robust statistical modeling approach, which can be used for detecting RNA 5′ends from Cappable-seq data, in particular when combining information from experimental replicates. The ToNER tool could potentially be applied for analyzing other RNA-seq datasets in which enrichment for other structural features of RNA is employed. The program is freely available for download at ToNER webpage (http://www4a.biotec.or.th/GI/tools/toner) and GitHub repository (https://github.com/PavitaKae/ToNER). Public Library of Science 2017-05-25 /pmc/articles/PMC5444824/ /pubmed/28542466 http://dx.doi.org/10.1371/journal.pone.0178483 Text en © 2017 Promworn et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Promworn, Yuttachon
Kaewprommal, Pavita
Shaw, Philip J.
Intarapanich, Apichart
Tongsima, Sissades
Piriyapongsa, Jittima
ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title_full ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title_fullStr ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title_full_unstemmed ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title_short ToNER: A tool for identifying nucleotide enrichment signals in feature-enriched RNA-seq data
title_sort toner: a tool for identifying nucleotide enrichment signals in feature-enriched rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5444824/
https://www.ncbi.nlm.nih.gov/pubmed/28542466
http://dx.doi.org/10.1371/journal.pone.0178483
work_keys_str_mv AT promwornyuttachon toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata
AT kaewprommalpavita toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata
AT shawphilipj toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata
AT intarapanichapichart toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata
AT tongsimasissades toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata
AT piriyapongsajittima toneratoolforidentifyingnucleotideenrichmentsignalsinfeatureenrichedrnaseqdata