Cargando…

deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns

Motivation: High-throughput sequencing methods allow whole transcriptomes to be sequenced fast and cost-effectively. Short RNA sequencing provides not only quantitative expression data but also an opportunity to identify novel coding and non-coding RNAs. Many long transcripts undergo post-transcript...

Descripción completa

Detalles Bibliográficos
Autores principales: Langenberger, David, Pundhir, Sachin, Ekstrøm, Claus T., Stadler, Peter F., Hoffmann, Steve, Gorodkin, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3244762/
https://www.ncbi.nlm.nih.gov/pubmed/22053076
http://dx.doi.org/10.1093/bioinformatics/btr598
_version_ 1782219753901785088
author Langenberger, David
Pundhir, Sachin
Ekstrøm, Claus T.
Stadler, Peter F.
Hoffmann, Steve
Gorodkin, Jan
author_facet Langenberger, David
Pundhir, Sachin
Ekstrøm, Claus T.
Stadler, Peter F.
Hoffmann, Steve
Gorodkin, Jan
author_sort Langenberger, David
collection PubMed
description Motivation: High-throughput sequencing methods allow whole transcriptomes to be sequenced fast and cost-effectively. Short RNA sequencing provides not only quantitative expression data but also an opportunity to identify novel coding and non-coding RNAs. Many long transcripts undergo post-transcriptional processing that generates short RNA sequence fragments. Mapped back to a reference genome, they form distinctive patterns that convey information on both the structure of the parent transcript and the modalities of its processing. The miR-miR* pattern from microRNA precursors is the best-known, but by no means singular, example. Results: deepBlockAlign introduces a two-step approach to align RNA-seq read patterns with the aim of quickly identifying RNAs that share similar processing footprints. Overlapping mapped reads are first merged to blocks and then closely spaced blocks are combined to block groups, each representing a locus of expression. In order to compare block groups, the constituent blocks are first compared using a modified sequence alignment algorithm to determine similarity scores for pairs of blocks. In the second stage, block patterns are compared by means of a modified Sankoff algorithm that takes both block similarities and similarities of pattern of distances within the block groups into account. Hierarchical clustering of block groups clearly separates most miRNA and tRNA, and also identifies about a dozen tRNAs clustering together with miRNA. Most of these putative Dicer-processed tRNAs, including eight cases reported to generate products with miRNA-like features in literature, exhibit read blocks distinguished by precise start position of reads. Availability: The program deepBlockAlign is available as source code from http://rth.dk/resources/dba/. Contact: gorodkin@rth.dk; studla@bioinf.uni-leipzig.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3244762
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32447622011-12-22 deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns Langenberger, David Pundhir, Sachin Ekstrøm, Claus T. Stadler, Peter F. Hoffmann, Steve Gorodkin, Jan Bioinformatics Original Papers Motivation: High-throughput sequencing methods allow whole transcriptomes to be sequenced fast and cost-effectively. Short RNA sequencing provides not only quantitative expression data but also an opportunity to identify novel coding and non-coding RNAs. Many long transcripts undergo post-transcriptional processing that generates short RNA sequence fragments. Mapped back to a reference genome, they form distinctive patterns that convey information on both the structure of the parent transcript and the modalities of its processing. The miR-miR* pattern from microRNA precursors is the best-known, but by no means singular, example. Results: deepBlockAlign introduces a two-step approach to align RNA-seq read patterns with the aim of quickly identifying RNAs that share similar processing footprints. Overlapping mapped reads are first merged to blocks and then closely spaced blocks are combined to block groups, each representing a locus of expression. In order to compare block groups, the constituent blocks are first compared using a modified sequence alignment algorithm to determine similarity scores for pairs of blocks. In the second stage, block patterns are compared by means of a modified Sankoff algorithm that takes both block similarities and similarities of pattern of distances within the block groups into account. Hierarchical clustering of block groups clearly separates most miRNA and tRNA, and also identifies about a dozen tRNAs clustering together with miRNA. Most of these putative Dicer-processed tRNAs, including eight cases reported to generate products with miRNA-like features in literature, exhibit read blocks distinguished by precise start position of reads. Availability: The program deepBlockAlign is available as source code from http://rth.dk/resources/dba/. Contact: gorodkin@rth.dk; studla@bioinf.uni-leipzig.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-01-01 2011-11-03 /pmc/articles/PMC3244762/ /pubmed/22053076 http://dx.doi.org/10.1093/bioinformatics/btr598 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Langenberger, David
Pundhir, Sachin
Ekstrøm, Claus T.
Stadler, Peter F.
Hoffmann, Steve
Gorodkin, Jan
deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title_full deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title_fullStr deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title_full_unstemmed deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title_short deepBlockAlign: a tool for aligning RNA-seq profiles of read block patterns
title_sort deepblockalign: a tool for aligning rna-seq profiles of read block patterns
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3244762/
https://www.ncbi.nlm.nih.gov/pubmed/22053076
http://dx.doi.org/10.1093/bioinformatics/btr598
work_keys_str_mv AT langenbergerdavid deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns
AT pundhirsachin deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns
AT ekstrømclaust deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns
AT stadlerpeterf deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns
AT hoffmannsteve deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns
AT gorodkinjan deepblockalignatoolforaligningrnaseqprofilesofreadblockpatterns