Cargando…

Supersplat—spliced RNA-seq alignment

Motivation: High-throughput sequencing technologies have recently made deep interrogation of expressed transcript sequences practical, both economically and temporally. Identification of intron/exon boundaries is an essential part of genome annotation, yet remains a challenge. Here, we present super...

Descripción completa

Detalles Bibliográficos
Autores principales: Bryant, Douglas W., Shen, Rongkun, Priest, Henry D., Wong, Weng-Keen, Mockler, Todd C.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881391/
https://www.ncbi.nlm.nih.gov/pubmed/20410051
http://dx.doi.org/10.1093/bioinformatics/btq206
_version_ 1782182113483685888
author Bryant, Douglas W.
Shen, Rongkun
Priest, Henry D.
Wong, Weng-Keen
Mockler, Todd C.
author_facet Bryant, Douglas W.
Shen, Rongkun
Priest, Henry D.
Wong, Weng-Keen
Mockler, Todd C.
author_sort Bryant, Douglas W.
collection PubMed
description Motivation: High-throughput sequencing technologies have recently made deep interrogation of expressed transcript sequences practical, both economically and temporally. Identification of intron/exon boundaries is an essential part of genome annotation, yet remains a challenge. Here, we present supersplat, a method for unbiased splice-junction discovery through empirical RNA-seq data. Results: Using a genomic reference and RNA-seq high-throughput sequencing datasets, supersplat empirically identifies potential splice junctions at a rate of ∼11.4 million reads per hour. We further benchmark the performance of the algorithm by mapping Illumina RNA-seq reads to identify introns in the genome of the reference dicot plant Arabidopsis thaliana and we demonstrate the utility of supersplat for de novo empirical annotation of splice junctions using the reference monocot plant Brachypodium distachyon. Availability: Implemented in C++, supersplat source code and binaries are freely available on the web at http://mocklerlab-tools.cgrb.oregonstate.edu/ Contact: tmockler@cgrb.oregonstate.edu
format Text
id pubmed-2881391
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28813912010-06-08 Supersplat—spliced RNA-seq alignment Bryant, Douglas W. Shen, Rongkun Priest, Henry D. Wong, Weng-Keen Mockler, Todd C. Bioinformatics Original Papers Motivation: High-throughput sequencing technologies have recently made deep interrogation of expressed transcript sequences practical, both economically and temporally. Identification of intron/exon boundaries is an essential part of genome annotation, yet remains a challenge. Here, we present supersplat, a method for unbiased splice-junction discovery through empirical RNA-seq data. Results: Using a genomic reference and RNA-seq high-throughput sequencing datasets, supersplat empirically identifies potential splice junctions at a rate of ∼11.4 million reads per hour. We further benchmark the performance of the algorithm by mapping Illumina RNA-seq reads to identify introns in the genome of the reference dicot plant Arabidopsis thaliana and we demonstrate the utility of supersplat for de novo empirical annotation of splice junctions using the reference monocot plant Brachypodium distachyon. Availability: Implemented in C++, supersplat source code and binaries are freely available on the web at http://mocklerlab-tools.cgrb.oregonstate.edu/ Contact: tmockler@cgrb.oregonstate.edu Oxford University Press 2010-06-15 2010-04-21 /pmc/articles/PMC2881391/ /pubmed/20410051 http://dx.doi.org/10.1093/bioinformatics/btq206 Text en © The Author 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Bryant, Douglas W.
Shen, Rongkun
Priest, Henry D.
Wong, Weng-Keen
Mockler, Todd C.
Supersplat—spliced RNA-seq alignment
title Supersplat—spliced RNA-seq alignment
title_full Supersplat—spliced RNA-seq alignment
title_fullStr Supersplat—spliced RNA-seq alignment
title_full_unstemmed Supersplat—spliced RNA-seq alignment
title_short Supersplat—spliced RNA-seq alignment
title_sort supersplat—spliced rna-seq alignment
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2881391/
https://www.ncbi.nlm.nih.gov/pubmed/20410051
http://dx.doi.org/10.1093/bioinformatics/btq206
work_keys_str_mv AT bryantdouglasw supersplatsplicedrnaseqalignment
AT shenrongkun supersplatsplicedrnaseqalignment
AT priesthenryd supersplatsplicedrnaseqalignment
AT wongwengkeen supersplatsplicedrnaseqalignment
AT mocklertoddc supersplatsplicedrnaseqalignment