Cargando…

TrueSight: a new algorithm for splice junction detection using RNA-seq

RNA-seq has proven to be a powerful technique for transcriptome profiling based on next-generation sequencing (NGS) technologies. However, due to the short length of NGS reads, it is challenging to accurately map RNA-seq reads to splice junctions (SJs), which is a critically important step in the an...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yang, Li-Byarlay, Hongmei, Burns, Paul, Borodovsky, Mark, Robinson, Gene E., Ma, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575843/
https://www.ncbi.nlm.nih.gov/pubmed/23254332
http://dx.doi.org/10.1093/nar/gks1311
_version_ 1782259785814507520
author Li, Yang
Li-Byarlay, Hongmei
Burns, Paul
Borodovsky, Mark
Robinson, Gene E.
Ma, Jian
author_facet Li, Yang
Li-Byarlay, Hongmei
Burns, Paul
Borodovsky, Mark
Robinson, Gene E.
Ma, Jian
author_sort Li, Yang
collection PubMed
description RNA-seq has proven to be a powerful technique for transcriptome profiling based on next-generation sequencing (NGS) technologies. However, due to the short length of NGS reads, it is challenging to accurately map RNA-seq reads to splice junctions (SJs), which is a critically important step in the analysis of alternative splicing (AS) and isoform construction. In this article, we describe a new method, called TrueSight, which for the first time combines RNA-seq read mapping quality and coding potential of genomic sequences into a unified model. The model is further utilized in a machine-learning approach to precisely identify SJs. Both simulations and real data evaluations showed that TrueSight achieved higher sensitivity and specificity than other methods. We applied TrueSight to new high coverage honey bee RNA-seq data to discover novel splice forms. We found that 60.3% of honey bee multi-exon genes are alternatively spliced. By utilizing gene models improved by TrueSight, we characterized AS types in honey bee transcriptome. We believe that TrueSight will be highly useful to comprehensively study the biology of alternative splicing.
format Online
Article
Text
id pubmed-3575843
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35758432013-02-19 TrueSight: a new algorithm for splice junction detection using RNA-seq Li, Yang Li-Byarlay, Hongmei Burns, Paul Borodovsky, Mark Robinson, Gene E. Ma, Jian Nucleic Acids Res Methods Online RNA-seq has proven to be a powerful technique for transcriptome profiling based on next-generation sequencing (NGS) technologies. However, due to the short length of NGS reads, it is challenging to accurately map RNA-seq reads to splice junctions (SJs), which is a critically important step in the analysis of alternative splicing (AS) and isoform construction. In this article, we describe a new method, called TrueSight, which for the first time combines RNA-seq read mapping quality and coding potential of genomic sequences into a unified model. The model is further utilized in a machine-learning approach to precisely identify SJs. Both simulations and real data evaluations showed that TrueSight achieved higher sensitivity and specificity than other methods. We applied TrueSight to new high coverage honey bee RNA-seq data to discover novel splice forms. We found that 60.3% of honey bee multi-exon genes are alternatively spliced. By utilizing gene models improved by TrueSight, we characterized AS types in honey bee transcriptome. We believe that TrueSight will be highly useful to comprehensively study the biology of alternative splicing. Oxford University Press 2013-02 2012-12-18 /pmc/articles/PMC3575843/ /pubmed/23254332 http://dx.doi.org/10.1093/nar/gks1311 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Methods Online
Li, Yang
Li-Byarlay, Hongmei
Burns, Paul
Borodovsky, Mark
Robinson, Gene E.
Ma, Jian
TrueSight: a new algorithm for splice junction detection using RNA-seq
title TrueSight: a new algorithm for splice junction detection using RNA-seq
title_full TrueSight: a new algorithm for splice junction detection using RNA-seq
title_fullStr TrueSight: a new algorithm for splice junction detection using RNA-seq
title_full_unstemmed TrueSight: a new algorithm for splice junction detection using RNA-seq
title_short TrueSight: a new algorithm for splice junction detection using RNA-seq
title_sort truesight: a new algorithm for splice junction detection using rna-seq
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575843/
https://www.ncbi.nlm.nih.gov/pubmed/23254332
http://dx.doi.org/10.1093/nar/gks1311
work_keys_str_mv AT liyang truesightanewalgorithmforsplicejunctiondetectionusingrnaseq
AT libyarlayhongmei truesightanewalgorithmforsplicejunctiondetectionusingrnaseq
AT burnspaul truesightanewalgorithmforsplicejunctiondetectionusingrnaseq
AT borodovskymark truesightanewalgorithmforsplicejunctiondetectionusingrnaseq
AT robinsongenee truesightanewalgorithmforsplicejunctiondetectionusingrnaseq
AT majian truesightanewalgorithmforsplicejunctiondetectionusingrnaseq