Cargando…

SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data

BACKGROUND: Next-generation RNA sequencing technologies have been widely applied in transcriptome profiling. This facilitates further studies of gene structure and expression on the genome wide scale. It is an important step to align reads to the reference genome and call out splicing junctions for...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Chong, Li, Xin, Wu, Yufeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4674845/
https://www.ncbi.nlm.nih.gov/pubmed/26678515
http://dx.doi.org/10.1186/1471-2105-16-S17-S10
_version_ 1782404958680776704
author Chu, Chong
Li, Xin
Wu, Yufeng
author_facet Chu, Chong
Li, Xin
Wu, Yufeng
author_sort Chu, Chong
collection PubMed
description BACKGROUND: Next-generation RNA sequencing technologies have been widely applied in transcriptome profiling. This facilitates further studies of gene structure and expression on the genome wide scale. It is an important step to align reads to the reference genome and call out splicing junctions for the following analysis, such as the analysis of alternative splicing and isoform construction. However, because of the existence of introns, when RNA-seq reads are aligned to the reference genome, reads can not be fully mapped at splicing sites. Thus, it is challenging to align reads and call out splicing junctions accurately. RESULTS: In this paper, we present a classification based approach for calling splicing junctions from RNA-seq data, which is implemented in the program SpliceJumper. SpliceJumper uses a machine learning approach which combines multiple features extracted from RNA-seq data. We compare SpliceJumper with two existing RNA-seq analysis approaches, TopHat2 and MapSplice2, on both simulated and real data. Our results show that SpliceJumper outperforms TopHat2 and MapSplice2 in accuracy. The program SpliceJumper can be downloaded at https://github.com/Reedwarbler/SpliceJumper.
format Online
Article
Text
id pubmed-4674845
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46748452015-12-15 SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data Chu, Chong Li, Xin Wu, Yufeng BMC Bioinformatics Research BACKGROUND: Next-generation RNA sequencing technologies have been widely applied in transcriptome profiling. This facilitates further studies of gene structure and expression on the genome wide scale. It is an important step to align reads to the reference genome and call out splicing junctions for the following analysis, such as the analysis of alternative splicing and isoform construction. However, because of the existence of introns, when RNA-seq reads are aligned to the reference genome, reads can not be fully mapped at splicing sites. Thus, it is challenging to align reads and call out splicing junctions accurately. RESULTS: In this paper, we present a classification based approach for calling splicing junctions from RNA-seq data, which is implemented in the program SpliceJumper. SpliceJumper uses a machine learning approach which combines multiple features extracted from RNA-seq data. We compare SpliceJumper with two existing RNA-seq analysis approaches, TopHat2 and MapSplice2, on both simulated and real data. Our results show that SpliceJumper outperforms TopHat2 and MapSplice2 in accuracy. The program SpliceJumper can be downloaded at https://github.com/Reedwarbler/SpliceJumper. BioMed Central 2015-12-07 /pmc/articles/PMC4674845/ /pubmed/26678515 http://dx.doi.org/10.1186/1471-2105-16-S17-S10 Text en Copyright © 2015 Chu et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chu, Chong
Li, Xin
Wu, Yufeng
SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title_full SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title_fullStr SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title_full_unstemmed SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title_short SpliceJumper: a classification-based approach for calling splicing junctions from RNA-seq data
title_sort splicejumper: a classification-based approach for calling splicing junctions from rna-seq data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4674845/
https://www.ncbi.nlm.nih.gov/pubmed/26678515
http://dx.doi.org/10.1186/1471-2105-16-S17-S10
work_keys_str_mv AT chuchong splicejumperaclassificationbasedapproachforcallingsplicingjunctionsfromrnaseqdata
AT lixin splicejumperaclassificationbasedapproachforcallingsplicingjunctionsfromrnaseqdata
AT wuyufeng splicejumperaclassificationbasedapproachforcallingsplicingjunctionsfromrnaseqdata