Cargando…

PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data

Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of s...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yanju, Lameijer, Eric-Wubbo, 't Hoen, Peter A. C., Ning, Zemin, Slagboom, P. Eline, Ye, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278765/
https://www.ncbi.nlm.nih.gov/pubmed/22219203
http://dx.doi.org/10.1093/bioinformatics/btr712
_version_ 1782223608023613440
author Zhang, Yanju
Lameijer, Eric-Wubbo
't Hoen, Peter A. C.
Ning, Zemin
Slagboom, P. Eline
Ye, Kai
author_facet Zhang, Yanju
Lameijer, Eric-Wubbo
't Hoen, Peter A. C.
Ning, Zemin
Slagboom, P. Eline
Ye, Kai
author_sort Zhang, Yanju
collection PubMed
description Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3278765
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32787652012-02-15 PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data Zhang, Yanju Lameijer, Eric-Wubbo 't Hoen, Peter A. C. Ning, Zemin Slagboom, P. Eline Ye, Kai Bioinformatics Original Papers Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-02-15 2012-01-04 /pmc/articles/PMC3278765/ /pubmed/22219203 http://dx.doi.org/10.1093/bioinformatics/btr712 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Zhang, Yanju
Lameijer, Eric-Wubbo
't Hoen, Peter A. C.
Ning, Zemin
Slagboom, P. Eline
Ye, Kai
PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title_full PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title_fullStr PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title_full_unstemmed PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title_short PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
title_sort passion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end rna-seq data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278765/
https://www.ncbi.nlm.nih.gov/pubmed/22219203
http://dx.doi.org/10.1093/bioinformatics/btr712
work_keys_str_mv AT zhangyanju passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata
AT lameijerericwubbo passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata
AT thoenpeterac passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata
AT ningzemin passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata
AT slagboompeline passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata
AT yekai passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata