Cargando…
PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of s...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278765/ https://www.ncbi.nlm.nih.gov/pubmed/22219203 http://dx.doi.org/10.1093/bioinformatics/btr712 |
_version_ | 1782223608023613440 |
---|---|
author | Zhang, Yanju Lameijer, Eric-Wubbo 't Hoen, Peter A. C. Ning, Zemin Slagboom, P. Eline Ye, Kai |
author_facet | Zhang, Yanju Lameijer, Eric-Wubbo 't Hoen, Peter A. C. Ning, Zemin Slagboom, P. Eline Ye, Kai |
author_sort | Zhang, Yanju |
collection | PubMed |
description | Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3278765 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-32787652012-02-15 PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data Zhang, Yanju Lameijer, Eric-Wubbo 't Hoen, Peter A. C. Ning, Zemin Slagboom, P. Eline Ye, Kai Bioinformatics Original Papers Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep-sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA-Seq, the exact nature of splicing events is buried in the reads that span exon–exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted ∼ 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples. Availability: The code and utilities can be freely downloaded from https://trac.nbic.nl/passion and ftp://ftp.sanger.ac.uk/pub/zn1/passion Contact: y.zhang@lumc.nl; k.ye@lumc.nl Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2012-02-15 2012-01-04 /pmc/articles/PMC3278765/ /pubmed/22219203 http://dx.doi.org/10.1093/bioinformatics/btr712 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Zhang, Yanju Lameijer, Eric-Wubbo 't Hoen, Peter A. C. Ning, Zemin Slagboom, P. Eline Ye, Kai PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title | PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title_full | PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title_fullStr | PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title_full_unstemmed | PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title_short | PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data |
title_sort | passion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end rna-seq data |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278765/ https://www.ncbi.nlm.nih.gov/pubmed/22219203 http://dx.doi.org/10.1093/bioinformatics/btr712 |
work_keys_str_mv | AT zhangyanju passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata AT lameijerericwubbo passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata AT thoenpeterac passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata AT ningzemin passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata AT slagboompeline passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata AT yekai passionapatterngrowthalgorithmbasedpipelineforsplicejunctiondetectioninpairedendrnaseqdata |