Cargando…
MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952873/ https://www.ncbi.nlm.nih.gov/pubmed/20802226 http://dx.doi.org/10.1093/nar/gkq622 |
_version_ | 1782187820445597696 |
---|---|
author | Wang, Kai Singh, Darshan Zeng, Zheng Coleman, Stephen J. Huang, Yan Savich, Gleb L. He, Xiaping Mieczkowski, Piotr Grimm, Sara A. Perou, Charles M. MacLeod, James N. Chiang, Derek Y. Prins, Jan F. Liu, Jinze |
author_facet | Wang, Kai Singh, Darshan Zeng, Zheng Coleman, Stephen J. Huang, Yan Savich, Gleb L. He, Xiaping Mieczkowski, Piotr Grimm, Sara A. Perou, Charles M. MacLeod, James N. Chiang, Derek Y. Prins, Jan F. Liu, Jinze |
author_sort | Wang, Kai |
collection | PubMed |
description | The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice. |
format | Text |
id | pubmed-2952873 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29528732010-10-12 MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery Wang, Kai Singh, Darshan Zeng, Zheng Coleman, Stephen J. Huang, Yan Savich, Gleb L. He, Xiaping Mieczkowski, Piotr Grimm, Sara A. Perou, Charles M. MacLeod, James N. Chiang, Derek Y. Prins, Jan F. Liu, Jinze Nucleic Acids Res Methods Online The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice. Oxford University Press 2010-10 2010-08-27 /pmc/articles/PMC2952873/ /pubmed/20802226 http://dx.doi.org/10.1093/nar/gkq622 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Wang, Kai Singh, Darshan Zeng, Zheng Coleman, Stephen J. Huang, Yan Savich, Gleb L. He, Xiaping Mieczkowski, Piotr Grimm, Sara A. Perou, Charles M. MacLeod, James N. Chiang, Derek Y. Prins, Jan F. Liu, Jinze MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title | MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title_full | MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title_fullStr | MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title_full_unstemmed | MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title_short | MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery |
title_sort | mapsplice: accurate mapping of rna-seq reads for splice junction discovery |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952873/ https://www.ncbi.nlm.nih.gov/pubmed/20802226 http://dx.doi.org/10.1093/nar/gkq622 |
work_keys_str_mv | AT wangkai mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT singhdarshan mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT zengzheng mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT colemanstephenj mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT huangyan mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT savichglebl mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT hexiaping mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT mieczkowskipiotr mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT grimmsaraa mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT peroucharlesm mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT macleodjamesn mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT chiangdereky mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT prinsjanf mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery AT liujinze mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery |