Cargando…

MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery

The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Kai, Singh, Darshan, Zeng, Zheng, Coleman, Stephen J., Huang, Yan, Savich, Gleb L., He, Xiaping, Mieczkowski, Piotr, Grimm, Sara A., Perou, Charles M., MacLeod, James N., Chiang, Derek Y., Prins, Jan F., Liu, Jinze
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952873/
https://www.ncbi.nlm.nih.gov/pubmed/20802226
http://dx.doi.org/10.1093/nar/gkq622
_version_ 1782187820445597696
author Wang, Kai
Singh, Darshan
Zeng, Zheng
Coleman, Stephen J.
Huang, Yan
Savich, Gleb L.
He, Xiaping
Mieczkowski, Piotr
Grimm, Sara A.
Perou, Charles M.
MacLeod, James N.
Chiang, Derek Y.
Prins, Jan F.
Liu, Jinze
author_facet Wang, Kai
Singh, Darshan
Zeng, Zheng
Coleman, Stephen J.
Huang, Yan
Savich, Gleb L.
He, Xiaping
Mieczkowski, Piotr
Grimm, Sara A.
Perou, Charles M.
MacLeod, James N.
Chiang, Derek Y.
Prins, Jan F.
Liu, Jinze
author_sort Wang, Kai
collection PubMed
description The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice.
format Text
id pubmed-2952873
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29528732010-10-12 MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery Wang, Kai Singh, Darshan Zeng, Zheng Coleman, Stephen J. Huang, Yan Savich, Gleb L. He, Xiaping Mieczkowski, Piotr Grimm, Sara A. Perou, Charles M. MacLeod, James N. Chiang, Derek Y. Prins, Jan F. Liu, Jinze Nucleic Acids Res Methods Online The accurate mapping of reads that span splice junctions is a critical component of all analytic techniques that work with RNA-seq data. We introduce a second generation splice detection algorithm, MapSplice, whose focus is high sensitivity and specificity in the detection of splices as well as CPU and memory efficiency. MapSplice can be applied to both short (<75 bp) and long reads (≥75 bp). MapSplice is not dependent on splice site features or intron length, consequently it can detect novel canonical as well as non-canonical splices. MapSplice leverages the quality and diversity of read alignments of a given splice to increase accuracy. We demonstrate that MapSplice achieves higher sensitivity and specificity than TopHat and SpliceMap on a set of simulated RNA-seq data. Experimental studies also support the accuracy of the algorithm. Splice junctions derived from eight breast cancer RNA-seq datasets recapitulated the extensiveness of alternative splicing on a global level as well as the differences between molecular subtypes of breast cancer. These combined results indicate that MapSplice is a highly accurate algorithm for the alignment of RNA-seq reads to splice junctions. Software download URL: http://www.netlab.uky.edu/p/bioinfo/MapSplice. Oxford University Press 2010-10 2010-08-27 /pmc/articles/PMC2952873/ /pubmed/20802226 http://dx.doi.org/10.1093/nar/gkq622 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Wang, Kai
Singh, Darshan
Zeng, Zheng
Coleman, Stephen J.
Huang, Yan
Savich, Gleb L.
He, Xiaping
Mieczkowski, Piotr
Grimm, Sara A.
Perou, Charles M.
MacLeod, James N.
Chiang, Derek Y.
Prins, Jan F.
Liu, Jinze
MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title_full MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title_fullStr MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title_full_unstemmed MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title_short MapSplice: Accurate mapping of RNA-seq reads for splice junction discovery
title_sort mapsplice: accurate mapping of rna-seq reads for splice junction discovery
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2952873/
https://www.ncbi.nlm.nih.gov/pubmed/20802226
http://dx.doi.org/10.1093/nar/gkq622
work_keys_str_mv AT wangkai mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT singhdarshan mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT zengzheng mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT colemanstephenj mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT huangyan mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT savichglebl mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT hexiaping mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT mieczkowskipiotr mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT grimmsaraa mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT peroucharlesm mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT macleodjamesn mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT chiangdereky mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT prinsjanf mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery
AT liujinze mapspliceaccuratemappingofrnaseqreadsforsplicejunctiondiscovery