Cargando…

RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes

Motivation: The alignment of sequencing reads to a transcriptome is a common and important step in many RNA-seq analysis tasks. When aligning RNA-seq reads directly to a transcriptome (as is common in the de novo setting or when a trusted reference annotation is available), care must be taken to rep...

Descripción completa

Detalles Bibliográficos
Autores principales: Srivastava, Avi, Sarkar, Hirak, Gupta, Nitish, Patro, Rob
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908361/
https://www.ncbi.nlm.nih.gov/pubmed/27307617
http://dx.doi.org/10.1093/bioinformatics/btw277
_version_ 1782437667940597760
author Srivastava, Avi
Sarkar, Hirak
Gupta, Nitish
Patro, Rob
author_facet Srivastava, Avi
Sarkar, Hirak
Gupta, Nitish
Patro, Rob
author_sort Srivastava, Avi
collection PubMed
description Motivation: The alignment of sequencing reads to a transcriptome is a common and important step in many RNA-seq analysis tasks. When aligning RNA-seq reads directly to a transcriptome (as is common in the de novo setting or when a trusted reference annotation is available), care must be taken to report the potentially large number of multi-mapping locations per read. This can pose a substantial computational burden for existing aligners, and can considerably slow downstream analysis. Results: We introduce a novel concept, quasi-mapping, and an efficient algorithm implementing this approach for mapping sequencing reads to a transcriptome. By attempting only to report the potential loci of origin of a sequencing read, and not the base-to-base alignment by which it derives from the reference, RapMap—our tool implementing quasi-mapping—is capable of mapping sequencing reads to a target transcriptome substantially faster than existing alignment tools. The algorithm we use to implement quasi-mapping uses several efficient data structures and takes advantage of the special structure of shared sequence prevalent in transcriptomes to rapidly provide highly-accurate mapping information. We demonstrate how quasi-mapping can be successfully applied to the problems of transcript-level quantification from RNA-seq reads and the clustering of contigs from de novo assembled transcriptomes into biologically meaningful groups. Availability and implementation: RapMap is implemented in C ++11 and is available as open-source software, under GPL v3, at https://github.com/COMBINE-lab/RapMap. Contact: rob.patro@cs.stonybrook.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4908361
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49083612016-06-17 RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes Srivastava, Avi Sarkar, Hirak Gupta, Nitish Patro, Rob Bioinformatics Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Motivation: The alignment of sequencing reads to a transcriptome is a common and important step in many RNA-seq analysis tasks. When aligning RNA-seq reads directly to a transcriptome (as is common in the de novo setting or when a trusted reference annotation is available), care must be taken to report the potentially large number of multi-mapping locations per read. This can pose a substantial computational burden for existing aligners, and can considerably slow downstream analysis. Results: We introduce a novel concept, quasi-mapping, and an efficient algorithm implementing this approach for mapping sequencing reads to a transcriptome. By attempting only to report the potential loci of origin of a sequencing read, and not the base-to-base alignment by which it derives from the reference, RapMap—our tool implementing quasi-mapping—is capable of mapping sequencing reads to a target transcriptome substantially faster than existing alignment tools. The algorithm we use to implement quasi-mapping uses several efficient data structures and takes advantage of the special structure of shared sequence prevalent in transcriptomes to rapidly provide highly-accurate mapping information. We demonstrate how quasi-mapping can be successfully applied to the problems of transcript-level quantification from RNA-seq reads and the clustering of contigs from de novo assembled transcriptomes into biologically meaningful groups. Availability and implementation: RapMap is implemented in C ++11 and is available as open-source software, under GPL v3, at https://github.com/COMBINE-lab/RapMap. Contact: rob.patro@cs.stonybrook.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-06-15 2016-06-11 /pmc/articles/PMC4908361/ /pubmed/27307617 http://dx.doi.org/10.1093/bioinformatics/btw277 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
Srivastava, Avi
Sarkar, Hirak
Gupta, Nitish
Patro, Rob
RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title_full RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title_fullStr RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title_full_unstemmed RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title_short RapMap: a rapid, sensitive and accurate tool for mapping RNA-seq reads to transcriptomes
title_sort rapmap: a rapid, sensitive and accurate tool for mapping rna-seq reads to transcriptomes
topic Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908361/
https://www.ncbi.nlm.nih.gov/pubmed/27307617
http://dx.doi.org/10.1093/bioinformatics/btw277
work_keys_str_mv AT srivastavaavi rapmaparapidsensitiveandaccuratetoolformappingrnaseqreadstotranscriptomes
AT sarkarhirak rapmaparapidsensitiveandaccuratetoolformappingrnaseqreadstotranscriptomes
AT guptanitish rapmaparapidsensitiveandaccuratetoolformappingrnaseqreadstotranscriptomes
AT patrorob rapmaparapidsensitiveandaccuratetoolformappingrnaseqreadstotranscriptomes