Cargando…

URMAP, an ultra-fast read mapper

Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an...

Descripción completa

Detalles Bibliográficos
Autor principal: Edgar, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320720/
https://www.ncbi.nlm.nih.gov/pubmed/32612885
http://dx.doi.org/10.7717/peerj.9338
_version_ 1783551301141921792
author Edgar, Robert
author_facet Edgar, Robert
author_sort Edgar, Robert
collection PubMed
description Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy (precision 0.998, sensitivity 0.982 and F-measure 0.990) with the strelka2 caller. However, GIAB reference variants are shown to be biased against repetitive regions which are difficult to map and may therefore pose an unrealistically easy challenge to read mappers and variant callers.
format Online
Article
Text
id pubmed-7320720
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-73207202020-06-30 URMAP, an ultra-fast read mapper Edgar, Robert PeerJ Bioinformatics Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy (precision 0.998, sensitivity 0.982 and F-measure 0.990) with the strelka2 caller. However, GIAB reference variants are shown to be biased against repetitive regions which are difficult to map and may therefore pose an unrealistically easy challenge to read mappers and variant callers. PeerJ Inc. 2020-06-24 /pmc/articles/PMC7320720/ /pubmed/32612885 http://dx.doi.org/10.7717/peerj.9338 Text en ©2020 Edgar https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Edgar, Robert
URMAP, an ultra-fast read mapper
title URMAP, an ultra-fast read mapper
title_full URMAP, an ultra-fast read mapper
title_fullStr URMAP, an ultra-fast read mapper
title_full_unstemmed URMAP, an ultra-fast read mapper
title_short URMAP, an ultra-fast read mapper
title_sort urmap, an ultra-fast read mapper
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320720/
https://www.ncbi.nlm.nih.gov/pubmed/32612885
http://dx.doi.org/10.7717/peerj.9338
work_keys_str_mv AT edgarrobert urmapanultrafastreadmapper