Cargando…
URMAP, an ultra-fast read mapper
Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320720/ https://www.ncbi.nlm.nih.gov/pubmed/32612885 http://dx.doi.org/10.7717/peerj.9338 |
_version_ | 1783551301141921792 |
---|---|
author | Edgar, Robert |
author_facet | Edgar, Robert |
author_sort | Edgar, Robert |
collection | PubMed |
description | Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy (precision 0.998, sensitivity 0.982 and F-measure 0.990) with the strelka2 caller. However, GIAB reference variants are shown to be biased against repetitive regions which are difficult to map and may therefore pose an unrealistically easy challenge to read mappers and variant callers. |
format | Online Article Text |
id | pubmed-7320720 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-73207202020-06-30 URMAP, an ultra-fast read mapper Edgar, Robert PeerJ Bioinformatics Mapping of reads to reference sequences is an essential step in a wide range of biological studies. The large size of datasets generated with next-generation sequencing technologies motivates the development of fast mapping software. Here, I describe URMAP, a new read mapping algorithm. URMAP is an order of magnitude faster than BWA with comparable accuracy on several validation tests. On a Genome in a Bottle (GIAB) variant calling test with 30× coverage 2×150 reads, URMAP achieves high accuracy (precision 0.998, sensitivity 0.982 and F-measure 0.990) with the strelka2 caller. However, GIAB reference variants are shown to be biased against repetitive regions which are difficult to map and may therefore pose an unrealistically easy challenge to read mappers and variant callers. PeerJ Inc. 2020-06-24 /pmc/articles/PMC7320720/ /pubmed/32612885 http://dx.doi.org/10.7717/peerj.9338 Text en ©2020 Edgar https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Edgar, Robert URMAP, an ultra-fast read mapper |
title | URMAP, an ultra-fast read mapper |
title_full | URMAP, an ultra-fast read mapper |
title_fullStr | URMAP, an ultra-fast read mapper |
title_full_unstemmed | URMAP, an ultra-fast read mapper |
title_short | URMAP, an ultra-fast read mapper |
title_sort | urmap, an ultra-fast read mapper |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7320720/ https://www.ncbi.nlm.nih.gov/pubmed/32612885 http://dx.doi.org/10.7717/peerj.9338 |
work_keys_str_mv | AT edgarrobert urmapanultrafastreadmapper |