Cargando…

ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries

BACKGROUND: Drug discovery, disease detection, and personalized medicine are fast-growing areas of genomic research. With the advancement of next-generation sequencing techniques, researchers can obtain an abundance of data for many different biological assays in a short period of time. When this da...

Descripción completa

Detalles Bibliográficos
Autores principales: Clement, Nathan L, Thompson, Lee P, Miranker, Daniel P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4110726/
https://www.ncbi.nlm.nih.gov/pubmed/25079667
http://dx.doi.org/10.1186/1471-2105-15-S7-S1
_version_ 1782328021814870016
author Clement, Nathan L
Thompson, Lee P
Miranker, Daniel P
author_facet Clement, Nathan L
Thompson, Lee P
Miranker, Daniel P
author_sort Clement, Nathan L
collection PubMed
description BACKGROUND: Drug discovery, disease detection, and personalized medicine are fast-growing areas of genomic research. With the advancement of next-generation sequencing techniques, researchers can obtain an abundance of data for many different biological assays in a short period of time. When this data is error-free, the result is a high-quality base-pair resolution picture of the genome. However, when the data is lossy the heuristic algorithms currently used when aligning next-generation sequences causes the corresponding accuracy to drop. RESULTS: This paper describes a program, ADaM (APF DNA Mapper) which significantly increases final alignment accuracy. ADaM works by first using an existing program to align "easy" sequences, and then using an algorithm with accuracy guarantees (the APF) to align the remaining sequences. The final result is a technique that increases the mapping accuracy from only 60% to over 90% for harder-to-align sequences.
format Online
Article
Text
id pubmed-4110726
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41107262014-08-05 ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries Clement, Nathan L Thompson, Lee P Miranker, Daniel P BMC Bioinformatics Research BACKGROUND: Drug discovery, disease detection, and personalized medicine are fast-growing areas of genomic research. With the advancement of next-generation sequencing techniques, researchers can obtain an abundance of data for many different biological assays in a short period of time. When this data is error-free, the result is a high-quality base-pair resolution picture of the genome. However, when the data is lossy the heuristic algorithms currently used when aligning next-generation sequences causes the corresponding accuracy to drop. RESULTS: This paper describes a program, ADaM (APF DNA Mapper) which significantly increases final alignment accuracy. ADaM works by first using an existing program to align "easy" sequences, and then using an algorithm with accuracy guarantees (the APF) to align the remaining sequences. The final result is a technique that increases the mapping accuracy from only 60% to over 90% for harder-to-align sequences. BioMed Central 2014-05-28 /pmc/articles/PMC4110726/ /pubmed/25079667 http://dx.doi.org/10.1186/1471-2105-15-S7-S1 Text en Copyright © 2014 Clement et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Clement, Nathan L
Thompson, Lee P
Miranker, Daniel P
ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title_full ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title_fullStr ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title_full_unstemmed ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title_short ADaM: augmenting existing approximate fast matching algorithms with efficient and exact range queries
title_sort adam: augmenting existing approximate fast matching algorithms with efficient and exact range queries
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4110726/
https://www.ncbi.nlm.nih.gov/pubmed/25079667
http://dx.doi.org/10.1186/1471-2105-15-S7-S1
work_keys_str_mv AT clementnathanl adamaugmentingexistingapproximatefastmatchingalgorithmswithefficientandexactrangequeries
AT thompsonleep adamaugmentingexistingapproximatefastmatchingalgorithmswithefficientandexactrangequeries
AT mirankerdanielp adamaugmentingexistingapproximatefastmatchingalgorithmswithefficientandexactrangequeries