Cargando…

YAHA: fast and flexible long-read alignment with optimal breakpoint detection

Motivation: With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this. Results: We present YAHA, a fast and flexible...

Descripción completa

Detalles Bibliográficos
Autores principales: Faust, Gregory G., Hall, Ira M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463118/
https://www.ncbi.nlm.nih.gov/pubmed/22829624
http://dx.doi.org/10.1093/bioinformatics/bts456
_version_ 1782245259299782656
author Faust, Gregory G.
Hall, Ira M.
author_facet Faust, Gregory G.
Hall, Ira M.
author_sort Faust, Gregory G.
collection PubMed
description Motivation: With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this. Results: We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints. Availability: YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA. Contact: imh4y@virginia.edu
format Online
Article
Text
id pubmed-3463118
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34631182012-12-12 YAHA: fast and flexible long-read alignment with optimal breakpoint detection Faust, Gregory G. Hall, Ira M. Bioinformatics Original Papers Motivation: With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this. Results: We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints. Availability: YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA. Contact: imh4y@virginia.edu Oxford University Press 2012-10-01 2012-07-24 /pmc/articles/PMC3463118/ /pubmed/22829624 http://dx.doi.org/10.1093/bioinformatics/bts456 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Faust, Gregory G.
Hall, Ira M.
YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title_full YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title_fullStr YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title_full_unstemmed YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title_short YAHA: fast and flexible long-read alignment with optimal breakpoint detection
title_sort yaha: fast and flexible long-read alignment with optimal breakpoint detection
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463118/
https://www.ncbi.nlm.nih.gov/pubmed/22829624
http://dx.doi.org/10.1093/bioinformatics/bts456
work_keys_str_mv AT faustgregoryg yahafastandflexiblelongreadalignmentwithoptimalbreakpointdetection
AT halliram yahafastandflexiblelongreadalignmentwithoptimalbreakpointdetection