Cargando…

A fast algorithm for determining the best combination of local alignments to a query sequence

BACKGROUND: Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct...

Descripción completa

Detalles Bibliográficos
Autores principales: Conant, Gavin C, Wagner, Andreas
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC436051/
https://www.ncbi.nlm.nih.gov/pubmed/15149555
http://dx.doi.org/10.1186/1471-2105-5-62
_version_ 1782121527231119360
author Conant, Gavin C
Wagner, Andreas
author_facet Conant, Gavin C
Wagner, Andreas
author_sort Conant, Gavin C
collection PubMed
description BACKGROUND: Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct sequence similarity despite rearrangement would be helpful for reconstructing the evolutionary history of recombined sequences. RESULTS: We propose a graph-based algorithm for combining multiple local alignments to a query sequence into the single combination of alignments that either covers the maximal portion of the query or results in the single highest alignment score to the query. This algorithm can help study the process of genome rearrangement, improve functional gene annotation, and reconstruct the evolutionary history of recombined proteins. The algorithm takes O(n(2)) time, where n is the number of local alignments considered. CONCLUSIONS: We discuss two example applications of the algorithm. The algorithm is able to provide useful reconstructions of the metazoan mitochondrial genome. It is also able to increase the percentage of a query sequence's amino acid residues for which similar stretches of amino acids can be found in sequence databases.
format Text
id pubmed-436051
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4360512004-06-26 A fast algorithm for determining the best combination of local alignments to a query sequence Conant, Gavin C Wagner, Andreas BMC Bioinformatics Methodology Article BACKGROUND: Existing sequence alignment algorithms assume that similarities between DNA or amino acid sequences are linearly ordered. That is, stretches of similar nucleotides or amino acids are in the same order in both sequences. Recombination perturbs this order. An algorithm that can reconstruct sequence similarity despite rearrangement would be helpful for reconstructing the evolutionary history of recombined sequences. RESULTS: We propose a graph-based algorithm for combining multiple local alignments to a query sequence into the single combination of alignments that either covers the maximal portion of the query or results in the single highest alignment score to the query. This algorithm can help study the process of genome rearrangement, improve functional gene annotation, and reconstruct the evolutionary history of recombined proteins. The algorithm takes O(n(2)) time, where n is the number of local alignments considered. CONCLUSIONS: We discuss two example applications of the algorithm. The algorithm is able to provide useful reconstructions of the metazoan mitochondrial genome. It is also able to increase the percentage of a query sequence's amino acid residues for which similar stretches of amino acids can be found in sequence databases. BioMed Central 2004-05-18 /pmc/articles/PMC436051/ /pubmed/15149555 http://dx.doi.org/10.1186/1471-2105-5-62 Text en Copyright © 2004 Conant and Wagner; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Conant, Gavin C
Wagner, Andreas
A fast algorithm for determining the best combination of local alignments to a query sequence
title A fast algorithm for determining the best combination of local alignments to a query sequence
title_full A fast algorithm for determining the best combination of local alignments to a query sequence
title_fullStr A fast algorithm for determining the best combination of local alignments to a query sequence
title_full_unstemmed A fast algorithm for determining the best combination of local alignments to a query sequence
title_short A fast algorithm for determining the best combination of local alignments to a query sequence
title_sort fast algorithm for determining the best combination of local alignments to a query sequence
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC436051/
https://www.ncbi.nlm.nih.gov/pubmed/15149555
http://dx.doi.org/10.1186/1471-2105-5-62
work_keys_str_mv AT conantgavinc afastalgorithmfordeterminingthebestcombinationoflocalalignmentstoaquerysequence
AT wagnerandreas afastalgorithmfordeterminingthebestcombinationoflocalalignmentstoaquerysequence
AT conantgavinc fastalgorithmfordeterminingthebestcombinationoflocalalignmentstoaquerysequence
AT wagnerandreas fastalgorithmfordeterminingthebestcombinationoflocalalignmentstoaquerysequence