Cargando…
Lower bounds on multiple sequence alignment using exact 3-way alignment
BACKGROUND: Multiple sequence alignment is fundamental. Exponential growth in computation time appears to be inevitable when an optimal alignment is required for many sequences. Exact costs of optimum alignments are therefore rarely computed. Consequently much effort has been invested in algorithms...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1890300/ https://www.ncbi.nlm.nih.gov/pubmed/17470273 http://dx.doi.org/10.1186/1471-2105-8-140 |
_version_ | 1782133716670218240 |
---|---|
author | Colbourn, Charles J Kumar, Sudhir |
author_facet | Colbourn, Charles J Kumar, Sudhir |
author_sort | Colbourn, Charles J |
collection | PubMed |
description | BACKGROUND: Multiple sequence alignment is fundamental. Exponential growth in computation time appears to be inevitable when an optimal alignment is required for many sequences. Exact costs of optimum alignments are therefore rarely computed. Consequently much effort has been invested in algorithms for alignment that are heuristic, or explore a restricted class of solutions. These give an upper bound on the alignment cost, but it is equally important to determine the quality of the solution obtained. In the absence of an optimal alignment with which to compare, lower bounds may be calculated to assess the quality of the alignment. As more effort is invested in improving upper bounds (alignment algorithms), it is therefore important to improve lower bounds as well. Although numerous cost metrics can be used to determine the quality of an alignment, many are based on sum-of-pairs (SP) measures and their generalizations. RESULTS: Two standard and two new methods are considered for using exact 2-way and 3-way alignments to compute lower bounds on total SP alignment cost; one new method fares well with respect to accuracy, while the other reduces the computation time. The first employs exhaustive computation of exact 3-way alignments, while the second employs an efficient heuristic to compute a much smaller number of exact 3-way alignments. Calculating all 3-way alignments exactly and computing their average improves lower bounds on sum of SP cost in v-way alignments. However judicious selection of a subset of all 3-way alignments can yield a further improvement with minimal additional effort. On the other hand, a simple heuristic to select a random subset of 3-way alignments (a random packing) yields accuracy comparable to averaging all 3-way alignments with substantially less computational effort. CONCLUSION: Calculation of lower bounds on SP cost (and thus the quality of an alignment) can be improved by employing a mixture of 3-way and 2-way alignments. |
format | Text |
id | pubmed-1890300 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-18903002007-06-08 Lower bounds on multiple sequence alignment using exact 3-way alignment Colbourn, Charles J Kumar, Sudhir BMC Bioinformatics Research Article BACKGROUND: Multiple sequence alignment is fundamental. Exponential growth in computation time appears to be inevitable when an optimal alignment is required for many sequences. Exact costs of optimum alignments are therefore rarely computed. Consequently much effort has been invested in algorithms for alignment that are heuristic, or explore a restricted class of solutions. These give an upper bound on the alignment cost, but it is equally important to determine the quality of the solution obtained. In the absence of an optimal alignment with which to compare, lower bounds may be calculated to assess the quality of the alignment. As more effort is invested in improving upper bounds (alignment algorithms), it is therefore important to improve lower bounds as well. Although numerous cost metrics can be used to determine the quality of an alignment, many are based on sum-of-pairs (SP) measures and their generalizations. RESULTS: Two standard and two new methods are considered for using exact 2-way and 3-way alignments to compute lower bounds on total SP alignment cost; one new method fares well with respect to accuracy, while the other reduces the computation time. The first employs exhaustive computation of exact 3-way alignments, while the second employs an efficient heuristic to compute a much smaller number of exact 3-way alignments. Calculating all 3-way alignments exactly and computing their average improves lower bounds on sum of SP cost in v-way alignments. However judicious selection of a subset of all 3-way alignments can yield a further improvement with minimal additional effort. On the other hand, a simple heuristic to select a random subset of 3-way alignments (a random packing) yields accuracy comparable to averaging all 3-way alignments with substantially less computational effort. CONCLUSION: Calculation of lower bounds on SP cost (and thus the quality of an alignment) can be improved by employing a mixture of 3-way and 2-way alignments. BioMed Central 2007-04-30 /pmc/articles/PMC1890300/ /pubmed/17470273 http://dx.doi.org/10.1186/1471-2105-8-140 Text en Copyright © 2007 Colbourn and Kumar; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Colbourn, Charles J Kumar, Sudhir Lower bounds on multiple sequence alignment using exact 3-way alignment |
title | Lower bounds on multiple sequence alignment using exact 3-way alignment |
title_full | Lower bounds on multiple sequence alignment using exact 3-way alignment |
title_fullStr | Lower bounds on multiple sequence alignment using exact 3-way alignment |
title_full_unstemmed | Lower bounds on multiple sequence alignment using exact 3-way alignment |
title_short | Lower bounds on multiple sequence alignment using exact 3-way alignment |
title_sort | lower bounds on multiple sequence alignment using exact 3-way alignment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1890300/ https://www.ncbi.nlm.nih.gov/pubmed/17470273 http://dx.doi.org/10.1186/1471-2105-8-140 |
work_keys_str_mv | AT colbourncharlesj lowerboundsonmultiplesequencealignmentusingexact3wayalignment AT kumarsudhir lowerboundsonmultiplesequencealignmentusingexact3wayalignment |