Cargando…
Multiple sequence alignment with user-defined anchor points
BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481597/ https://www.ncbi.nlm.nih.gov/pubmed/16722533 http://dx.doi.org/10.1186/1748-7188-1-6 |
_version_ | 1782128272503472128 |
---|---|
author | Morgenstern, Burkhard Prohaska, Sonja J Pöhler, Dirk Stadler, Peter F |
author_facet | Morgenstern, Burkhard Prohaska, Sonja J Pöhler, Dirk Stadler, Peter F |
author_sort | Morgenstern, Burkhard |
collection | PubMed |
description | BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system. |
format | Text |
id | pubmed-1481597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-14815972006-06-22 Multiple sequence alignment with user-defined anchor points Morgenstern, Burkhard Prohaska, Sonja J Pöhler, Dirk Stadler, Peter F Algorithms Mol Biol Research BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system. BioMed Central 2006-04-19 /pmc/articles/PMC1481597/ /pubmed/16722533 http://dx.doi.org/10.1186/1748-7188-1-6 Text en Copyright © 2006 Morgenstern et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Morgenstern, Burkhard Prohaska, Sonja J Pöhler, Dirk Stadler, Peter F Multiple sequence alignment with user-defined anchor points |
title | Multiple sequence alignment with user-defined anchor points |
title_full | Multiple sequence alignment with user-defined anchor points |
title_fullStr | Multiple sequence alignment with user-defined anchor points |
title_full_unstemmed | Multiple sequence alignment with user-defined anchor points |
title_short | Multiple sequence alignment with user-defined anchor points |
title_sort | multiple sequence alignment with user-defined anchor points |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481597/ https://www.ncbi.nlm.nih.gov/pubmed/16722533 http://dx.doi.org/10.1186/1748-7188-1-6 |
work_keys_str_mv | AT morgensternburkhard multiplesequencealignmentwithuserdefinedanchorpoints AT prohaskasonjaj multiplesequencealignmentwithuserdefinedanchorpoints AT pohlerdirk multiplesequencealignmentwithuserdefinedanchorpoints AT stadlerpeterf multiplesequencealignmentwithuserdefinedanchorpoints |