Cargando…

Multiple sequence alignment with user-defined anchor points

BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take...

Descripción completa

Detalles Bibliográficos
Autores principales: Morgenstern, Burkhard, Prohaska, Sonja J, Pöhler, Dirk, Stadler, Peter F
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481597/
https://www.ncbi.nlm.nih.gov/pubmed/16722533
http://dx.doi.org/10.1186/1748-7188-1-6
_version_ 1782128272503472128
author Morgenstern, Burkhard
Prohaska, Sonja J
Pöhler, Dirk
Stadler, Peter F
author_facet Morgenstern, Burkhard
Prohaska, Sonja J
Pöhler, Dirk
Stadler, Peter F
author_sort Morgenstern, Burkhard
collection PubMed
description BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system.
format Text
id pubmed-1481597
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14815972006-06-22 Multiple sequence alignment with user-defined anchor points Morgenstern, Burkhard Prohaska, Sonja J Pöhler, Dirk Stadler, Peter F Algorithms Mol Biol Research BACKGROUND: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments. RESULTS: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system. BioMed Central 2006-04-19 /pmc/articles/PMC1481597/ /pubmed/16722533 http://dx.doi.org/10.1186/1748-7188-1-6 Text en Copyright © 2006 Morgenstern et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Morgenstern, Burkhard
Prohaska, Sonja J
Pöhler, Dirk
Stadler, Peter F
Multiple sequence alignment with user-defined anchor points
title Multiple sequence alignment with user-defined anchor points
title_full Multiple sequence alignment with user-defined anchor points
title_fullStr Multiple sequence alignment with user-defined anchor points
title_full_unstemmed Multiple sequence alignment with user-defined anchor points
title_short Multiple sequence alignment with user-defined anchor points
title_sort multiple sequence alignment with user-defined anchor points
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1481597/
https://www.ncbi.nlm.nih.gov/pubmed/16722533
http://dx.doi.org/10.1186/1748-7188-1-6
work_keys_str_mv AT morgensternburkhard multiplesequencealignmentwithuserdefinedanchorpoints
AT prohaskasonjaj multiplesequencealignmentwithuserdefinedanchorpoints
AT pohlerdirk multiplesequencealignmentwithuserdefinedanchorpoints
AT stadlerpeterf multiplesequencealignmentwithuserdefinedanchorpoints