Cargando…
ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach
BACKGROUND: Obtaining an accurate sequence alignment is fundamental for consistently analyzing biological data. Although this problem may be efficiently solved when only two sequences are considered, the exact inference of the optimal alignment easily gets computationally intractable for the multipl...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133627/ https://www.ncbi.nlm.nih.gov/pubmed/25099134 http://dx.doi.org/10.1186/1471-2105-15-265 |
_version_ | 1782330767636955136 |
---|---|
author | Lyras, Dimitrios P Metzler, Dirk |
author_facet | Lyras, Dimitrios P Metzler, Dirk |
author_sort | Lyras, Dimitrios P |
collection | PubMed |
description | BACKGROUND: Obtaining an accurate sequence alignment is fundamental for consistently analyzing biological data. Although this problem may be efficiently solved when only two sequences are considered, the exact inference of the optimal alignment easily gets computationally intractable for the multiple sequence alignment case. To cope with the high computational expenses, approximate heuristic methods have been proposed that address the problem indirectly by progressively aligning the sequences in pairs according to their relatedness. These methods however are not flexible to change the alignment of an already aligned group of sequences in the view of new data, resulting thus in compromises on the quality of the deriving alignment. In this paper we present ReformAlign, a novel meta-alignment approach that may significantly improve on the quality of the deriving alignments from popular aligners. We call ReformAlign a meta-aligner as it requires an initial alignment, for which a variety of alignment programs can be used. The main idea behind ReformAlign is quite straightforward: at first, an existing alignment is used to construct a standard profile which summarizes the initial alignment and then all sequences are individually re-aligned against the formed profile. From each sequence-profile comparison, the alignment of each sequence against the profile is recorded and the final alignment is indirectly inferred by merging all the individual sub-alignments into a unified set. The employment of ReformAlign may often result in alignments which are significantly more accurate than the starting alignments. RESULTS: We evaluated the effect of ReformAlign on the generated alignments from ten leading alignment methods using real data of variable size and sequence identity. The experimental results suggest that the proposed meta-aligner approach may often lead to statistically significant more accurate alignments. Furthermore, we show that ReformAlign results in more substantial improvement in cases where the starting alignment is of relatively inferior quality or when the input sequences are harder to align. CONCLUSIONS: The proposed profile-based meta-alignment approach seems to be a promising and computationally efficient method that can be combined with practically all popular alignment methods and may lead to significant improvements in the generated alignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-265) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4133627 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41336272014-08-16 ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach Lyras, Dimitrios P Metzler, Dirk BMC Bioinformatics Software BACKGROUND: Obtaining an accurate sequence alignment is fundamental for consistently analyzing biological data. Although this problem may be efficiently solved when only two sequences are considered, the exact inference of the optimal alignment easily gets computationally intractable for the multiple sequence alignment case. To cope with the high computational expenses, approximate heuristic methods have been proposed that address the problem indirectly by progressively aligning the sequences in pairs according to their relatedness. These methods however are not flexible to change the alignment of an already aligned group of sequences in the view of new data, resulting thus in compromises on the quality of the deriving alignment. In this paper we present ReformAlign, a novel meta-alignment approach that may significantly improve on the quality of the deriving alignments from popular aligners. We call ReformAlign a meta-aligner as it requires an initial alignment, for which a variety of alignment programs can be used. The main idea behind ReformAlign is quite straightforward: at first, an existing alignment is used to construct a standard profile which summarizes the initial alignment and then all sequences are individually re-aligned against the formed profile. From each sequence-profile comparison, the alignment of each sequence against the profile is recorded and the final alignment is indirectly inferred by merging all the individual sub-alignments into a unified set. The employment of ReformAlign may often result in alignments which are significantly more accurate than the starting alignments. RESULTS: We evaluated the effect of ReformAlign on the generated alignments from ten leading alignment methods using real data of variable size and sequence identity. The experimental results suggest that the proposed meta-aligner approach may often lead to statistically significant more accurate alignments. Furthermore, we show that ReformAlign results in more substantial improvement in cases where the starting alignment is of relatively inferior quality or when the input sequences are harder to align. CONCLUSIONS: The proposed profile-based meta-alignment approach seems to be a promising and computationally efficient method that can be combined with practically all popular alignment methods and may lead to significant improvements in the generated alignments. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-265) contains supplementary material, which is available to authorized users. BioMed Central 2014-08-07 /pmc/articles/PMC4133627/ /pubmed/25099134 http://dx.doi.org/10.1186/1471-2105-15-265 Text en © Lyras and Metzler; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Lyras, Dimitrios P Metzler, Dirk ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title | ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title_full | ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title_fullStr | ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title_full_unstemmed | ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title_short | ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach |
title_sort | reformalign: improved multiple sequence alignments using a profile-based meta-alignment approach |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133627/ https://www.ncbi.nlm.nih.gov/pubmed/25099134 http://dx.doi.org/10.1186/1471-2105-15-265 |
work_keys_str_mv | AT lyrasdimitriosp reformalignimprovedmultiplesequencealignmentsusingaprofilebasedmetaalignmentapproach AT metzlerdirk reformalignimprovedmultiplesequencealignmentsusingaprofilebasedmetaalignmentapproach |