Cargando…

Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment

BACKGROUND: Many Bioinformatics studies begin with a multiple sequence alignment as the foundation for their research. This is because multiple sequence alignment can be a useful technique for studying molecular evolution and analyzing sequence structure relationships. RESULTS: In this paper, we hav...

Descripción completa

Detalles Bibliográficos
Autores principales: Naznin, Farhana, Sarker, Ruhul, Essam, Daryl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3180391/
https://www.ncbi.nlm.nih.gov/pubmed/21867510
http://dx.doi.org/10.1186/1471-2105-12-353
_version_ 1782212626163433472
author Naznin, Farhana
Sarker, Ruhul
Essam, Daryl
author_facet Naznin, Farhana
Sarker, Ruhul
Essam, Daryl
author_sort Naznin, Farhana
collection PubMed
description BACKGROUND: Many Bioinformatics studies begin with a multiple sequence alignment as the foundation for their research. This is because multiple sequence alignment can be a useful technique for studying molecular evolution and analyzing sequence structure relationships. RESULTS: In this paper, we have proposed a Vertical Decomposition with Genetic Algorithm (VDGA) for Multiple Sequence Alignment (MSA). In VDGA, we divide the sequences vertically into two or more subsequences, and then solve them individually using a guide tree approach. Finally, we combine all the subsequences to generate a new multiple sequence alignment. This technique is applied on the solutions of the initial generation and of each child generation within VDGA. We have used two mechanisms to generate an initial population in this research: the first mechanism is to generate guide trees with randomly selected sequences and the second is shuffling the sequences inside such trees. Two different genetic operators have been implemented with VDGA. To test the performance of our algorithm, we have compared it with existing well-known methods, namely PRRP, CLUSTALX, DIALIGN, HMMT, SB_PIMA, ML_PIMA, MULTALIGN, and PILEUP8, and also other methods, based on Genetic Algorithms (GA), such as SAGA, MSA-GA and RBT-GA, by solving a number of benchmark datasets from BAliBase 2.0. CONCLUSIONS: The experimental results showed that the VDGA with three vertical divisions was the most successful variant for most of the test cases in comparison to other divisions considered with VDGA. The experimental results also confirmed that VDGA outperformed the other methods considered in this research.
format Online
Article
Text
id pubmed-3180391
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31803912011-09-27 Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment Naznin, Farhana Sarker, Ruhul Essam, Daryl BMC Bioinformatics Research Article BACKGROUND: Many Bioinformatics studies begin with a multiple sequence alignment as the foundation for their research. This is because multiple sequence alignment can be a useful technique for studying molecular evolution and analyzing sequence structure relationships. RESULTS: In this paper, we have proposed a Vertical Decomposition with Genetic Algorithm (VDGA) for Multiple Sequence Alignment (MSA). In VDGA, we divide the sequences vertically into two or more subsequences, and then solve them individually using a guide tree approach. Finally, we combine all the subsequences to generate a new multiple sequence alignment. This technique is applied on the solutions of the initial generation and of each child generation within VDGA. We have used two mechanisms to generate an initial population in this research: the first mechanism is to generate guide trees with randomly selected sequences and the second is shuffling the sequences inside such trees. Two different genetic operators have been implemented with VDGA. To test the performance of our algorithm, we have compared it with existing well-known methods, namely PRRP, CLUSTALX, DIALIGN, HMMT, SB_PIMA, ML_PIMA, MULTALIGN, and PILEUP8, and also other methods, based on Genetic Algorithms (GA), such as SAGA, MSA-GA and RBT-GA, by solving a number of benchmark datasets from BAliBase 2.0. CONCLUSIONS: The experimental results showed that the VDGA with three vertical divisions was the most successful variant for most of the test cases in comparison to other divisions considered with VDGA. The experimental results also confirmed that VDGA outperformed the other methods considered in this research. BioMed Central 2011-08-25 /pmc/articles/PMC3180391/ /pubmed/21867510 http://dx.doi.org/10.1186/1471-2105-12-353 Text en Copyright ©2011 Naznin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Naznin, Farhana
Sarker, Ruhul
Essam, Daryl
Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title_full Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title_fullStr Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title_full_unstemmed Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title_short Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment
title_sort vertical decomposition with genetic algorithm for multiple sequence alignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3180391/
https://www.ncbi.nlm.nih.gov/pubmed/21867510
http://dx.doi.org/10.1186/1471-2105-12-353
work_keys_str_mv AT nazninfarhana verticaldecompositionwithgeneticalgorithmformultiplesequencealignment
AT sarkerruhul verticaldecompositionwithgeneticalgorithmformultiplesequencealignment
AT essamdaryl verticaldecompositionwithgeneticalgorithmformultiplesequencealignment