Cargando…
Protein multiple sequence alignment by hybrid bio-inspired algorithms
This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA ali...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064771/ https://www.ncbi.nlm.nih.gov/pubmed/21071394 http://dx.doi.org/10.1093/nar/gkq1052 |
_version_ | 1782200915831291904 |
---|---|
author | Cutello, Vincenzo Nicosia, Giuseppe Pavone, Mario Prizzi, Igor |
author_facet | Cutello, Vincenzo Nicosia, Giuseppe Pavone, Mario Prizzi, Igor |
author_sort | Cutello, Vincenzo |
collection | PubMed |
description | This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA alignments. The problem of finding the multiple alignment was investigated in the study by Bonizzoni and Vedova and Wang and Jiang, and proved to be a NP-hard (non-deterministic polynomial-time hard) problem. The presented algorithm, called Immunological Multiple Sequence Alignment Algorithm (IMSA), incorporates two new strategies to create the initial population and specific ad hoc mutation operators. It is based on the ‘weighted sum of pairs’ as objective function, to evaluate a given candidate alignment. IMSA was tested using both classical benchmarks of BAliBASE (versions 1.0, 2.0 and 3.0), and experimental results indicate that it is comparable with state-of-the-art multiple alignment algorithms, in terms of quality of alignments, weighted Sums-of-Pairs (SP) and Column Score (CS) values. The main novelty of IMSA is its ability to generate more than a single suboptimal alignment, for every MSA instance; this behaviour is due to the stochastic nature of the algorithm and of the populations evolved during the convergence process. This feature will help the decision maker to assess and select a biologically relevant multiple sequence alignment. Finally, the designed algorithm can be used as a local search procedure to properly explore promising alignments of the search space. |
format | Text |
id | pubmed-3064771 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30647712011-03-28 Protein multiple sequence alignment by hybrid bio-inspired algorithms Cutello, Vincenzo Nicosia, Giuseppe Pavone, Mario Prizzi, Igor Nucleic Acids Res Computational Biology This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA alignments. The problem of finding the multiple alignment was investigated in the study by Bonizzoni and Vedova and Wang and Jiang, and proved to be a NP-hard (non-deterministic polynomial-time hard) problem. The presented algorithm, called Immunological Multiple Sequence Alignment Algorithm (IMSA), incorporates two new strategies to create the initial population and specific ad hoc mutation operators. It is based on the ‘weighted sum of pairs’ as objective function, to evaluate a given candidate alignment. IMSA was tested using both classical benchmarks of BAliBASE (versions 1.0, 2.0 and 3.0), and experimental results indicate that it is comparable with state-of-the-art multiple alignment algorithms, in terms of quality of alignments, weighted Sums-of-Pairs (SP) and Column Score (CS) values. The main novelty of IMSA is its ability to generate more than a single suboptimal alignment, for every MSA instance; this behaviour is due to the stochastic nature of the algorithm and of the populations evolved during the convergence process. This feature will help the decision maker to assess and select a biologically relevant multiple sequence alignment. Finally, the designed algorithm can be used as a local search procedure to properly explore promising alignments of the search space. Oxford University Press 2011-03 2010-11-10 /pmc/articles/PMC3064771/ /pubmed/21071394 http://dx.doi.org/10.1093/nar/gkq1052 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Cutello, Vincenzo Nicosia, Giuseppe Pavone, Mario Prizzi, Igor Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title | Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title_full | Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title_fullStr | Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title_full_unstemmed | Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title_short | Protein multiple sequence alignment by hybrid bio-inspired algorithms |
title_sort | protein multiple sequence alignment by hybrid bio-inspired algorithms |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064771/ https://www.ncbi.nlm.nih.gov/pubmed/21071394 http://dx.doi.org/10.1093/nar/gkq1052 |
work_keys_str_mv | AT cutellovincenzo proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms AT nicosiagiuseppe proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms AT pavonemario proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms AT prizziigor proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms |