Cargando…

Protein multiple sequence alignment by hybrid bio-inspired algorithms

This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA ali...

Descripción completa

Detalles Bibliográficos
Autores principales: Cutello, Vincenzo, Nicosia, Giuseppe, Pavone, Mario, Prizzi, Igor
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064771/
https://www.ncbi.nlm.nih.gov/pubmed/21071394
http://dx.doi.org/10.1093/nar/gkq1052
_version_ 1782200915831291904
author Cutello, Vincenzo
Nicosia, Giuseppe
Pavone, Mario
Prizzi, Igor
author_facet Cutello, Vincenzo
Nicosia, Giuseppe
Pavone, Mario
Prizzi, Igor
author_sort Cutello, Vincenzo
collection PubMed
description This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA alignments. The problem of finding the multiple alignment was investigated in the study by Bonizzoni and Vedova and Wang and Jiang, and proved to be a NP-hard (non-deterministic polynomial-time hard) problem. The presented algorithm, called Immunological Multiple Sequence Alignment Algorithm (IMSA), incorporates two new strategies to create the initial population and specific ad hoc mutation operators. It is based on the ‘weighted sum of pairs’ as objective function, to evaluate a given candidate alignment. IMSA was tested using both classical benchmarks of BAliBASE (versions 1.0, 2.0 and 3.0), and experimental results indicate that it is comparable with state-of-the-art multiple alignment algorithms, in terms of quality of alignments, weighted Sums-of-Pairs (SP) and Column Score (CS) values. The main novelty of IMSA is its ability to generate more than a single suboptimal alignment, for every MSA instance; this behaviour is due to the stochastic nature of the algorithm and of the populations evolved during the convergence process. This feature will help the decision maker to assess and select a biologically relevant multiple sequence alignment. Finally, the designed algorithm can be used as a local search procedure to properly explore promising alignments of the search space.
format Text
id pubmed-3064771
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30647712011-03-28 Protein multiple sequence alignment by hybrid bio-inspired algorithms Cutello, Vincenzo Nicosia, Giuseppe Pavone, Mario Prizzi, Igor Nucleic Acids Res Computational Biology This article presents an immune inspired algorithm to tackle the Multiple Sequence Alignment (MSA) problem. MSA is one of the most important tasks in biological sequence analysis. Although this paper focuses on protein alignments, most of the discussion and methodology may also be applied to DNA alignments. The problem of finding the multiple alignment was investigated in the study by Bonizzoni and Vedova and Wang and Jiang, and proved to be a NP-hard (non-deterministic polynomial-time hard) problem. The presented algorithm, called Immunological Multiple Sequence Alignment Algorithm (IMSA), incorporates two new strategies to create the initial population and specific ad hoc mutation operators. It is based on the ‘weighted sum of pairs’ as objective function, to evaluate a given candidate alignment. IMSA was tested using both classical benchmarks of BAliBASE (versions 1.0, 2.0 and 3.0), and experimental results indicate that it is comparable with state-of-the-art multiple alignment algorithms, in terms of quality of alignments, weighted Sums-of-Pairs (SP) and Column Score (CS) values. The main novelty of IMSA is its ability to generate more than a single suboptimal alignment, for every MSA instance; this behaviour is due to the stochastic nature of the algorithm and of the populations evolved during the convergence process. This feature will help the decision maker to assess and select a biologically relevant multiple sequence alignment. Finally, the designed algorithm can be used as a local search procedure to properly explore promising alignments of the search space. Oxford University Press 2011-03 2010-11-10 /pmc/articles/PMC3064771/ /pubmed/21071394 http://dx.doi.org/10.1093/nar/gkq1052 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Cutello, Vincenzo
Nicosia, Giuseppe
Pavone, Mario
Prizzi, Igor
Protein multiple sequence alignment by hybrid bio-inspired algorithms
title Protein multiple sequence alignment by hybrid bio-inspired algorithms
title_full Protein multiple sequence alignment by hybrid bio-inspired algorithms
title_fullStr Protein multiple sequence alignment by hybrid bio-inspired algorithms
title_full_unstemmed Protein multiple sequence alignment by hybrid bio-inspired algorithms
title_short Protein multiple sequence alignment by hybrid bio-inspired algorithms
title_sort protein multiple sequence alignment by hybrid bio-inspired algorithms
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3064771/
https://www.ncbi.nlm.nih.gov/pubmed/21071394
http://dx.doi.org/10.1093/nar/gkq1052
work_keys_str_mv AT cutellovincenzo proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms
AT nicosiagiuseppe proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms
AT pavonemario proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms
AT prizziigor proteinmultiplesequencealignmentbyhybridbioinspiredalgorithms