Cargando…

Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques

Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions...

Descripción completa

Detalles Bibliográficos
Autores principales: Ortuño, Francisco M., Valenzuela, Olga, Pomares, Hector, Rojas, Fernando, Florido, Javier P., Urquiza, Jose M., Rojas, Ignacio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592395/
https://www.ncbi.nlm.nih.gov/pubmed/23066102
http://dx.doi.org/10.1093/nar/gks919
_version_ 1782262106079363072
author Ortuño, Francisco M.
Valenzuela, Olga
Pomares, Hector
Rojas, Fernando
Florido, Javier P.
Urquiza, Jose M.
Rojas, Ignacio
author_facet Ortuño, Francisco M.
Valenzuela, Olga
Pomares, Hector
Rojas, Fernando
Florido, Javier P.
Urquiza, Jose M.
Rojas, Ignacio
author_sort Ortuño, Francisco M.
collection PubMed
description Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased.
format Online
Article
Text
id pubmed-3592395
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35923952013-03-08 Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques Ortuño, Francisco M. Valenzuela, Olga Pomares, Hector Rojas, Fernando Florido, Javier P. Urquiza, Jose M. Rojas, Ignacio Nucleic Acids Res Methods Online Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased. Oxford University Press 2013-01 2012-10-11 /pmc/articles/PMC3592395/ /pubmed/23066102 http://dx.doi.org/10.1093/nar/gks919 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Methods Online
Ortuño, Francisco M.
Valenzuela, Olga
Pomares, Hector
Rojas, Fernando
Florido, Javier P.
Urquiza, Jose M.
Rojas, Ignacio
Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title_full Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title_fullStr Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title_full_unstemmed Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title_short Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
title_sort predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592395/
https://www.ncbi.nlm.nih.gov/pubmed/23066102
http://dx.doi.org/10.1093/nar/gks919
work_keys_str_mv AT ortunofranciscom predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT valenzuelaolga predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT pomareshector predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT rojasfernando predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT floridojavierp predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT urquizajosem predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques
AT rojasignacio predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques