Cargando…
Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques
Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592395/ https://www.ncbi.nlm.nih.gov/pubmed/23066102 http://dx.doi.org/10.1093/nar/gks919 |
_version_ | 1782262106079363072 |
---|---|
author | Ortuño, Francisco M. Valenzuela, Olga Pomares, Hector Rojas, Fernando Florido, Javier P. Urquiza, Jose M. Rojas, Ignacio |
author_facet | Ortuño, Francisco M. Valenzuela, Olga Pomares, Hector Rojas, Fernando Florido, Javier P. Urquiza, Jose M. Rojas, Ignacio |
author_sort | Ortuño, Francisco M. |
collection | PubMed |
description | Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased. |
format | Online Article Text |
id | pubmed-3592395 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-35923952013-03-08 Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques Ortuño, Francisco M. Valenzuela, Olga Pomares, Hector Rojas, Fernando Florido, Javier P. Urquiza, Jose M. Rojas, Ignacio Nucleic Acids Res Methods Online Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased. Oxford University Press 2013-01 2012-10-11 /pmc/articles/PMC3592395/ /pubmed/23066102 http://dx.doi.org/10.1093/nar/gks919 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com. |
spellingShingle | Methods Online Ortuño, Francisco M. Valenzuela, Olga Pomares, Hector Rojas, Fernando Florido, Javier P. Urquiza, Jose M. Rojas, Ignacio Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title | Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title_full | Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title_fullStr | Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title_full_unstemmed | Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title_short | Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
title_sort | predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592395/ https://www.ncbi.nlm.nih.gov/pubmed/23066102 http://dx.doi.org/10.1093/nar/gks919 |
work_keys_str_mv | AT ortunofranciscom predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT valenzuelaolga predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT pomareshector predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT rojasfernando predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT floridojavierp predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT urquizajosem predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques AT rojasignacio predictingtheaccuracyofmultiplesequencealignmentalgorithmsbyusingcomputationalintelligenttechniques |