PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of se...
Main Authors: | Sahraeian, Sayed Mohammad Ebrahim; Yoon, Byung-Jun |
---|---|
Format: | Text |
Language: | English |
Published: | Oxford University Press 2010 |
Subjects: | Computational Biology |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2926610/ https://www.ncbi.nlm.nih.gov/pubmed/20413579 http://dx.doi.org/10.1093/nar/gkq255 |
_version_ | 1782185711475097600 |
---|---|
author | Sahraeian, Sayed Mohammad Ebrahim Yoon, Byung-Jun |
author_facet | Sahraeian, Sayed Mohammad Ebrahim Yoon, Byung-Jun |
author_sort | Sahraeian, Sayed Mohammad Ebrahim |
collection | PubMed |
description | Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of sequence sets. In this article, we introduce PicXAA (Probabilistic Maximum Accuracy Alignment), a probabilistic non-progressive alignment algorithm that aims to find protein alignments with maximum expected accuracy. PicXAA greedily builds up the multiple alignment from sequence regions with high local similarities, thereby yielding an accurate global alignment that effectively grasps the local similarities among sequences. Evaluations on several widely used benchmark sets show that PicXAA constantly yields accurate alignment results on a wide range of reference sets, with especially remarkable improvements over other leading algorithms on sequence sets with local similarities. PicXAA source code is freely available at: http://www.ece.tamu.edu/~bjyoon/picxaa/. |
format | Text |
id | pubmed-2926610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29266102010-08-30 PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences Sahraeian, Sayed Mohammad Ebrahim Yoon, Byung-Jun Nucleic Acids Res Computational Biology Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of sequence sets. In this article, we introduce PicXAA (Probabilistic Maximum Accuracy Alignment), a probabilistic non-progressive alignment algorithm that aims to find protein alignments with maximum expected accuracy. PicXAA greedily builds up the multiple alignment from sequence regions with high local similarities, thereby yielding an accurate global alignment that effectively grasps the local similarities among sequences. Evaluations on several widely used benchmark sets show that PicXAA constantly yields accurate alignment results on a wide range of reference sets, with especially remarkable improvements over other leading algorithms on sequence sets with local similarities. PicXAA source code is freely available at: http://www.ece.tamu.edu/~bjyoon/picxaa/. Oxford University Press 2010-08 2010-04-22 /pmc/articles/PMC2926610/ /pubmed/20413579 http://dx.doi.org/10.1093/nar/gkq255 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Sahraeian, Sayed Mohammad Ebrahim Yoon, Byung-Jun PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences |
title | PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2926610/ https://www.ncbi.nlm.nih.gov/pubmed/20413579 http://dx.doi.org/10.1093/nar/gkq255 |