Cargando…

PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences

Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of se...

Descripción completa

Detalles Bibliográficos
Autores principales: Sahraeian, Sayed Mohammad Ebrahim, Yoon, Byung-Jun
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2926610/
https://www.ncbi.nlm.nih.gov/pubmed/20413579
http://dx.doi.org/10.1093/nar/gkq255
_version_ 1782185711475097600
author Sahraeian, Sayed Mohammad Ebrahim
Yoon, Byung-Jun
author_facet Sahraeian, Sayed Mohammad Ebrahim
Yoon, Byung-Jun
author_sort Sahraeian, Sayed Mohammad Ebrahim
collection PubMed
description Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of sequence sets. In this article, we introduce PicXAA (Probabilistic Maximum Accuracy Alignment), a probabilistic non-progressive alignment algorithm that aims to find protein alignments with maximum expected accuracy. PicXAA greedily builds up the multiple alignment from sequence regions with high local similarities, thereby yielding an accurate global alignment that effectively grasps the local similarities among sequences. Evaluations on several widely used benchmark sets show that PicXAA constantly yields accurate alignment results on a wide range of reference sets, with especially remarkable improvements over other leading algorithms on sequence sets with local similarities. PicXAA source code is freely available at: http://www.ece.tamu.edu/∼bjyoon/picxaa/.
format Text
id pubmed-2926610
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29266102010-08-30 PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences Sahraeian, Sayed Mohammad Ebrahim Yoon, Byung-Jun Nucleic Acids Res Computational Biology Accurate tools for multiple sequence alignment (MSA) are essential for comparative studies of the function and structure of biological sequences. However, it is very challenging to develop a computationally efficient algorithm that can consistently predict accurate alignments for various types of sequence sets. In this article, we introduce PicXAA (Probabilistic Maximum Accuracy Alignment), a probabilistic non-progressive alignment algorithm that aims to find protein alignments with maximum expected accuracy. PicXAA greedily builds up the multiple alignment from sequence regions with high local similarities, thereby yielding an accurate global alignment that effectively grasps the local similarities among sequences. Evaluations on several widely used benchmark sets show that PicXAA constantly yields accurate alignment results on a wide range of reference sets, with especially remarkable improvements over other leading algorithms on sequence sets with local similarities. PicXAA source code is freely available at: http://www.ece.tamu.edu/∼bjyoon/picxaa/. Oxford University Press 2010-08 2010-04-22 /pmc/articles/PMC2926610/ /pubmed/20413579 http://dx.doi.org/10.1093/nar/gkq255 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Sahraeian, Sayed Mohammad Ebrahim
Yoon, Byung-Jun
PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title_full PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title_fullStr PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title_full_unstemmed PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title_short PicXAA: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
title_sort picxaa: greedy probabilistic construction of maximum expected accuracy alignment of multiple sequences
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2926610/
https://www.ncbi.nlm.nih.gov/pubmed/20413579
http://dx.doi.org/10.1093/nar/gkq255
work_keys_str_mv AT sahraeiansayedmohammadebrahim picxaagreedyprobabilisticconstructionofmaximumexpectedaccuracyalignmentofmultiplesequences
AT yoonbyungjun picxaagreedyprobabilisticconstructionofmaximumexpectedaccuracyalignmentofmultiplesequences