Cargando…

State of the art: refinement of multiple sequence alignments

BACKGROUND: Accurate multiple sequence alignments of proteins are very important in computational biology today. Despite the numerous efforts made in this field, all alignment strategies have certain shortcomings resulting in alignments that are not always correct. Refinement of existing alignment c...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakrabarti, Saikat, Lanczycki, Christopher J, Panchenko, Anna R, Przytycka, Teresa M, Thiessen, Paul A, Bryant, Stephen H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1654193/
https://www.ncbi.nlm.nih.gov/pubmed/17105653
http://dx.doi.org/10.1186/1471-2105-7-499
_version_ 1782131015686291456
author Chakrabarti, Saikat
Lanczycki, Christopher J
Panchenko, Anna R
Przytycka, Teresa M
Thiessen, Paul A
Bryant, Stephen H
author_facet Chakrabarti, Saikat
Lanczycki, Christopher J
Panchenko, Anna R
Przytycka, Teresa M
Thiessen, Paul A
Bryant, Stephen H
author_sort Chakrabarti, Saikat
collection PubMed
description BACKGROUND: Accurate multiple sequence alignments of proteins are very important in computational biology today. Despite the numerous efforts made in this field, all alignment strategies have certain shortcomings resulting in alignments that are not always correct. Refinement of existing alignment can prove to be an intelligent choice considering the increasing importance of high quality alignments in large scale high-throughput analysis. RESULTS: We provide an extensive comparison of the performance of the alignment refinement algorithms. The accuracy and efficiency of the refinement programs are compared using the 3D structure-based alignments in the BAliBASE benchmark database as well as manually curated high quality alignments from Conserved Domain Database (CDD). CONCLUSION: Comparison of performance for refined alignments revealed that despite the absence of dramatic improvements, our refinement method, REFINER, which uses conserved regions as constraints performs better in improving the alignments generated by different alignment algorithms. In most cases REFINER produces a higher-scoring, modestly improved alignment that does not deteriorate the well-conserved regions of the original alignment.
format Text
id pubmed-1654193
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16541932006-11-21 State of the art: refinement of multiple sequence alignments Chakrabarti, Saikat Lanczycki, Christopher J Panchenko, Anna R Przytycka, Teresa M Thiessen, Paul A Bryant, Stephen H BMC Bioinformatics Research Article BACKGROUND: Accurate multiple sequence alignments of proteins are very important in computational biology today. Despite the numerous efforts made in this field, all alignment strategies have certain shortcomings resulting in alignments that are not always correct. Refinement of existing alignment can prove to be an intelligent choice considering the increasing importance of high quality alignments in large scale high-throughput analysis. RESULTS: We provide an extensive comparison of the performance of the alignment refinement algorithms. The accuracy and efficiency of the refinement programs are compared using the 3D structure-based alignments in the BAliBASE benchmark database as well as manually curated high quality alignments from Conserved Domain Database (CDD). CONCLUSION: Comparison of performance for refined alignments revealed that despite the absence of dramatic improvements, our refinement method, REFINER, which uses conserved regions as constraints performs better in improving the alignments generated by different alignment algorithms. In most cases REFINER produces a higher-scoring, modestly improved alignment that does not deteriorate the well-conserved regions of the original alignment. BioMed Central 2006-11-14 /pmc/articles/PMC1654193/ /pubmed/17105653 http://dx.doi.org/10.1186/1471-2105-7-499 Text en Copyright © 2006 Chakrabarti et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chakrabarti, Saikat
Lanczycki, Christopher J
Panchenko, Anna R
Przytycka, Teresa M
Thiessen, Paul A
Bryant, Stephen H
State of the art: refinement of multiple sequence alignments
title State of the art: refinement of multiple sequence alignments
title_full State of the art: refinement of multiple sequence alignments
title_fullStr State of the art: refinement of multiple sequence alignments
title_full_unstemmed State of the art: refinement of multiple sequence alignments
title_short State of the art: refinement of multiple sequence alignments
title_sort state of the art: refinement of multiple sequence alignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1654193/
https://www.ncbi.nlm.nih.gov/pubmed/17105653
http://dx.doi.org/10.1186/1471-2105-7-499
work_keys_str_mv AT chakrabartisaikat stateoftheartrefinementofmultiplesequencealignments
AT lanczyckichristopherj stateoftheartrefinementofmultiplesequencealignments
AT panchenkoannar stateoftheartrefinementofmultiplesequencealignments
AT przytyckateresam stateoftheartrefinementofmultiplesequencealignments
AT thiessenpaula stateoftheartrefinementofmultiplesequencealignments
AT bryantstephenh stateoftheartrefinementofmultiplesequencealignments