Cargando…

Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps

BACKGROUND: Protein sequence alignment is one of the basic tools in bioinformatics. Correct alignments are required for a range of tasks including the derivation of phylogenetic trees and protein structure prediction. Numerous studies have shown that the incorporation of predicted secondary structur...

Descripción completa

Detalles Bibliográficos
Autores principales: Ohlson, Tomas, Aggarwal, Varun, Elofsson, Arne, MacCallum, Robert M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1562450/
https://www.ncbi.nlm.nih.gov/pubmed/16869963
http://dx.doi.org/10.1186/1471-2105-7-357
_version_ 1782129505694908416
author Ohlson, Tomas
Aggarwal, Varun
Elofsson, Arne
MacCallum, Robert M
author_facet Ohlson, Tomas
Aggarwal, Varun
Elofsson, Arne
MacCallum, Robert M
author_sort Ohlson, Tomas
collection PubMed
description BACKGROUND: Protein sequence alignment is one of the basic tools in bioinformatics. Correct alignments are required for a range of tasks including the derivation of phylogenetic trees and protein structure prediction. Numerous studies have shown that the incorporation of predicted secondary structure information into alignment algorithms improves their performance. Secondary structure predictors have to be trained on a set of somewhat arbitrarily defined states (e.g. helix, strand, coil), and it has been shown that the choice of these states has some effect on alignment quality. However, it is not unlikely that prediction of other structural features also could provide an improvement. In this study we use an unsupervised clustering method, the self-organizing map, to assign sequence profile windows to "structural states" and assess their use in sequence alignment. RESULTS: The addition of self-organizing map locations as inputs to a profile-profile scoring function improves the alignment quality of distantly related proteins slightly. The improvement is slightly smaller than that gained from the inclusion of predicted secondary structure. However, the information seems to be complementary as the two prediction schemes can be combined to improve the alignment quality by a further small but significant amount. CONCLUSION: It has been observed in many studies that predicted secondary structure significantly improves the alignments. Here we have shown that the addition of self-organizing map locations can further improve the alignments as the self-organizing map locations seem to contain some information that is not captured by the predicted secondary structure.
format Text
id pubmed-1562450
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15624502006-09-08 Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps Ohlson, Tomas Aggarwal, Varun Elofsson, Arne MacCallum, Robert M BMC Bioinformatics Methodology Article BACKGROUND: Protein sequence alignment is one of the basic tools in bioinformatics. Correct alignments are required for a range of tasks including the derivation of phylogenetic trees and protein structure prediction. Numerous studies have shown that the incorporation of predicted secondary structure information into alignment algorithms improves their performance. Secondary structure predictors have to be trained on a set of somewhat arbitrarily defined states (e.g. helix, strand, coil), and it has been shown that the choice of these states has some effect on alignment quality. However, it is not unlikely that prediction of other structural features also could provide an improvement. In this study we use an unsupervised clustering method, the self-organizing map, to assign sequence profile windows to "structural states" and assess their use in sequence alignment. RESULTS: The addition of self-organizing map locations as inputs to a profile-profile scoring function improves the alignment quality of distantly related proteins slightly. The improvement is slightly smaller than that gained from the inclusion of predicted secondary structure. However, the information seems to be complementary as the two prediction schemes can be combined to improve the alignment quality by a further small but significant amount. CONCLUSION: It has been observed in many studies that predicted secondary structure significantly improves the alignments. Here we have shown that the addition of self-organizing map locations can further improve the alignments as the self-organizing map locations seem to contain some information that is not captured by the predicted secondary structure. BioMed Central 2006-07-25 /pmc/articles/PMC1562450/ /pubmed/16869963 http://dx.doi.org/10.1186/1471-2105-7-357 Text en Copyright © 2006 Ohlson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ohlson, Tomas
Aggarwal, Varun
Elofsson, Arne
MacCallum, Robert M
Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title_full Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title_fullStr Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title_full_unstemmed Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title_short Improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
title_sort improved alignment quality by combining evolutionary information, predicted secondary structure and self-organizing maps
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1562450/
https://www.ncbi.nlm.nih.gov/pubmed/16869963
http://dx.doi.org/10.1186/1471-2105-7-357
work_keys_str_mv AT ohlsontomas improvedalignmentqualitybycombiningevolutionaryinformationpredictedsecondarystructureandselforganizingmaps
AT aggarwalvarun improvedalignmentqualitybycombiningevolutionaryinformationpredictedsecondarystructureandselforganizingmaps
AT elofssonarne improvedalignmentqualitybycombiningevolutionaryinformationpredictedsecondarystructureandselforganizingmaps
AT maccallumrobertm improvedalignmentqualitybycombiningevolutionaryinformationpredictedsecondarystructureandselforganizingmaps