Cargando…

Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison

BACKGROUND: Detection of common evolutionary origin (homology) is a primary means of inferring protein structure and function. At present, comparison of protein families represented as sequence profiles is arguably the most effective homology detection strategy. However, finding the best way to repr...

Descripción completa

Detalles Bibliográficos
Autores principales: Margelevičius, Mindaugas, Venclovas, Česlovas
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837030/
https://www.ncbi.nlm.nih.gov/pubmed/20158924
http://dx.doi.org/10.1186/1471-2105-11-89
_version_ 1782178763756273664
author Margelevičius, Mindaugas
Venclovas, Česlovas
author_facet Margelevičius, Mindaugas
Venclovas, Česlovas
author_sort Margelevičius, Mindaugas
collection PubMed
description BACKGROUND: Detection of common evolutionary origin (homology) is a primary means of inferring protein structure and function. At present, comparison of protein families represented as sequence profiles is arguably the most effective homology detection strategy. However, finding the best way to represent evolutionary information of a protein sequence family in the profile, to compare profiles and to estimate the biological significance of such comparisons, remains an active area of research. RESULTS: Here, we present a new homology detection method based on sequence profile-profile comparison. The method has a number of new features including position-dependent gap penalties and a global score system. Position-dependent gap penalties provide a more biologically relevant way to represent and align protein families as sequence profiles. The global score system enables an analytical solution of the statistical parameters needed to estimate the statistical significance of profile-profile similarities. The new method, together with other state-of-the-art profile-based methods (HHsearch, COMPASS and PSI-BLAST), is benchmarked in all-against-all comparison of a challenging set of SCOP domains that share at most 20% sequence identity. For benchmarking, we use a reference ("gold standard") free model-based evaluation framework. Evaluation results show that at the level of protein domains our method compares favorably to all other tested methods. We also provide examples of the new method outperforming structure-based similarity detection and alignment. The implementation of the new method both as a standalone software package and as a web server is available at http://www.ibt.lt/bioinformatics/coma. CONCLUSION: Due to a number of developments, the new profile-profile comparison method shows an improved ability to match distantly related protein domains. Therefore, the method should be useful for annotation and homology modeling of uncharacterized proteins.
format Text
id pubmed-2837030
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28370302010-03-12 Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison Margelevičius, Mindaugas Venclovas, Česlovas BMC Bioinformatics Research article BACKGROUND: Detection of common evolutionary origin (homology) is a primary means of inferring protein structure and function. At present, comparison of protein families represented as sequence profiles is arguably the most effective homology detection strategy. However, finding the best way to represent evolutionary information of a protein sequence family in the profile, to compare profiles and to estimate the biological significance of such comparisons, remains an active area of research. RESULTS: Here, we present a new homology detection method based on sequence profile-profile comparison. The method has a number of new features including position-dependent gap penalties and a global score system. Position-dependent gap penalties provide a more biologically relevant way to represent and align protein families as sequence profiles. The global score system enables an analytical solution of the statistical parameters needed to estimate the statistical significance of profile-profile similarities. The new method, together with other state-of-the-art profile-based methods (HHsearch, COMPASS and PSI-BLAST), is benchmarked in all-against-all comparison of a challenging set of SCOP domains that share at most 20% sequence identity. For benchmarking, we use a reference ("gold standard") free model-based evaluation framework. Evaluation results show that at the level of protein domains our method compares favorably to all other tested methods. We also provide examples of the new method outperforming structure-based similarity detection and alignment. The implementation of the new method both as a standalone software package and as a web server is available at http://www.ibt.lt/bioinformatics/coma. CONCLUSION: Due to a number of developments, the new profile-profile comparison method shows an improved ability to match distantly related protein domains. Therefore, the method should be useful for annotation and homology modeling of uncharacterized proteins. BioMed Central 2010-02-17 /pmc/articles/PMC2837030/ /pubmed/20158924 http://dx.doi.org/10.1186/1471-2105-11-89 Text en Copyright ©2010 Margelevičius and Venclovas; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Margelevičius, Mindaugas
Venclovas, Česlovas
Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title_full Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title_fullStr Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title_full_unstemmed Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title_short Detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
title_sort detection of distant evolutionary relationships between protein families using theory of sequence profile-profile comparison
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2837030/
https://www.ncbi.nlm.nih.gov/pubmed/20158924
http://dx.doi.org/10.1186/1471-2105-11-89
work_keys_str_mv AT margeleviciusmindaugas detectionofdistantevolutionaryrelationshipsbetweenproteinfamiliesusingtheoryofsequenceprofileprofilecomparison
AT venclovasceslovas detectionofdistantevolutionaryrelationshipsbetweenproteinfamiliesusingtheoryofsequenceprofileprofilecomparison