Cargando…

A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness

BACKGROUND: Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or simi...

Descripción completa

Detalles Bibliográficos
Autores principales: Martínez-Castilla, León P., Rodríguez-Sotres, Rogelio
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935356/
https://www.ncbi.nlm.nih.gov/pubmed/20830209
http://dx.doi.org/10.1371/journal.pone.0012483
_version_ 1782186384013918208
author Martínez-Castilla, León P.
Rodríguez-Sotres, Rogelio
author_facet Martínez-Castilla, León P.
Rodríguez-Sotres, Rogelio
author_sort Martínez-Castilla, León P.
collection PubMed
description BACKGROUND: Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. PRINCIPAL FINDINGS: The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. CONCLUSION: Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. However, Rd.HMM is very sensitive to the conformational features of the models' backbone.
format Text
id pubmed-2935356
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29353562010-09-09 A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness Martínez-Castilla, León P. Rodríguez-Sotres, Rogelio PLoS One Research Article BACKGROUND: Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. PRINCIPAL FINDINGS: The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. CONCLUSION: Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. However, Rd.HMM is very sensitive to the conformational features of the models' backbone. Public Library of Science 2010-09-07 /pmc/articles/PMC2935356/ /pubmed/20830209 http://dx.doi.org/10.1371/journal.pone.0012483 Text en Martínez-Castilla, Rodríguez-Sotres. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Martínez-Castilla, León P.
Rodríguez-Sotres, Rogelio
A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title_full A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title_fullStr A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title_full_unstemmed A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title_short A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness
title_sort score of the ability of a three-dimensional protein model to retrieve its own sequence as a quantitative measure of its quality and appropriateness
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935356/
https://www.ncbi.nlm.nih.gov/pubmed/20830209
http://dx.doi.org/10.1371/journal.pone.0012483
work_keys_str_mv AT martinezcastillaleonp ascoreoftheabilityofathreedimensionalproteinmodeltoretrieveitsownsequenceasaquantitativemeasureofitsqualityandappropriateness
AT rodriguezsotresrogelio ascoreoftheabilityofathreedimensionalproteinmodeltoretrieveitsownsequenceasaquantitativemeasureofitsqualityandappropriateness
AT martinezcastillaleonp scoreoftheabilityofathreedimensionalproteinmodeltoretrieveitsownsequenceasaquantitativemeasureofitsqualityandappropriateness
AT rodriguezsotresrogelio scoreoftheabilityofathreedimensionalproteinmodeltoretrieveitsownsequenceasaquantitativemeasureofitsqualityandappropriateness