Cargando…
Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures
Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints in varying degrees on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546969/ https://www.ncbi.nlm.nih.gov/pubmed/23342110 http://dx.doi.org/10.1371/journal.pone.0054252 |
_version_ | 1782256147332333568 |
---|---|
author | Miyazawa, Sanzo |
author_facet | Miyazawa, Sanzo |
author_sort | Miyazawa, Sanzo |
collection | PubMed |
description | Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints in varying degrees on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous sequences and also in the evolutionary trace of amino acid substitutions. A challenge is to extract direct dependences between residue sites by removing phylogenetic correlations and indirect dependences through other residues within a protein or even through other molecules. Rapid growth of protein families with unknown folds requires an accurate de novo prediction method for protein structure. Recent attempts of disentangling direct from indirect dependences of amino acid types between residue positions in multiple sequence alignments have revealed that inferred residue-residue proximities can be sufficient information to predict a protein fold without the use of known three-dimensional structures. Here, we propose an alternative method of inferring coevolving site pairs from concurrent and compensatory substitutions between sites in each branch of a phylogenetic tree. Substitution probability and physico-chemical changes (volume, charge, hydrogen-bonding capability, and others) accompanied by substitutions at each site in each branch of a phylogenetic tree are estimated with the likelihood of each substitution, and their direct correlations between sites are used to detect concurrent and compensatory substitutions. In order to extract direct dependences between sites, partial correlation coefficients of the characteristic changes along branches between sites, in which linear multiple dependences on feature vectors at other sites are removed, are calculated and used to rank coevolving site pairs. Accuracy of contact prediction based on the present coevolution score is comparable to that achieved by a maximum entropy model of protein sequences for 15 protein families taken from the Pfam release 26.0. Besides, this excellent accuracy indicates that compensatory substitutions are significant in protein evolution. |
format | Online Article Text |
id | pubmed-3546969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-35469692013-01-22 Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures Miyazawa, Sanzo PLoS One Research Article Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints in varying degrees on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous sequences and also in the evolutionary trace of amino acid substitutions. A challenge is to extract direct dependences between residue sites by removing phylogenetic correlations and indirect dependences through other residues within a protein or even through other molecules. Rapid growth of protein families with unknown folds requires an accurate de novo prediction method for protein structure. Recent attempts of disentangling direct from indirect dependences of amino acid types between residue positions in multiple sequence alignments have revealed that inferred residue-residue proximities can be sufficient information to predict a protein fold without the use of known three-dimensional structures. Here, we propose an alternative method of inferring coevolving site pairs from concurrent and compensatory substitutions between sites in each branch of a phylogenetic tree. Substitution probability and physico-chemical changes (volume, charge, hydrogen-bonding capability, and others) accompanied by substitutions at each site in each branch of a phylogenetic tree are estimated with the likelihood of each substitution, and their direct correlations between sites are used to detect concurrent and compensatory substitutions. In order to extract direct dependences between sites, partial correlation coefficients of the characteristic changes along branches between sites, in which linear multiple dependences on feature vectors at other sites are removed, are calculated and used to rank coevolving site pairs. Accuracy of contact prediction based on the present coevolution score is comparable to that achieved by a maximum entropy model of protein sequences for 15 protein families taken from the Pfam release 26.0. Besides, this excellent accuracy indicates that compensatory substitutions are significant in protein evolution. Public Library of Science 2013-01-16 /pmc/articles/PMC3546969/ /pubmed/23342110 http://dx.doi.org/10.1371/journal.pone.0054252 Text en © 2013 Sanzo Miyazawa http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Miyazawa, Sanzo Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title | Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title_full | Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title_fullStr | Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title_full_unstemmed | Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title_short | Prediction of Contact Residue Pairs Based on Co-Substitution between Sites in Protein Structures |
title_sort | prediction of contact residue pairs based on co-substitution between sites in protein structures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3546969/ https://www.ncbi.nlm.nih.gov/pubmed/23342110 http://dx.doi.org/10.1371/journal.pone.0054252 |
work_keys_str_mv | AT miyazawasanzo predictionofcontactresiduepairsbasedoncosubstitutionbetweensitesinproteinstructures |