Cargando…

Statistical investigations of protein residue direct couplings

Protein Direct Coupling Analysis (DCA), which predicts residue-residue contacts based on covarying positions within a multiple sequence alignment, has been remarkably effective. This suggests that there is more to learn from sequence correlations than is generally assumed, and calls for deeper inves...

Descripción completa

Detalles Bibliográficos
Autores principales: Neuwald, Andrew F., Altschul, Stephen F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6329532/
https://www.ncbi.nlm.nih.gov/pubmed/30596639
http://dx.doi.org/10.1371/journal.pcbi.1006237
_version_ 1783386844739665920
author Neuwald, Andrew F.
Altschul, Stephen F.
author_facet Neuwald, Andrew F.
Altschul, Stephen F.
author_sort Neuwald, Andrew F.
collection PubMed
description Protein Direct Coupling Analysis (DCA), which predicts residue-residue contacts based on covarying positions within a multiple sequence alignment, has been remarkably effective. This suggests that there is more to learn from sequence correlations than is generally assumed, and calls for deeper investigations into DCA and perhaps into other types of correlations. Here we describe an approach that enables such investigations by measuring, as an estimated p-value, the statistical significance of the association between residue-residue covariance and structural interactions, either internal or homodimeric. Its application to thirty protein superfamilies confirms that direct coupling (DC) scores correlate with 3D pairwise contacts with very high significance. This method also permits quantitative assessment of the relative performance of alternative DCA methods, and of the degree to which they detect direct versus indirect couplings. We illustrate its use to assess, for a given protein, the biological relevance of alternative conformational states, to investigate the possible mechanistic implications of differences between these states, and to characterize subtle aspects of direct couplings. Our analysis indicates that direct pairwise correlations may be largely distinct from correlated patterns associated with functional specialization, and that the joint analysis of both types of correlations can yield greater power. Data, programs, and source code are freely available at http://evaldca.igs.umaryland.edu.
format Online
Article
Text
id pubmed-6329532
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63295322019-01-30 Statistical investigations of protein residue direct couplings Neuwald, Andrew F. Altschul, Stephen F. PLoS Comput Biol Research Article Protein Direct Coupling Analysis (DCA), which predicts residue-residue contacts based on covarying positions within a multiple sequence alignment, has been remarkably effective. This suggests that there is more to learn from sequence correlations than is generally assumed, and calls for deeper investigations into DCA and perhaps into other types of correlations. Here we describe an approach that enables such investigations by measuring, as an estimated p-value, the statistical significance of the association between residue-residue covariance and structural interactions, either internal or homodimeric. Its application to thirty protein superfamilies confirms that direct coupling (DC) scores correlate with 3D pairwise contacts with very high significance. This method also permits quantitative assessment of the relative performance of alternative DCA methods, and of the degree to which they detect direct versus indirect couplings. We illustrate its use to assess, for a given protein, the biological relevance of alternative conformational states, to investigate the possible mechanistic implications of differences between these states, and to characterize subtle aspects of direct couplings. Our analysis indicates that direct pairwise correlations may be largely distinct from correlated patterns associated with functional specialization, and that the joint analysis of both types of correlations can yield greater power. Data, programs, and source code are freely available at http://evaldca.igs.umaryland.edu. Public Library of Science 2018-12-31 /pmc/articles/PMC6329532/ /pubmed/30596639 http://dx.doi.org/10.1371/journal.pcbi.1006237 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Neuwald, Andrew F.
Altschul, Stephen F.
Statistical investigations of protein residue direct couplings
title Statistical investigations of protein residue direct couplings
title_full Statistical investigations of protein residue direct couplings
title_fullStr Statistical investigations of protein residue direct couplings
title_full_unstemmed Statistical investigations of protein residue direct couplings
title_short Statistical investigations of protein residue direct couplings
title_sort statistical investigations of protein residue direct couplings
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6329532/
https://www.ncbi.nlm.nih.gov/pubmed/30596639
http://dx.doi.org/10.1371/journal.pcbi.1006237
work_keys_str_mv AT neuwaldandrewf statisticalinvestigationsofproteinresiduedirectcouplings
AT altschulstephenf statisticalinvestigationsofproteinresiduedirectcouplings