Cargando…

Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences

Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraint...

Descripción completa

Detalles Bibliográficos
Autores principales: Taylor, William R., Sadowski, Michael I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3237328/
https://www.ncbi.nlm.nih.gov/pubmed/22194819
http://dx.doi.org/10.1371/journal.pone.0028265
_version_ 1782218874850115584
author Taylor, William R.
Sadowski, Michael I.
author_facet Taylor, William R.
Sadowski, Michael I.
author_sort Taylor, William R.
collection PubMed
description Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraints that make the contact map more consistent with a three dimensional structure. These included global (bulk) properties and local secondary structure properties. The latter allowed the contact constraints to be employed at the level of filtering pairs of secondary structure contacts which led to a more efficient (lower-level) implementation in the PLATO structure prediction server. Where previously the measure of success with this method had been whether the correct fold was predicted in the top 10 ranked models, with the current implementation, our summary statistic is the number of correct folds included in the top 10 models — which is on average over 50 percent.
format Online
Article
Text
id pubmed-3237328
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32373282011-12-22 Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences Taylor, William R. Sadowski, Michael I. PLoS One Research Article Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraints that make the contact map more consistent with a three dimensional structure. These included global (bulk) properties and local secondary structure properties. The latter allowed the contact constraints to be employed at the level of filtering pairs of secondary structure contacts which led to a more efficient (lower-level) implementation in the PLATO structure prediction server. Where previously the measure of success with this method had been whether the correct fold was predicted in the top 10 ranked models, with the current implementation, our summary statistic is the number of correct folds included in the top 10 models — which is on average over 50 percent. Public Library of Science 2011-12-05 /pmc/articles/PMC3237328/ /pubmed/22194819 http://dx.doi.org/10.1371/journal.pone.0028265 Text en Taylor, Sadowski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Taylor, William R.
Sadowski, Michael I.
Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title_full Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title_fullStr Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title_full_unstemmed Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title_short Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
title_sort structural constraints on the covariance matrix derived from multiple aligned protein sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3237328/
https://www.ncbi.nlm.nih.gov/pubmed/22194819
http://dx.doi.org/10.1371/journal.pone.0028265
work_keys_str_mv AT taylorwilliamr structuralconstraintsonthecovariancematrixderivedfrommultiplealignedproteinsequences
AT sadowskimichaeli structuralconstraintsonthecovariancematrixderivedfrommultiplealignedproteinsequences