Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences
Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraint...
Main Authors: | Taylor, William R.; Sadowski, Michael I. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2011 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3237328/ https://www.ncbi.nlm.nih.gov/pubmed/22194819 http://dx.doi.org/10.1371/journal.pone.0028265 |
_version_ | 1782218874850115584 |
---|---|
author | Taylor, William R.; Sadowski, Michael I. |
author_facet | Taylor, William R.; Sadowski, Michael I. |
author_sort | Taylor, William R. |
collection | PubMed |
description | Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraints that make the contact map more consistent with a three dimensional structure. These included global (bulk) properties and local secondary structure properties. The latter allowed the contact constraints to be employed at the level of filtering pairs of secondary structure contacts which led to a more efficient (lower-level) implementation in the PLATO structure prediction server. Where previously the measure of success with this method had been whether the correct fold was predicted in the top 10 ranked models, with the current implementation, our summary statistic is the number of correct folds included in the top 10 models — which is on average over 50 percent. |
format | Online Article Text |
id | pubmed-3237328 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-32373282011-12-22 Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences Taylor, William R. Sadowski, Michael I. PLoS One Research Article Residue contact predictions were calculated based on the mutual information observed between pairs of positions in large multiple protein sequence alignments. Where previously only the statistical properties of these data have been considered important, we introduce new measures to impose constraints that make the contact map more consistent with a three dimensional structure. These included global (bulk) properties and local secondary structure properties. The latter allowed the contact constraints to be employed at the level of filtering pairs of secondary structure contacts which led to a more efficient (lower-level) implementation in the PLATO structure prediction server. Where previously the measure of success with this method had been whether the correct fold was predicted in the top 10 ranked models, with the current implementation, our summary statistic is the number of correct folds included in the top 10 models — which is on average over 50 percent. Public Library of Science 2011-12-05 /pmc/articles/PMC3237328/ /pubmed/22194819 http://dx.doi.org/10.1371/journal.pone.0028265 Text en Taylor, Sadowski. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Taylor, William R. Sadowski, Michael I. Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title | Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title_full | Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title_fullStr | Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title_full_unstemmed | Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title_short | Structural Constraints on the Covariance Matrix Derived from Multiple Aligned Protein Sequences |
title_sort | structural constraints on the covariance matrix derived from multiple aligned protein sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3237328/ https://www.ncbi.nlm.nih.gov/pubmed/22194819 http://dx.doi.org/10.1371/journal.pone.0028265 |
work_keys_str_mv | AT taylorwilliamr structuralconstraintsonthecovariancematrixderivedfrommultiplealignedproteinsequences AT sadowskimichaeli structuralconstraintsonthecovariancematrixderivedfrommultiplealignedproteinsequences |