Cargando…
Gibbs sampling and helix-cap motifs
Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2005
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1234247/ https://www.ncbi.nlm.nih.gov/pubmed/16174845 http://dx.doi.org/10.1093/nar/gki842 |
_version_ | 1782124998573424640 |
---|---|
author | Kruus, Erik Thumfort, Peter Tang, Chao Wingreen, Ned S. |
author_facet | Kruus, Erik Thumfort, Peter Tang, Chao Wingreen, Ned S. |
author_sort | Kruus, Erik |
collection | PubMed |
description | Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of α-helix caps, we test whether the information content of the sequence–structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of ±1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content. |
format | Text |
id | pubmed-1234247 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-12342472005-09-27 Gibbs sampling and helix-cap motifs Kruus, Erik Thumfort, Peter Tang, Chao Wingreen, Ned S. Nucleic Acids Res Article Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of α-helix caps, we test whether the information content of the sequence–structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of ±1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content. Oxford University Press 2005 2005-09-20 /pmc/articles/PMC1234247/ /pubmed/16174845 http://dx.doi.org/10.1093/nar/gki842 Text en © The Author 2005. Published by Oxford University Press. All rights reserved |
spellingShingle | Article Kruus, Erik Thumfort, Peter Tang, Chao Wingreen, Ned S. Gibbs sampling and helix-cap motifs |
title | Gibbs sampling and helix-cap motifs |
title_full | Gibbs sampling and helix-cap motifs |
title_fullStr | Gibbs sampling and helix-cap motifs |
title_full_unstemmed | Gibbs sampling and helix-cap motifs |
title_short | Gibbs sampling and helix-cap motifs |
title_sort | gibbs sampling and helix-cap motifs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1234247/ https://www.ncbi.nlm.nih.gov/pubmed/16174845 http://dx.doi.org/10.1093/nar/gki842 |
work_keys_str_mv | AT kruuserik gibbssamplingandhelixcapmotifs AT thumfortpeter gibbssamplingandhelixcapmotifs AT tangchao gibbssamplingandhelixcapmotifs AT wingreenneds gibbssamplingandhelixcapmotifs |