Cargando…

Gibbs sampling and helix-cap motifs

Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure...

Descripción completa

Detalles Bibliográficos
Autores principales: Kruus, Erik, Thumfort, Peter, Tang, Chao, Wingreen, Ned S.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1234247/
https://www.ncbi.nlm.nih.gov/pubmed/16174845
http://dx.doi.org/10.1093/nar/gki842
_version_ 1782124998573424640
author Kruus, Erik
Thumfort, Peter
Tang, Chao
Wingreen, Ned S.
author_facet Kruus, Erik
Thumfort, Peter
Tang, Chao
Wingreen, Ned S.
author_sort Kruus, Erik
collection PubMed
description Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of α-helix caps, we test whether the information content of the sequence–structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of ±1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content.
format Text
id pubmed-1234247
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-12342472005-09-27 Gibbs sampling and helix-cap motifs Kruus, Erik Thumfort, Peter Tang, Chao Wingreen, Ned S. Nucleic Acids Res Article Protein backbones have characteristic secondary structures, including α-helices and β-sheets. Which structure is adopted locally is strongly biased by the local amino acid sequence of the protein. Accurate (probabilistic) mappings from sequence to structure are valuable for both secondary-structure prediction and protein design. For the case of α-helix caps, we test whether the information content of the sequence–structure mapping can be self-consistently improved by using a relaxed definition of the structure. We derive helix-cap sequence motifs using database helix assignments for proteins of known structure. These motifs are refined using Gibbs sampling in competition with a null motif. Then Gibbs sampling is repeated, allowing for frameshifts of ±1 amino acid residue, in order to find sequence motifs of higher total information content. All helix-cap motifs were found to have good generalization capability, as judged by training on a small set of non-redundant proteins and testing on a larger set. For overall prediction purposes, frameshift motifs using all training examples yielded the best results. Frameshift motifs using a fraction of all training examples performed best in terms of true positives among top predictions. However, motifs without frameshifts also performed well, despite a roughly one-third lower total information content. Oxford University Press 2005 2005-09-20 /pmc/articles/PMC1234247/ /pubmed/16174845 http://dx.doi.org/10.1093/nar/gki842 Text en © The Author 2005. Published by Oxford University Press. All rights reserved
spellingShingle Article
Kruus, Erik
Thumfort, Peter
Tang, Chao
Wingreen, Ned S.
Gibbs sampling and helix-cap motifs
title Gibbs sampling and helix-cap motifs
title_full Gibbs sampling and helix-cap motifs
title_fullStr Gibbs sampling and helix-cap motifs
title_full_unstemmed Gibbs sampling and helix-cap motifs
title_short Gibbs sampling and helix-cap motifs
title_sort gibbs sampling and helix-cap motifs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1234247/
https://www.ncbi.nlm.nih.gov/pubmed/16174845
http://dx.doi.org/10.1093/nar/gki842
work_keys_str_mv AT kruuserik gibbssamplingandhelixcapmotifs
AT thumfortpeter gibbssamplingandhelixcapmotifs
AT tangchao gibbssamplingandhelixcapmotifs
AT wingreenneds gibbssamplingandhelixcapmotifs