Cargando…

Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures

BACKGROUND: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We t...

Descripción completa

Detalles Bibliográficos
Autores principales: Bodén, Mikael, Yuan, Zheng, Bailey, Timothy L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386714/
https://www.ncbi.nlm.nih.gov/pubmed/16478545
http://dx.doi.org/10.1186/1471-2105-7-68
_version_ 1782126883581722624
author Bodén, Mikael
Yuan, Zheng
Bailey, Timothy L
author_facet Bodén, Mikael
Yuan, Zheng
Bailey, Timothy L
author_sort Bodén, Mikael
collection PubMed
description BACKGROUND: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. RESULTS: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. CONCLUSION: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods.
format Text
id pubmed-1386714
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-13867142006-04-21 Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures Bodén, Mikael Yuan, Zheng Bailey, Timothy L BMC Bioinformatics Methodology Article BACKGROUND: The structure of proteins may change as a result of the inherent flexibility of some protein regions. We develop and explore probabilistic machine learning methods for predicting a continuum secondary structure, i.e. assigning probabilities to the conformational states of a residue. We train our methods using data derived from high-quality NMR models. RESULTS: Several probabilistic models not only successfully estimate the continuum secondary structure, but also provide a categorical output on par with models directly trained on categorical data. Importantly, models trained on the continuum secondary structure are also better than their categorical counterparts at identifying the conformational state for structurally ambivalent residues. CONCLUSION: Cascaded probabilistic neural networks trained on the continuum secondary structure exhibit better accuracy in structurally ambivalent regions of proteins, while sustaining an overall classification accuracy on par with standard, categorical prediction methods. BioMed Central 2006-02-14 /pmc/articles/PMC1386714/ /pubmed/16478545 http://dx.doi.org/10.1186/1471-2105-7-68 Text en Copyright © 2006 Bodén et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Bodén, Mikael
Yuan, Zheng
Bailey, Timothy L
Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title_full Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title_fullStr Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title_full_unstemmed Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title_short Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structures
title_sort prediction of protein continuum secondary structure with probabilistic models based on nmr solved structures
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1386714/
https://www.ncbi.nlm.nih.gov/pubmed/16478545
http://dx.doi.org/10.1186/1471-2105-7-68
work_keys_str_mv AT bodenmikael predictionofproteincontinuumsecondarystructurewithprobabilisticmodelsbasedonnmrsolvedstructures
AT yuanzheng predictionofproteincontinuumsecondarystructurewithprobabilisticmodelsbasedonnmrsolvedstructures
AT baileytimothyl predictionofproteincontinuumsecondarystructurewithprobabilisticmodelsbasedonnmrsolvedstructures