Cargando…

Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction

Determining the primary structure (i.e., amino acid sequence) of a protein has become cheaper, faster, and more accurate. Higher order protein structure provides insight into a protein’s function in the cell. Understanding a protein’s secondary structure is a first step towards this goal. Therefore,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Qiwei, Dahl, David B., Vannucci, Marina, Hyun Joo, Tsai, Jerry W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196994/ https://www.ncbi.nlm.nih.gov/pubmed/25314659 http://dx.doi.org/10.1371/journal.pone.0109832

_version_	1782339560491974656
author	Li, Qiwei Dahl, David B. Vannucci, Marina Hyun Joo, Tsai, Jerry W.
author_facet	Li, Qiwei Dahl, David B. Vannucci, Marina Hyun Joo, Tsai, Jerry W.
author_sort	Li, Qiwei
collection	PubMed
description	Determining the primary structure (i.e., amino acid sequence) of a protein has become cheaper, faster, and more accurate. Higher order protein structure provides insight into a protein’s function in the cell. Understanding a protein’s secondary structure is a first step towards this goal. Therefore, a number of computational prediction methods have been developed to predict secondary structure from just the primary amino acid sequence. The most successful methods use machine learning approaches that are quite accurate, but do not directly incorporate structural information. As a step towards improving secondary structure reduction given the primary structure, we propose a Bayesian model based on the knob-socket model of protein packing in secondary structure. The method considers the packing influence of residues on the secondary structure determination, including those packed close in space but distant in sequence. By performing an assessment of our method on 2 test sets we show how incorporation of multiple sequence alignment data, similarly to PSIPRED, provides balance and improves the accuracy of the predictions. Software implementing the methods is provided as a web application and a stand-alone implementation.
format	Online Article Text
id	pubmed-4196994
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-41969942014-10-16 Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction Li, Qiwei Dahl, David B. Vannucci, Marina Hyun Joo, Tsai, Jerry W. PLoS One Research Article Determining the primary structure (i.e., amino acid sequence) of a protein has become cheaper, faster, and more accurate. Higher order protein structure provides insight into a protein’s function in the cell. Understanding a protein’s secondary structure is a first step towards this goal. Therefore, a number of computational prediction methods have been developed to predict secondary structure from just the primary amino acid sequence. The most successful methods use machine learning approaches that are quite accurate, but do not directly incorporate structural information. As a step towards improving secondary structure reduction given the primary structure, we propose a Bayesian model based on the knob-socket model of protein packing in secondary structure. The method considers the packing influence of residues on the secondary structure determination, including those packed close in space but distant in sequence. By performing an assessment of our method on 2 test sets we show how incorporation of multiple sequence alignment data, similarly to PSIPRED, provides balance and improves the accuracy of the predictions. Software implementing the methods is provided as a web application and a stand-alone implementation. Public Library of Science 2014-10-14 /pmc/articles/PMC4196994/ /pubmed/25314659 http://dx.doi.org/10.1371/journal.pone.0109832 Text en © 2014 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Li, Qiwei Dahl, David B. Vannucci, Marina Hyun Joo, Tsai, Jerry W. Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title	Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title_full	Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title_fullStr	Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title_full_unstemmed	Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title_short	Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction
title_sort	bayesian model of protein primary sequence for secondary structure prediction
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4196994/ https://www.ncbi.nlm.nih.gov/pubmed/25314659 http://dx.doi.org/10.1371/journal.pone.0109832
work_keys_str_mv	AT liqiwei bayesianmodelofproteinprimarysequenceforsecondarystructureprediction AT dahldavidb bayesianmodelofproteinprimarysequenceforsecondarystructureprediction AT vannuccimarina bayesianmodelofproteinprimarysequenceforsecondarystructureprediction AT hyunjoo bayesianmodelofproteinprimarysequenceforsecondarystructureprediction AT tsaijerryw bayesianmodelofproteinprimarysequenceforsecondarystructureprediction

Bayesian Model of Protein Primary Sequence for Secondary Structure Prediction

Ejemplares similares