Cargando…

Minimum message length inference of secondary structure from protein coordinate data

Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Konagurthu, Arun S., Lesk, Arthur M., Allison, Lloyd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371855/
https://www.ncbi.nlm.nih.gov/pubmed/22689785
http://dx.doi.org/10.1093/bioinformatics/bts223
_version_ 1782235271366967296
author Konagurthu, Arun S.
Lesk, Arthur M.
Allison, Lloyd
author_facet Konagurthu, Arun S.
Lesk, Arthur M.
Allison, Lloyd
author_sort Konagurthu, Arun S.
collection PubMed
description Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the problem has resisted a consistent and mathematically rigorous definition. A variety of comparative studies have highlighted major disagreements in the way the available methods define and assign secondary structure to coordinate data. Results: We report a new method to infer secondary structure based on the Bayesian method of minimum message length inference. It treats assignments of secondary structure as hypotheses that explain the given coordinate data. The method seeks to maximize the joint probability of a hypothesis and the data. There is a natural null hypothesis and any assignment that cannot better it is unacceptable. We developed a program SST based on this approach and compared it with popular programs, such as DSSP and STRIDE among others. Our evaluation suggests that SST gives reliable assignments even on low-resolution structures. Availability: http://www.csse.monash.edu.au/~karun/sst Contact: arun.konagurthu@monash.edu (or lloyd.allison@monash.edu)
format Online
Article
Text
id pubmed-3371855
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33718552012-06-11 Minimum message length inference of secondary structure from protein coordinate data Konagurthu, Arun S. Lesk, Arthur M. Allison, Lloyd Bioinformatics Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the problem has resisted a consistent and mathematically rigorous definition. A variety of comparative studies have highlighted major disagreements in the way the available methods define and assign secondary structure to coordinate data. Results: We report a new method to infer secondary structure based on the Bayesian method of minimum message length inference. It treats assignments of secondary structure as hypotheses that explain the given coordinate data. The method seeks to maximize the joint probability of a hypothesis and the data. There is a natural null hypothesis and any assignment that cannot better it is unacceptable. We developed a program SST based on this approach and compared it with popular programs, such as DSSP and STRIDE among others. Our evaluation suggests that SST gives reliable assignments even on low-resolution structures. Availability: http://www.csse.monash.edu.au/~karun/sst Contact: arun.konagurthu@monash.edu (or lloyd.allison@monash.edu) Oxford University Press 2012-06-15 2012-06-09 /pmc/articles/PMC3371855/ /pubmed/22689785 http://dx.doi.org/10.1093/bioinformatics/bts223 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
Konagurthu, Arun S.
Lesk, Arthur M.
Allison, Lloyd
Minimum message length inference of secondary structure from protein coordinate data
title Minimum message length inference of secondary structure from protein coordinate data
title_full Minimum message length inference of secondary structure from protein coordinate data
title_fullStr Minimum message length inference of secondary structure from protein coordinate data
title_full_unstemmed Minimum message length inference of secondary structure from protein coordinate data
title_short Minimum message length inference of secondary structure from protein coordinate data
title_sort minimum message length inference of secondary structure from protein coordinate data
topic Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371855/
https://www.ncbi.nlm.nih.gov/pubmed/22689785
http://dx.doi.org/10.1093/bioinformatics/bts223
work_keys_str_mv AT konagurthuaruns minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata
AT leskarthurm minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata
AT allisonlloyd minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata