Cargando…
Minimum message length inference of secondary structure from protein coordinate data
Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the pr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371855/ https://www.ncbi.nlm.nih.gov/pubmed/22689785 http://dx.doi.org/10.1093/bioinformatics/bts223 |
_version_ | 1782235271366967296 |
---|---|
author | Konagurthu, Arun S. Lesk, Arthur M. Allison, Lloyd |
author_facet | Konagurthu, Arun S. Lesk, Arthur M. Allison, Lloyd |
author_sort | Konagurthu, Arun S. |
collection | PubMed |
description | Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the problem has resisted a consistent and mathematically rigorous definition. A variety of comparative studies have highlighted major disagreements in the way the available methods define and assign secondary structure to coordinate data. Results: We report a new method to infer secondary structure based on the Bayesian method of minimum message length inference. It treats assignments of secondary structure as hypotheses that explain the given coordinate data. The method seeks to maximize the joint probability of a hypothesis and the data. There is a natural null hypothesis and any assignment that cannot better it is unacceptable. We developed a program SST based on this approach and compared it with popular programs, such as DSSP and STRIDE among others. Our evaluation suggests that SST gives reliable assignments even on low-resolution structures. Availability: http://www.csse.monash.edu.au/~karun/sst Contact: arun.konagurthu@monash.edu (or lloyd.allison@monash.edu) |
format | Online Article Text |
id | pubmed-3371855 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-33718552012-06-11 Minimum message length inference of secondary structure from protein coordinate data Konagurthu, Arun S. Lesk, Arthur M. Allison, Lloyd Bioinformatics Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Motivation: Secondary structure underpins the folding pattern and architecture of most proteins. Accurate assignment of the secondary structure elements is therefore an important problem. Although many approximate solutions of the secondary structure assignment problem exist, the statement of the problem has resisted a consistent and mathematically rigorous definition. A variety of comparative studies have highlighted major disagreements in the way the available methods define and assign secondary structure to coordinate data. Results: We report a new method to infer secondary structure based on the Bayesian method of minimum message length inference. It treats assignments of secondary structure as hypotheses that explain the given coordinate data. The method seeks to maximize the joint probability of a hypothesis and the data. There is a natural null hypothesis and any assignment that cannot better it is unacceptable. We developed a program SST based on this approach and compared it with popular programs, such as DSSP and STRIDE among others. Our evaluation suggests that SST gives reliable assignments even on low-resolution structures. Availability: http://www.csse.monash.edu.au/~karun/sst Contact: arun.konagurthu@monash.edu (or lloyd.allison@monash.edu) Oxford University Press 2012-06-15 2012-06-09 /pmc/articles/PMC3371855/ /pubmed/22689785 http://dx.doi.org/10.1093/bioinformatics/bts223 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa Konagurthu, Arun S. Lesk, Arthur M. Allison, Lloyd Minimum message length inference of secondary structure from protein coordinate data |
title | Minimum message length inference of secondary structure from protein coordinate data |
title_full | Minimum message length inference of secondary structure from protein coordinate data |
title_fullStr | Minimum message length inference of secondary structure from protein coordinate data |
title_full_unstemmed | Minimum message length inference of secondary structure from protein coordinate data |
title_short | Minimum message length inference of secondary structure from protein coordinate data |
title_sort | minimum message length inference of secondary structure from protein coordinate data |
topic | Ismb 2012 Proceedings Papers Committee July 15 to July 19, 2012, Long Beach, Ca, Usa |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3371855/ https://www.ncbi.nlm.nih.gov/pubmed/22689785 http://dx.doi.org/10.1093/bioinformatics/bts223 |
work_keys_str_mv | AT konagurthuaruns minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata AT leskarthurm minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata AT allisonlloyd minimummessagelengthinferenceofsecondarystructurefromproteincoordinatedata |