Cargando…
A polynomial time algorithm for computing the area under a GDT curve
BACKGROUND: Progress in the field of protein three-dimensional structure prediction depends on the development of new and improved algorithms for measuring the quality of protein models. Perhaps the best descriptor of the quality of a protein model is the GDT function that maps each distance cutoff...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620747/ https://www.ncbi.nlm.nih.gov/pubmed/26504491 http://dx.doi.org/10.1186/s13015-015-0058-0 |
_version_ | 1782397345407696896 |
---|---|
author | Poleksic, Aleksandar |
author_facet | Poleksic, Aleksandar |
author_sort | Poleksic, Aleksandar |
collection | PubMed |
description | BACKGROUND: Progress in the field of protein three-dimensional structure prediction depends on the development of new and improved algorithms for measuring the quality of protein models. Perhaps the best descriptor of the quality of a protein model is the GDT function that maps each distance cutoff θ to the number of atoms in the protein model that can be fit under the distance θ from the corresponding atoms in the experimentally determined structure. It has long been known that the area under the graph of this function (GDT_A) can serve as a reliable, single numerical measure of the model quality. Unfortunately, while the well-known GDT_TS metric provides a crude approximation of GDT_A, no algorithm currently exists that is capable of computing accurate estimates of GDT_A. METHODS: We prove that GDT_A is well defined and that it can be approximated by the Riemann sums, using available methods for computing accurate (near-optimal) GDT function values. RESULTS: In contrast to the GDT_TS metric, GDT_A is neither insensitive to large nor oversensitive to small changes in model’s coordinates. Moreover, the problem of computing GDT_A is tractable. More specifically, GDT_A can be computed in cubic asymptotic time in the size of the protein model. CONCLUSIONS: This paper presents the first algorithm capable of computing the near-optimal estimates of the area under the GDT function for a protein model. We believe that the techniques implemented in our algorithm will pave ways for the development of more practical and reliable procedures for estimating 3D model quality. |
format | Online Article Text |
id | pubmed-4620747 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-46207472015-10-27 A polynomial time algorithm for computing the area under a GDT curve Poleksic, Aleksandar Algorithms Mol Biol Research BACKGROUND: Progress in the field of protein three-dimensional structure prediction depends on the development of new and improved algorithms for measuring the quality of protein models. Perhaps the best descriptor of the quality of a protein model is the GDT function that maps each distance cutoff θ to the number of atoms in the protein model that can be fit under the distance θ from the corresponding atoms in the experimentally determined structure. It has long been known that the area under the graph of this function (GDT_A) can serve as a reliable, single numerical measure of the model quality. Unfortunately, while the well-known GDT_TS metric provides a crude approximation of GDT_A, no algorithm currently exists that is capable of computing accurate estimates of GDT_A. METHODS: We prove that GDT_A is well defined and that it can be approximated by the Riemann sums, using available methods for computing accurate (near-optimal) GDT function values. RESULTS: In contrast to the GDT_TS metric, GDT_A is neither insensitive to large nor oversensitive to small changes in model’s coordinates. Moreover, the problem of computing GDT_A is tractable. More specifically, GDT_A can be computed in cubic asymptotic time in the size of the protein model. CONCLUSIONS: This paper presents the first algorithm capable of computing the near-optimal estimates of the area under the GDT function for a protein model. We believe that the techniques implemented in our algorithm will pave ways for the development of more practical and reliable procedures for estimating 3D model quality. BioMed Central 2015-10-26 /pmc/articles/PMC4620747/ /pubmed/26504491 http://dx.doi.org/10.1186/s13015-015-0058-0 Text en © Poleksic. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Poleksic, Aleksandar A polynomial time algorithm for computing the area under a GDT curve |
title | A polynomial time algorithm for computing the area under a GDT curve |
title_full | A polynomial time algorithm for computing the area under a GDT curve |
title_fullStr | A polynomial time algorithm for computing the area under a GDT curve |
title_full_unstemmed | A polynomial time algorithm for computing the area under a GDT curve |
title_short | A polynomial time algorithm for computing the area under a GDT curve |
title_sort | polynomial time algorithm for computing the area under a gdt curve |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4620747/ https://www.ncbi.nlm.nih.gov/pubmed/26504491 http://dx.doi.org/10.1186/s13015-015-0058-0 |
work_keys_str_mv | AT poleksicaleksandar apolynomialtimealgorithmforcomputingtheareaunderagdtcurve AT poleksicaleksandar polynomialtimealgorithmforcomputingtheareaunderagdtcurve |