Cargando…

Prediction of protein domain boundaries from inverse covariances

It has been known even since relatively few structures had been solved that longer protein chains often contain multiple domains, which may fold separately and play the role of reusable functional modules found in many contexts. In many structural biology tasks, in particular structure prediction, i...

Descripción completa

Detalles Bibliográficos
Autor principal: Sadowski, Michael I
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wiley Subscription Services, Inc., A Wiley Company 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3563215/
https://www.ncbi.nlm.nih.gov/pubmed/22987736
http://dx.doi.org/10.1002/prot.24181
_version_ 1782258164732788736
author Sadowski, Michael I
author_facet Sadowski, Michael I
author_sort Sadowski, Michael I
collection PubMed
description It has been known even since relatively few structures had been solved that longer protein chains often contain multiple domains, which may fold separately and play the role of reusable functional modules found in many contexts. In many structural biology tasks, in particular structure prediction, it is of great use to be able to identify domains within the structure and analyze these regions separately. However, when using sequence data alone this task has proven exceptionally difficult, with relatively little improvement over the naive method of choosing boundaries based on size distributions of observed domains. The recent significant improvement in contact prediction provides a new source of information for domain prediction. We test several methods for using this information including a kernel smoothing-based approach and methods based on building alpha-carbon models and compare performance with a length-based predictor, a homology search method and four published sequence-based predictors: DOMCUT, DomPRO, DLP-SVM, and SCOOBY-DOmain. We show that the kernel-smoothing method is significantly better than the other ab initio predictors when both single-domain and multidomain targets are considered and is not significantly different to the homology-based method. Considering only multidomain targets the kernel-smoothing method outperforms all of the published methods except DLP-SVM. The kernel smoothing method therefore represents a potentially useful improvement to ab initio domain prediction. Proteins 2013. © 2012 Wiley Periodicals, Inc.
format Online
Article
Text
id pubmed-3563215
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Wiley Subscription Services, Inc., A Wiley Company
record_format MEDLINE/PubMed
spelling pubmed-35632152013-02-07 Prediction of protein domain boundaries from inverse covariances Sadowski, Michael I Proteins Articles It has been known even since relatively few structures had been solved that longer protein chains often contain multiple domains, which may fold separately and play the role of reusable functional modules found in many contexts. In many structural biology tasks, in particular structure prediction, it is of great use to be able to identify domains within the structure and analyze these regions separately. However, when using sequence data alone this task has proven exceptionally difficult, with relatively little improvement over the naive method of choosing boundaries based on size distributions of observed domains. The recent significant improvement in contact prediction provides a new source of information for domain prediction. We test several methods for using this information including a kernel smoothing-based approach and methods based on building alpha-carbon models and compare performance with a length-based predictor, a homology search method and four published sequence-based predictors: DOMCUT, DomPRO, DLP-SVM, and SCOOBY-DOmain. We show that the kernel-smoothing method is significantly better than the other ab initio predictors when both single-domain and multidomain targets are considered and is not significantly different to the homology-based method. Considering only multidomain targets the kernel-smoothing method outperforms all of the published methods except DLP-SVM. The kernel smoothing method therefore represents a potentially useful improvement to ab initio domain prediction. Proteins 2013. © 2012 Wiley Periodicals, Inc. Wiley Subscription Services, Inc., A Wiley Company 2013-02 /pmc/articles/PMC3563215/ /pubmed/22987736 http://dx.doi.org/10.1002/prot.24181 Text en Copyright © 2012 Wiley Periodicals, Inc. http://creativecommons.org/licenses/by/2.5/ Re-use of this article is permitted in accordance with the Creative Commons Deed, Attribution 2.5, which does not permit commercial exploitation.
spellingShingle Articles
Sadowski, Michael I
Prediction of protein domain boundaries from inverse covariances
title Prediction of protein domain boundaries from inverse covariances
title_full Prediction of protein domain boundaries from inverse covariances
title_fullStr Prediction of protein domain boundaries from inverse covariances
title_full_unstemmed Prediction of protein domain boundaries from inverse covariances
title_short Prediction of protein domain boundaries from inverse covariances
title_sort prediction of protein domain boundaries from inverse covariances
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3563215/
https://www.ncbi.nlm.nih.gov/pubmed/22987736
http://dx.doi.org/10.1002/prot.24181
work_keys_str_mv AT sadowskimichaeli predictionofproteindomainboundariesfrominversecovariances