Cargando…

Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling

Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the resi...

Descripción completa

Detalles Bibliográficos
Autores principales: Meier, Armin, Söding, Johannes
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619893/
https://www.ncbi.nlm.nih.gov/pubmed/26496371
http://dx.doi.org/10.1371/journal.pcbi.1004343
_version_ 1782397207238934528
author Meier, Armin
Söding, Johannes
author_facet Meier, Armin
Söding, Johannes
author_sort Meier, Armin
collection PubMed
description Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the residues in protein sequence space. Although multiple templates have been shown to generally increase model quality over single templates, the information from multiple templates has so far been combined using empirically motivated, heuristic approaches. We present here a rigorous statistical framework for multi-template homology modeling. First, we find that the query proteins’ atomic distance restraints can be accurately described by two-component Gaussian mixtures. This insight allowed us to apply the standard laws of probability theory to combine restraints from multiple templates. Second, we derive theoretically optimal weights to correct for the redundancy among related templates. Third, a heuristic template selection strategy is proposed. We improve the average GDT-ha model quality score by 11% over single template modeling and by 6.5% over a conventional multi-template approach on a set of 1000 query proteins. Robustness with respect to wrong constraints is likewise improved. We have integrated our multi-template modeling approach with the popular MODELLER homology modeling software in our free HHpred server http://toolkit.tuebingen.mpg.de/hhpred and also offer open source software for running MODELLER with the new restraints at https://bitbucket.org/soedinglab/hh-suite.
format Online
Article
Text
id pubmed-4619893
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46198932015-10-29 Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling Meier, Armin Söding, Johannes PLoS Comput Biol Research Article Homology modeling predicts the 3D structure of a query protein based on the sequence alignment with one or more template proteins of known structure. Its great importance for biological research is owed to its speed, simplicity, reliability and wide applicability, covering more than half of the residues in protein sequence space. Although multiple templates have been shown to generally increase model quality over single templates, the information from multiple templates has so far been combined using empirically motivated, heuristic approaches. We present here a rigorous statistical framework for multi-template homology modeling. First, we find that the query proteins’ atomic distance restraints can be accurately described by two-component Gaussian mixtures. This insight allowed us to apply the standard laws of probability theory to combine restraints from multiple templates. Second, we derive theoretically optimal weights to correct for the redundancy among related templates. Third, a heuristic template selection strategy is proposed. We improve the average GDT-ha model quality score by 11% over single template modeling and by 6.5% over a conventional multi-template approach on a set of 1000 query proteins. Robustness with respect to wrong constraints is likewise improved. We have integrated our multi-template modeling approach with the popular MODELLER homology modeling software in our free HHpred server http://toolkit.tuebingen.mpg.de/hhpred and also offer open source software for running MODELLER with the new restraints at https://bitbucket.org/soedinglab/hh-suite. Public Library of Science 2015-10-23 /pmc/articles/PMC4619893/ /pubmed/26496371 http://dx.doi.org/10.1371/journal.pcbi.1004343 Text en © 2015 Meier, Söding http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Meier, Armin
Söding, Johannes
Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title_full Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title_fullStr Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title_full_unstemmed Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title_short Automatic Prediction of Protein 3D Structures by Probabilistic Multi-template Homology Modeling
title_sort automatic prediction of protein 3d structures by probabilistic multi-template homology modeling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4619893/
https://www.ncbi.nlm.nih.gov/pubmed/26496371
http://dx.doi.org/10.1371/journal.pcbi.1004343
work_keys_str_mv AT meierarmin automaticpredictionofprotein3dstructuresbyprobabilisticmultitemplatehomologymodeling
AT sodingjohannes automaticpredictionofprotein3dstructuresbyprobabilisticmultitemplatehomologymodeling