Cargando…

MMM: Integrative ensemble modeling and ensemble analysis

Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a...

Descripción completa

Detalles Bibliográficos
Autor principal: Jeschke, Gunnar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737775/
https://www.ncbi.nlm.nih.gov/pubmed/33015891
http://dx.doi.org/10.1002/pro.3965
_version_ 1783622991809085440
author Jeschke, Gunnar
author_facet Jeschke, Gunnar
author_sort Jeschke, Gunnar
collection PubMed
description Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a realistic ensemble width; (c) sufficient sampling of conformational space; (d) representation of the ensemble by an interpretable number of conformers; (e) recognition of weak order with site resolution. Here, I introduce several tools that address these problems, focusing on utilization of distance distribution information for estimating ensemble width. The RigiFlex approach integrates such information with high‐resolution structures of ordered domains and small‐angle scattering data. The EnsembleFit module provides moderately sized ensembles by fitting conformer populations and discarding conformers with low population. EnsembleFit balances the loss in fit quality upon combining restraint subsets from different techniques. Pair correlation analysis for residues and local compaction analysis help in feature detection. The RigiFlex pipeline is tested on data simulated from the structure 70 kDa protein‐RNA complex RsmE/RsmZ. It recovers this structure with ensemble width and difference from ground truth both being on the order of 4.2 Å. EnsembleFit reduces the ensemble of the proliferating‐cell‐nuclear‐antigen‐associated factor p15(PAF) from 4,939 to 75 conformers while maintaining good fit quality of restraints. Local compaction analysis for the PaaA2 antitoxin from E. coli O157 revealed correlations between compactness and enhanced residual dipolar couplings in the original NMR restraint set.
format Online
Article
Text
id pubmed-7737775
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-77377752020-12-18 MMM: Integrative ensemble modeling and ensemble analysis Jeschke, Gunnar Protein Sci Tools for Protein Science Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a realistic ensemble width; (c) sufficient sampling of conformational space; (d) representation of the ensemble by an interpretable number of conformers; (e) recognition of weak order with site resolution. Here, I introduce several tools that address these problems, focusing on utilization of distance distribution information for estimating ensemble width. The RigiFlex approach integrates such information with high‐resolution structures of ordered domains and small‐angle scattering data. The EnsembleFit module provides moderately sized ensembles by fitting conformer populations and discarding conformers with low population. EnsembleFit balances the loss in fit quality upon combining restraint subsets from different techniques. Pair correlation analysis for residues and local compaction analysis help in feature detection. The RigiFlex pipeline is tested on data simulated from the structure 70 kDa protein‐RNA complex RsmE/RsmZ. It recovers this structure with ensemble width and difference from ground truth both being on the order of 4.2 Å. EnsembleFit reduces the ensemble of the proliferating‐cell‐nuclear‐antigen‐associated factor p15(PAF) from 4,939 to 75 conformers while maintaining good fit quality of restraints. Local compaction analysis for the PaaA2 antitoxin from E. coli O157 revealed correlations between compactness and enhanced residual dipolar couplings in the original NMR restraint set. John Wiley & Sons, Inc. 2020-10-17 2021-01 /pmc/articles/PMC7737775/ /pubmed/33015891 http://dx.doi.org/10.1002/pro.3965 Text en © 2020 The Protein Society This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Tools for Protein Science
Jeschke, Gunnar
MMM: Integrative ensemble modeling and ensemble analysis
title MMM: Integrative ensemble modeling and ensemble analysis
title_full MMM: Integrative ensemble modeling and ensemble analysis
title_fullStr MMM: Integrative ensemble modeling and ensemble analysis
title_full_unstemmed MMM: Integrative ensemble modeling and ensemble analysis
title_short MMM: Integrative ensemble modeling and ensemble analysis
title_sort mmm: integrative ensemble modeling and ensemble analysis
topic Tools for Protein Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737775/
https://www.ncbi.nlm.nih.gov/pubmed/33015891
http://dx.doi.org/10.1002/pro.3965
work_keys_str_mv AT jeschkegunnar mmmintegrativeensemblemodelingandensembleanalysis