Cargando…
MMM: Integrative ensemble modeling and ensemble analysis
Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737775/ https://www.ncbi.nlm.nih.gov/pubmed/33015891 http://dx.doi.org/10.1002/pro.3965 |
_version_ | 1783622991809085440 |
---|---|
author | Jeschke, Gunnar |
author_facet | Jeschke, Gunnar |
author_sort | Jeschke, Gunnar |
collection | PubMed |
description | Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a realistic ensemble width; (c) sufficient sampling of conformational space; (d) representation of the ensemble by an interpretable number of conformers; (e) recognition of weak order with site resolution. Here, I introduce several tools that address these problems, focusing on utilization of distance distribution information for estimating ensemble width. The RigiFlex approach integrates such information with high‐resolution structures of ordered domains and small‐angle scattering data. The EnsembleFit module provides moderately sized ensembles by fitting conformer populations and discarding conformers with low population. EnsembleFit balances the loss in fit quality upon combining restraint subsets from different techniques. Pair correlation analysis for residues and local compaction analysis help in feature detection. The RigiFlex pipeline is tested on data simulated from the structure 70 kDa protein‐RNA complex RsmE/RsmZ. It recovers this structure with ensemble width and difference from ground truth both being on the order of 4.2 Å. EnsembleFit reduces the ensemble of the proliferating‐cell‐nuclear‐antigen‐associated factor p15(PAF) from 4,939 to 75 conformers while maintaining good fit quality of restraints. Local compaction analysis for the PaaA2 antitoxin from E. coli O157 revealed correlations between compactness and enhanced residual dipolar couplings in the original NMR restraint set. |
format | Online Article Text |
id | pubmed-7737775 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-77377752020-12-18 MMM: Integrative ensemble modeling and ensemble analysis Jeschke, Gunnar Protein Sci Tools for Protein Science Proteins and their complexes can be heterogeneously disordered. In ensemble modeling of such systems with restraints from several experimental techniques the following problems arise: (a) integration of diverse restraints obtained on different samples under different conditions; (b) estimation of a realistic ensemble width; (c) sufficient sampling of conformational space; (d) representation of the ensemble by an interpretable number of conformers; (e) recognition of weak order with site resolution. Here, I introduce several tools that address these problems, focusing on utilization of distance distribution information for estimating ensemble width. The RigiFlex approach integrates such information with high‐resolution structures of ordered domains and small‐angle scattering data. The EnsembleFit module provides moderately sized ensembles by fitting conformer populations and discarding conformers with low population. EnsembleFit balances the loss in fit quality upon combining restraint subsets from different techniques. Pair correlation analysis for residues and local compaction analysis help in feature detection. The RigiFlex pipeline is tested on data simulated from the structure 70 kDa protein‐RNA complex RsmE/RsmZ. It recovers this structure with ensemble width and difference from ground truth both being on the order of 4.2 Å. EnsembleFit reduces the ensemble of the proliferating‐cell‐nuclear‐antigen‐associated factor p15(PAF) from 4,939 to 75 conformers while maintaining good fit quality of restraints. Local compaction analysis for the PaaA2 antitoxin from E. coli O157 revealed correlations between compactness and enhanced residual dipolar couplings in the original NMR restraint set. John Wiley & Sons, Inc. 2020-10-17 2021-01 /pmc/articles/PMC7737775/ /pubmed/33015891 http://dx.doi.org/10.1002/pro.3965 Text en © 2020 The Protein Society This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Tools for Protein Science Jeschke, Gunnar MMM: Integrative ensemble modeling and ensemble analysis |
title |
MMM: Integrative ensemble modeling and ensemble analysis |
title_full |
MMM: Integrative ensemble modeling and ensemble analysis |
title_fullStr |
MMM: Integrative ensemble modeling and ensemble analysis |
title_full_unstemmed |
MMM: Integrative ensemble modeling and ensemble analysis |
title_short |
MMM: Integrative ensemble modeling and ensemble analysis |
title_sort | mmm: integrative ensemble modeling and ensemble analysis |
topic | Tools for Protein Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7737775/ https://www.ncbi.nlm.nih.gov/pubmed/33015891 http://dx.doi.org/10.1002/pro.3965 |
work_keys_str_mv | AT jeschkegunnar mmmintegrativeensemblemodelingandensembleanalysis |