Cargando…

Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics

A multivariate statistical theory, local feature analysis (LFA), extracts functionally relevant domains from molecular dynamics (MD) trajectories. The LFA representations, like those of principal component analysis (PCA), are low dimensional and provide a reduced basis set for collective motions of...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zhiyong, Wriggers, Willy
Formato: Texto
Lenguaje:English
Publicado: American Chemical Society 2008
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2651751/
https://www.ncbi.nlm.nih.gov/pubmed/18855436
http://dx.doi.org/10.1021/jp806291p
_version_ 1782165181742186496
author Zhang, Zhiyong
Wriggers, Willy
author_facet Zhang, Zhiyong
Wriggers, Willy
author_sort Zhang, Zhiyong
collection PubMed
description A multivariate statistical theory, local feature analysis (LFA), extracts functionally relevant domains from molecular dynamics (MD) trajectories. The LFA representations, like those of principal component analysis (PCA), are low dimensional and provide a reduced basis set for collective motions of simulated proteins, but the local features are sparsely distributed and spatially localized, in contrast to global PCA modes. One key problem in the assignment of local features is the coarse-graining of redundant LFA output functions by means of seed atoms. One can solve the combinatorial problem by adding seed atoms one after another to a growing set, minimizing a reconstruction error at each addition. This allows for an efficient implementation, but the sequential algorithm does not guarantee the optimal mutual correlation of the sequentially assigned features. Here, we present a novel coarse-graining algorithm for proteins that directly minimizes the mutual correlation of seed atoms by Monte Carlo (MC) simulations. Tests on MD trajectories of two biological systems, bacteriophage T4 lysozyme and myosin II motor domain S1, demonstrate that the new algorithm provides statistically reproducible results and describes functionally relevant dynamics. The well-known undersampling of large-scale motion by short MD simulations is apparent also in our model, but the new coarse-graining offers a major advantage over PCA; converged features are invariant across multiple windows of the trajectory, dividing the protein into converged regions and a smaller number of localized, undersampled regions. In addition to its use in structure classification, the proposed coarse-graining thus provides a localized measure of MD sampling efficiency.
format Text
id pubmed-2651751
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-26517512009-03-20 Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics Zhang, Zhiyong Wriggers, Willy J Phys Chem B A multivariate statistical theory, local feature analysis (LFA), extracts functionally relevant domains from molecular dynamics (MD) trajectories. The LFA representations, like those of principal component analysis (PCA), are low dimensional and provide a reduced basis set for collective motions of simulated proteins, but the local features are sparsely distributed and spatially localized, in contrast to global PCA modes. One key problem in the assignment of local features is the coarse-graining of redundant LFA output functions by means of seed atoms. One can solve the combinatorial problem by adding seed atoms one after another to a growing set, minimizing a reconstruction error at each addition. This allows for an efficient implementation, but the sequential algorithm does not guarantee the optimal mutual correlation of the sequentially assigned features. Here, we present a novel coarse-graining algorithm for proteins that directly minimizes the mutual correlation of seed atoms by Monte Carlo (MC) simulations. Tests on MD trajectories of two biological systems, bacteriophage T4 lysozyme and myosin II motor domain S1, demonstrate that the new algorithm provides statistically reproducible results and describes functionally relevant dynamics. The well-known undersampling of large-scale motion by short MD simulations is apparent also in our model, but the new coarse-graining offers a major advantage over PCA; converged features are invariant across multiple windows of the trajectory, dividing the protein into converged regions and a smaller number of localized, undersampled regions. In addition to its use in structure classification, the proposed coarse-graining thus provides a localized measure of MD sampling efficiency. American Chemical Society 2008-10-15 2008-11-06 /pmc/articles/PMC2651751/ /pubmed/18855436 http://dx.doi.org/10.1021/jp806291p Text en Copyright © 2008 American Chemical Society http://pubs.acs.org This is an open-access article distributed under the ACS AuthorChoice Terms & Conditions. Any use of this article, must conform to the terms of that license which are available at http://pubs.acs.org. 40.75
spellingShingle Zhang, Zhiyong
Wriggers, Willy
Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title_full Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title_fullStr Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title_full_unstemmed Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title_short Coarse-Graining Protein Structures With Local Multivariate Features from Molecular Dynamics
title_sort coarse-graining protein structures with local multivariate features from molecular dynamics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2651751/
https://www.ncbi.nlm.nih.gov/pubmed/18855436
http://dx.doi.org/10.1021/jp806291p
work_keys_str_mv AT zhangzhiyong coarsegrainingproteinstructureswithlocalmultivariatefeaturesfrommoleculardynamics
AT wriggerswilly coarsegrainingproteinstructureswithlocalmultivariatefeaturesfrommoleculardynamics