Cargando…

Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations

The ability to predict and understand complex molecular motions occurring over diverse timescales ranging from picoseconds to seconds and even hours in biological systems remains one of the largest challenges to chemical theory. Markov state models (MSMs), which provide a memoryless description of t...

Descripción completa

Detalles Bibliográficos
Autores principales: Dominic, Anthony J., Sayer, Thomas, Cao, Siqin, Markland, Thomas E., Huang, Xuhui, Montoya-Castillo, Andrés
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041170/
https://www.ncbi.nlm.nih.gov/pubmed/36920924
http://dx.doi.org/10.1073/pnas.2221048120
_version_ 1784912650094772224
author Dominic, Anthony J.
Sayer, Thomas
Cao, Siqin
Markland, Thomas E.
Huang, Xuhui
Montoya-Castillo, Andrés
author_facet Dominic, Anthony J.
Sayer, Thomas
Cao, Siqin
Markland, Thomas E.
Huang, Xuhui
Montoya-Castillo, Andrés
author_sort Dominic, Anthony J.
collection PubMed
description The ability to predict and understand complex molecular motions occurring over diverse timescales ranging from picoseconds to seconds and even hours in biological systems remains one of the largest challenges to chemical theory. Markov state models (MSMs), which provide a memoryless description of the transitions between different states of a biochemical system, have provided numerous important physically transparent insights into biological function. However, constructing these models often necessitates performing extremely long molecular simulations to converge the rates. Here, we show that by incorporating memory via the time-convolutionless generalized master equation (TCL-GME) one can build a theoretically transparent and physically intuitive memory-enriched model of biochemical processes with up to a three order of magnitude reduction in the simulation data required while also providing a higher temporal resolution. We derive the conditions under which the TCL-GME provides a more efficient means to capture slow dynamics than MSMs and rigorously prove when the two provide equally valid and efficient descriptions of the slow configurational dynamics. We further introduce a simple averaging procedure that enables our TCL-GME approach to quickly converge and accurately predict long-time dynamics even when parameterized with noisy reference data arising from short trajectories. We illustrate the advantages of the TCL-GME using alanine dipeptide, the human argonaute complex, and FiP35 WW domain.
format Online
Article
Text
id pubmed-10041170
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-100411702023-09-15 Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations Dominic, Anthony J. Sayer, Thomas Cao, Siqin Markland, Thomas E. Huang, Xuhui Montoya-Castillo, Andrés Proc Natl Acad Sci U S A Biological Sciences The ability to predict and understand complex molecular motions occurring over diverse timescales ranging from picoseconds to seconds and even hours in biological systems remains one of the largest challenges to chemical theory. Markov state models (MSMs), which provide a memoryless description of the transitions between different states of a biochemical system, have provided numerous important physically transparent insights into biological function. However, constructing these models often necessitates performing extremely long molecular simulations to converge the rates. Here, we show that by incorporating memory via the time-convolutionless generalized master equation (TCL-GME) one can build a theoretically transparent and physically intuitive memory-enriched model of biochemical processes with up to a three order of magnitude reduction in the simulation data required while also providing a higher temporal resolution. We derive the conditions under which the TCL-GME provides a more efficient means to capture slow dynamics than MSMs and rigorously prove when the two provide equally valid and efficient descriptions of the slow configurational dynamics. We further introduce a simple averaging procedure that enables our TCL-GME approach to quickly converge and accurately predict long-time dynamics even when parameterized with noisy reference data arising from short trajectories. We illustrate the advantages of the TCL-GME using alanine dipeptide, the human argonaute complex, and FiP35 WW domain. National Academy of Sciences 2023-03-15 2023-03-21 /pmc/articles/PMC10041170/ /pubmed/36920924 http://dx.doi.org/10.1073/pnas.2221048120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Dominic, Anthony J.
Sayer, Thomas
Cao, Siqin
Markland, Thomas E.
Huang, Xuhui
Montoya-Castillo, Andrés
Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title_full Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title_fullStr Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title_full_unstemmed Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title_short Building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
title_sort building insightful, memory-enriched models to capture long-time biochemical processes from short-time simulations
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10041170/
https://www.ncbi.nlm.nih.gov/pubmed/36920924
http://dx.doi.org/10.1073/pnas.2221048120
work_keys_str_mv AT dominicanthonyj buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations
AT sayerthomas buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations
AT caosiqin buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations
AT marklandthomase buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations
AT huangxuhui buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations
AT montoyacastilloandres buildinginsightfulmemoryenrichedmodelstocapturelongtimebiochemicalprocessesfromshorttimesimulations