Cargando…

A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins

The limits of molecular dynamics (MD) simulations of macromolecules are steadily pushed forward by the relentless development of computer architectures and algorithms. The consequent explosion in the number and extent of MD trajectories induces the need for automated methods to rationalize the raw d...

Descripción completa

Detalles Bibliográficos
Autores principales: Errica, Federico, Giulini, Marco, Bacciu, Davide, Menichetti, Roberto, Micheli, Alessio, Potestio, Raffaello
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8116519/
https://www.ncbi.nlm.nih.gov/pubmed/33996896
http://dx.doi.org/10.3389/fmolb.2021.637396
_version_ 1783691410409521152
author Errica, Federico
Giulini, Marco
Bacciu, Davide
Menichetti, Roberto
Micheli, Alessio
Potestio, Raffaello
author_facet Errica, Federico
Giulini, Marco
Bacciu, Davide
Menichetti, Roberto
Micheli, Alessio
Potestio, Raffaello
author_sort Errica, Federico
collection PubMed
description The limits of molecular dynamics (MD) simulations of macromolecules are steadily pushed forward by the relentless development of computer architectures and algorithms. The consequent explosion in the number and extent of MD trajectories induces the need for automated methods to rationalize the raw data and make quantitative sense of them. Recently, an algorithmic approach was introduced by some of us to identify the subset of a protein’s atoms, or mapping, that enables the most informative description of the system. This method relies on the computation, for a given reduced representation, of the associated mapping entropy, that is, a measure of the information loss due to such simplification; albeit relatively straightforward, this calculation can be time-consuming. Here, we describe the implementation of a deep learning approach aimed at accelerating the calculation of the mapping entropy. We rely on Deep Graph Networks, which provide extreme flexibility in handling structured input data and whose predictions prove to be accurate and-remarkably efficient. The trained network produces a speedup factor as large as 10(5) with respect to the algorithmic computation of the mapping entropy, enabling the reconstruction of its landscape by means of the Wang–Landau sampling scheme. Applications of this method reach much further than this, as the proposed pipeline is easily transferable to the computation of arbitrary properties of a molecular structure.
format Online
Article
Text
id pubmed-8116519
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81165192021-05-14 A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins Errica, Federico Giulini, Marco Bacciu, Davide Menichetti, Roberto Micheli, Alessio Potestio, Raffaello Front Mol Biosci Molecular Biosciences The limits of molecular dynamics (MD) simulations of macromolecules are steadily pushed forward by the relentless development of computer architectures and algorithms. The consequent explosion in the number and extent of MD trajectories induces the need for automated methods to rationalize the raw data and make quantitative sense of them. Recently, an algorithmic approach was introduced by some of us to identify the subset of a protein’s atoms, or mapping, that enables the most informative description of the system. This method relies on the computation, for a given reduced representation, of the associated mapping entropy, that is, a measure of the information loss due to such simplification; albeit relatively straightforward, this calculation can be time-consuming. Here, we describe the implementation of a deep learning approach aimed at accelerating the calculation of the mapping entropy. We rely on Deep Graph Networks, which provide extreme flexibility in handling structured input data and whose predictions prove to be accurate and-remarkably efficient. The trained network produces a speedup factor as large as 10(5) with respect to the algorithmic computation of the mapping entropy, enabling the reconstruction of its landscape by means of the Wang–Landau sampling scheme. Applications of this method reach much further than this, as the proposed pipeline is easily transferable to the computation of arbitrary properties of a molecular structure. Frontiers Media S.A. 2021-04-29 /pmc/articles/PMC8116519/ /pubmed/33996896 http://dx.doi.org/10.3389/fmolb.2021.637396 Text en Copyright © 2021 Errica, Giulini, Bacciu, Menichetti, Micheli and Potestio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Errica, Federico
Giulini, Marco
Bacciu, Davide
Menichetti, Roberto
Micheli, Alessio
Potestio, Raffaello
A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title_full A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title_fullStr A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title_full_unstemmed A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title_short A Deep Graph Network–Enhanced Sampling Approach to Efficiently Explore the Space of Reduced Representations of Proteins
title_sort deep graph network–enhanced sampling approach to efficiently explore the space of reduced representations of proteins
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8116519/
https://www.ncbi.nlm.nih.gov/pubmed/33996896
http://dx.doi.org/10.3389/fmolb.2021.637396
work_keys_str_mv AT erricafederico adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT giulinimarco adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT bacciudavide adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT menichettiroberto adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT michelialessio adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT potestioraffaello adeepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT erricafederico deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT giulinimarco deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT bacciudavide deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT menichettiroberto deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT michelialessio deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins
AT potestioraffaello deepgraphnetworkenhancedsamplingapproachtoefficientlyexplorethespaceofreducedrepresentationsofproteins