Cargando…

Molecular Insights from Conformational Ensembles via Machine Learning

Biomolecular simulations are intrinsically high dimensional and generate noisy data sets of ever-increasing size. Extracting important features from the data is crucial for understanding the biophysical properties of molecular processes, but remains a big challenge. Machine learning (ML) provides po...

Descripción completa

Detalles Bibliográficos
Autores principales: Fleetwood, Oliver, Kasimova, Marina A., Westerlund, Annie M., Delemotte, Lucie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Biophysical Society 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7002924/
https://www.ncbi.nlm.nih.gov/pubmed/31952811
http://dx.doi.org/10.1016/j.bpj.2019.12.016
_version_ 1783494446260682752
author Fleetwood, Oliver
Kasimova, Marina A.
Westerlund, Annie M.
Delemotte, Lucie
author_facet Fleetwood, Oliver
Kasimova, Marina A.
Westerlund, Annie M.
Delemotte, Lucie
author_sort Fleetwood, Oliver
collection PubMed
description Biomolecular simulations are intrinsically high dimensional and generate noisy data sets of ever-increasing size. Extracting important features from the data is crucial for understanding the biophysical properties of molecular processes, but remains a big challenge. Machine learning (ML) provides powerful dimensionality reduction tools. However, such methods are often criticized as resembling black boxes with limited human-interpretable insight. We use methods from supervised and unsupervised ML to efficiently create interpretable maps of important features from molecular simulations. We benchmark the performance of several methods, including neural networks, random forests, and principal component analysis, using a toy model with properties reminiscent of macromolecular behavior. We then analyze three diverse biological processes: conformational changes within the soluble protein calmodulin, ligand binding to a G protein-coupled receptor, and activation of an ion channel voltage-sensor domain, unraveling features critical for signal transduction, ligand binding, and voltage sensing. This work demonstrates the usefulness of ML in understanding biomolecular states and demystifying complex simulations.
format Online
Article
Text
id pubmed-7002924
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Biophysical Society
record_format MEDLINE/PubMed
spelling pubmed-70029242020-10-10 Molecular Insights from Conformational Ensembles via Machine Learning Fleetwood, Oliver Kasimova, Marina A. Westerlund, Annie M. Delemotte, Lucie Biophys J Articles Biomolecular simulations are intrinsically high dimensional and generate noisy data sets of ever-increasing size. Extracting important features from the data is crucial for understanding the biophysical properties of molecular processes, but remains a big challenge. Machine learning (ML) provides powerful dimensionality reduction tools. However, such methods are often criticized as resembling black boxes with limited human-interpretable insight. We use methods from supervised and unsupervised ML to efficiently create interpretable maps of important features from molecular simulations. We benchmark the performance of several methods, including neural networks, random forests, and principal component analysis, using a toy model with properties reminiscent of macromolecular behavior. We then analyze three diverse biological processes: conformational changes within the soluble protein calmodulin, ligand binding to a G protein-coupled receptor, and activation of an ion channel voltage-sensor domain, unraveling features critical for signal transduction, ligand binding, and voltage sensing. This work demonstrates the usefulness of ML in understanding biomolecular states and demystifying complex simulations. The Biophysical Society 2020-02-04 2019-12-21 /pmc/articles/PMC7002924/ /pubmed/31952811 http://dx.doi.org/10.1016/j.bpj.2019.12.016 Text en © 2019 Biophysical Society. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Articles
Fleetwood, Oliver
Kasimova, Marina A.
Westerlund, Annie M.
Delemotte, Lucie
Molecular Insights from Conformational Ensembles via Machine Learning
title Molecular Insights from Conformational Ensembles via Machine Learning
title_full Molecular Insights from Conformational Ensembles via Machine Learning
title_fullStr Molecular Insights from Conformational Ensembles via Machine Learning
title_full_unstemmed Molecular Insights from Conformational Ensembles via Machine Learning
title_short Molecular Insights from Conformational Ensembles via Machine Learning
title_sort molecular insights from conformational ensembles via machine learning
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7002924/
https://www.ncbi.nlm.nih.gov/pubmed/31952811
http://dx.doi.org/10.1016/j.bpj.2019.12.016
work_keys_str_mv AT fleetwoodoliver molecularinsightsfromconformationalensemblesviamachinelearning
AT kasimovamarinaa molecularinsightsfromconformationalensemblesviamachinelearning
AT westerlundanniem molecularinsightsfromconformationalensemblesviamachinelearning
AT delemottelucie molecularinsightsfromconformationalensemblesviamachinelearning