Cargando…

Unsupervised Learning Methods for Molecular Simulation Data

[Image: see text] Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the...

Descripción completa

Detalles Bibliográficos
Autores principales: Glielmo, Aldo, Husic, Brooke E., Rodriguez, Alex, Clementi, Cecilia, Noé, Frank, Laio, Alessandro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8391792/
https://www.ncbi.nlm.nih.gov/pubmed/33945269
http://dx.doi.org/10.1021/acs.chemrev.0c01195
_version_ 1783743355309522944
author Glielmo, Aldo
Husic, Brooke E.
Rodriguez, Alex
Clementi, Cecilia
Noé, Frank
Laio, Alessandro
author_facet Glielmo, Aldo
Husic, Brooke E.
Rodriguez, Alex
Clementi, Cecilia
Noé, Frank
Laio, Alessandro
author_sort Glielmo, Aldo
collection PubMed
description [Image: see text] Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used-or can be used-to analyze molecular simulation data.
format Online
Article
Text
id pubmed-8391792
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-83917922021-08-31 Unsupervised Learning Methods for Molecular Simulation Data Glielmo, Aldo Husic, Brooke E. Rodriguez, Alex Clementi, Cecilia Noé, Frank Laio, Alessandro Chem Rev [Image: see text] Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used-or can be used-to analyze molecular simulation data. American Chemical Society 2021-05-04 2021-08-25 /pmc/articles/PMC8391792/ /pubmed/33945269 http://dx.doi.org/10.1021/acs.chemrev.0c01195 Text en © 2021 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Glielmo, Aldo
Husic, Brooke E.
Rodriguez, Alex
Clementi, Cecilia
Noé, Frank
Laio, Alessandro
Unsupervised Learning Methods for Molecular Simulation Data
title Unsupervised Learning Methods for Molecular Simulation Data
title_full Unsupervised Learning Methods for Molecular Simulation Data
title_fullStr Unsupervised Learning Methods for Molecular Simulation Data
title_full_unstemmed Unsupervised Learning Methods for Molecular Simulation Data
title_short Unsupervised Learning Methods for Molecular Simulation Data
title_sort unsupervised learning methods for molecular simulation data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8391792/
https://www.ncbi.nlm.nih.gov/pubmed/33945269
http://dx.doi.org/10.1021/acs.chemrev.0c01195
work_keys_str_mv AT glielmoaldo unsupervisedlearningmethodsformolecularsimulationdata
AT husicbrookee unsupervisedlearningmethodsformolecularsimulationdata
AT rodriguezalex unsupervisedlearningmethodsformolecularsimulationdata
AT clementicecilia unsupervisedlearningmethodsformolecularsimulationdata
AT noefrank unsupervisedlearningmethodsformolecularsimulationdata
AT laioalessandro unsupervisedlearningmethodsformolecularsimulationdata