Cargando…

MLatom 2: An Integrative Platform for Atomistic Machine Learning

Atomistic machine learning (AML) simulations are used in chemistry at an ever-increasing pace. A large number of AML models has been developed, but their implementations are scattered among different packages, each with its own conventions for input and output. Thus, here we give an overview of our...

Descripción completa

Detalles Bibliográficos
Autores principales: Dral, Pavlo O., Ge, Fuchun, Xue, Bao-Xin, Hou, Yi-Fan, Pinheiro, Max, Huang, Jianxing, Barbatti, Mario
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187220/
https://www.ncbi.nlm.nih.gov/pubmed/34101036
http://dx.doi.org/10.1007/s41061-021-00339-5
_version_ 1783705099978145792
author Dral, Pavlo O.
Ge, Fuchun
Xue, Bao-Xin
Hou, Yi-Fan
Pinheiro, Max
Huang, Jianxing
Barbatti, Mario
author_facet Dral, Pavlo O.
Ge, Fuchun
Xue, Bao-Xin
Hou, Yi-Fan
Pinheiro, Max
Huang, Jianxing
Barbatti, Mario
author_sort Dral, Pavlo O.
collection PubMed
description Atomistic machine learning (AML) simulations are used in chemistry at an ever-increasing pace. A large number of AML models has been developed, but their implementations are scattered among different packages, each with its own conventions for input and output. Thus, here we give an overview of our MLatom 2 software package, which provides an integrative platform for a wide variety of AML simulations by implementing from scratch and interfacing existing software for a range of state-of-the-art models. These include kernel method-based model types such as KREG (native implementation), sGDML, and GAP-SOAP as well as neural-network-based model types such as ANI, DeepPot-SE, and PhysNet. The theoretical foundations behind these methods are overviewed too. The modular structure of MLatom allows for easy extension to more AML model types. MLatom 2 also has many other capabilities useful for AML simulations, such as the support of custom descriptors, farthest-point and structure-based sampling, hyperparameter optimization, model evaluation, and automatic learning curve generation. It can also be used for such multi-step tasks as Δ-learning, self-correction approaches, and absorption spectrum simulation within the machine-learning nuclear-ensemble approach. Several of these MLatom 2 capabilities are showcased in application examples.
format Online
Article
Text
id pubmed-8187220
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-81872202021-06-11 MLatom 2: An Integrative Platform for Atomistic Machine Learning Dral, Pavlo O. Ge, Fuchun Xue, Bao-Xin Hou, Yi-Fan Pinheiro, Max Huang, Jianxing Barbatti, Mario Top Curr Chem (Cham) Review Atomistic machine learning (AML) simulations are used in chemistry at an ever-increasing pace. A large number of AML models has been developed, but their implementations are scattered among different packages, each with its own conventions for input and output. Thus, here we give an overview of our MLatom 2 software package, which provides an integrative platform for a wide variety of AML simulations by implementing from scratch and interfacing existing software for a range of state-of-the-art models. These include kernel method-based model types such as KREG (native implementation), sGDML, and GAP-SOAP as well as neural-network-based model types such as ANI, DeepPot-SE, and PhysNet. The theoretical foundations behind these methods are overviewed too. The modular structure of MLatom allows for easy extension to more AML model types. MLatom 2 also has many other capabilities useful for AML simulations, such as the support of custom descriptors, farthest-point and structure-based sampling, hyperparameter optimization, model evaluation, and automatic learning curve generation. It can also be used for such multi-step tasks as Δ-learning, self-correction approaches, and absorption spectrum simulation within the machine-learning nuclear-ensemble approach. Several of these MLatom 2 capabilities are showcased in application examples. Springer International Publishing 2021-06-08 2021 /pmc/articles/PMC8187220/ /pubmed/34101036 http://dx.doi.org/10.1007/s41061-021-00339-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Review
Dral, Pavlo O.
Ge, Fuchun
Xue, Bao-Xin
Hou, Yi-Fan
Pinheiro, Max
Huang, Jianxing
Barbatti, Mario
MLatom 2: An Integrative Platform for Atomistic Machine Learning
title MLatom 2: An Integrative Platform for Atomistic Machine Learning
title_full MLatom 2: An Integrative Platform for Atomistic Machine Learning
title_fullStr MLatom 2: An Integrative Platform for Atomistic Machine Learning
title_full_unstemmed MLatom 2: An Integrative Platform for Atomistic Machine Learning
title_short MLatom 2: An Integrative Platform for Atomistic Machine Learning
title_sort mlatom 2: an integrative platform for atomistic machine learning
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8187220/
https://www.ncbi.nlm.nih.gov/pubmed/34101036
http://dx.doi.org/10.1007/s41061-021-00339-5
work_keys_str_mv AT dralpavloo mlatom2anintegrativeplatformforatomisticmachinelearning
AT gefuchun mlatom2anintegrativeplatformforatomisticmachinelearning
AT xuebaoxin mlatom2anintegrativeplatformforatomisticmachinelearning
AT houyifan mlatom2anintegrativeplatformforatomisticmachinelearning
AT pinheiromax mlatom2anintegrativeplatformforatomisticmachinelearning
AT huangjianxing mlatom2anintegrativeplatformforatomisticmachinelearning
AT barbattimario mlatom2anintegrativeplatformforatomisticmachinelearning