Cargando…

Benchmarking the acceleration of materials discovery by sequential learning

Sequential learning (SL) strategies, i.e. iteratively updating a machine learning model to guide experiments, have been proposed to significantly accelerate materials discovery and research. Applications on computational datasets and a handful of optimization experiments have demonstrated the promis...

Descripción completa

Detalles Bibliográficos
Autores principales: Rohr, Brian, Stein, Helge S., Guevarra, Dan, Wang, Yu, Haber, Joel A., Aykol, Muratahan, Suram, Santosh K., Gregoire, John M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8157525/
https://www.ncbi.nlm.nih.gov/pubmed/34084328
http://dx.doi.org/10.1039/c9sc05999g
_version_ 1783699700642217984
author Rohr, Brian
Stein, Helge S.
Guevarra, Dan
Wang, Yu
Haber, Joel A.
Aykol, Muratahan
Suram, Santosh K.
Gregoire, John M.
author_facet Rohr, Brian
Stein, Helge S.
Guevarra, Dan
Wang, Yu
Haber, Joel A.
Aykol, Muratahan
Suram, Santosh K.
Gregoire, John M.
author_sort Rohr, Brian
collection PubMed
description Sequential learning (SL) strategies, i.e. iteratively updating a machine learning model to guide experiments, have been proposed to significantly accelerate materials discovery and research. Applications on computational datasets and a handful of optimization experiments have demonstrated the promise of SL, motivating a quantitative evaluation of its ability to accelerate materials discovery, specifically in the case of physical experiments. The benchmarking effort in the present work quantifies the performance of SL algorithms with respect to a breadth of research goals: discovery of any “good” material, discovery of all “good” materials, and discovery of a model that accurately predicts the performance of new materials. To benchmark the effectiveness of different machine learning models against these goals, we use datasets in which the performance of all materials in the search space is known from high-throughput synthesis and electrochemistry experiments. Each dataset contains all pseudo-quaternary metal oxide combinations from a set of six elements (chemical space), the performance metric chosen is the electrocatalytic activity (overpotential) for the oxygen evolution reaction (OER). A diverse set of SL schemes is tested on four chemical spaces, each containing 2121 catalysts. The presented work suggests that research can be accelerated by up to a factor of 20 compared to random acquisition in specific scenarios. The results also show that certain choices of SL models are ill-suited for a given research goal resulting in substantial deceleration compared to random acquisition methods. The results provide quantitative guidance on how to tune an SL strategy for a given research goal and demonstrate the need for a new generation of materials-aware SL algorithms to further accelerate materials discovery.
format Online
Article
Text
id pubmed-8157525
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-81575252021-06-02 Benchmarking the acceleration of materials discovery by sequential learning Rohr, Brian Stein, Helge S. Guevarra, Dan Wang, Yu Haber, Joel A. Aykol, Muratahan Suram, Santosh K. Gregoire, John M. Chem Sci Chemistry Sequential learning (SL) strategies, i.e. iteratively updating a machine learning model to guide experiments, have been proposed to significantly accelerate materials discovery and research. Applications on computational datasets and a handful of optimization experiments have demonstrated the promise of SL, motivating a quantitative evaluation of its ability to accelerate materials discovery, specifically in the case of physical experiments. The benchmarking effort in the present work quantifies the performance of SL algorithms with respect to a breadth of research goals: discovery of any “good” material, discovery of all “good” materials, and discovery of a model that accurately predicts the performance of new materials. To benchmark the effectiveness of different machine learning models against these goals, we use datasets in which the performance of all materials in the search space is known from high-throughput synthesis and electrochemistry experiments. Each dataset contains all pseudo-quaternary metal oxide combinations from a set of six elements (chemical space), the performance metric chosen is the electrocatalytic activity (overpotential) for the oxygen evolution reaction (OER). A diverse set of SL schemes is tested on four chemical spaces, each containing 2121 catalysts. The presented work suggests that research can be accelerated by up to a factor of 20 compared to random acquisition in specific scenarios. The results also show that certain choices of SL models are ill-suited for a given research goal resulting in substantial deceleration compared to random acquisition methods. The results provide quantitative guidance on how to tune an SL strategy for a given research goal and demonstrate the need for a new generation of materials-aware SL algorithms to further accelerate materials discovery. The Royal Society of Chemistry 2020-01-29 /pmc/articles/PMC8157525/ /pubmed/34084328 http://dx.doi.org/10.1039/c9sc05999g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Rohr, Brian
Stein, Helge S.
Guevarra, Dan
Wang, Yu
Haber, Joel A.
Aykol, Muratahan
Suram, Santosh K.
Gregoire, John M.
Benchmarking the acceleration of materials discovery by sequential learning
title Benchmarking the acceleration of materials discovery by sequential learning
title_full Benchmarking the acceleration of materials discovery by sequential learning
title_fullStr Benchmarking the acceleration of materials discovery by sequential learning
title_full_unstemmed Benchmarking the acceleration of materials discovery by sequential learning
title_short Benchmarking the acceleration of materials discovery by sequential learning
title_sort benchmarking the acceleration of materials discovery by sequential learning
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8157525/
https://www.ncbi.nlm.nih.gov/pubmed/34084328
http://dx.doi.org/10.1039/c9sc05999g
work_keys_str_mv AT rohrbrian benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT steinhelges benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT guevarradan benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT wangyu benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT haberjoela benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT aykolmuratahan benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT suramsantoshk benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning
AT gregoirejohnm benchmarkingtheaccelerationofmaterialsdiscoverybysequentiallearning