Cargando…
Agents for sequential learning using multiple-fidelity data
Sequential learning for materials discovery is a paradigm where a computational agent solicits new data to simultaneously update a model in service of exploration (finding the largest number of materials that meet some criteria) or exploitation (finding materials with an ideal figure of merit). In r...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933401/ https://www.ncbi.nlm.nih.gov/pubmed/35304496 http://dx.doi.org/10.1038/s41598-022-08413-8 |
_version_ | 1784671642861961216 |
---|---|
author | Palizhati, Aini Torrisi, Steven B. Aykol, Muratahan Suram, Santosh K. Hummelshøj, Jens S. Montoya, Joseph H. |
author_facet | Palizhati, Aini Torrisi, Steven B. Aykol, Muratahan Suram, Santosh K. Hummelshøj, Jens S. Montoya, Joseph H. |
author_sort | Palizhati, Aini |
collection | PubMed |
description | Sequential learning for materials discovery is a paradigm where a computational agent solicits new data to simultaneously update a model in service of exploration (finding the largest number of materials that meet some criteria) or exploitation (finding materials with an ideal figure of merit). In real-world discovery campaigns, new data acquisition may be costly and an optimal strategy may involve using and acquiring data with different levels of fidelity, such as first-principles calculation to supplement an experiment. In this work, we introduce agents which can operate on multiple data fidelities, and benchmark their performance on an emulated discovery campaign to find materials with desired band gap values. The fidelities of data come from the results of DFT calculations as low fidelity and experimental results as high fidelity. We demonstrate performance gains of agents which incorporate multi-fidelity data in two contexts: either using a large body of low fidelity data as a prior knowledge base or acquiring low fidelity data in-tandem with experimental data. This advance provides a tool that enables materials scientists to test various acquisition and model hyperparameters to maximize the discovery rate of their own multi-fidelity sequential learning campaigns for materials discovery. This may also serve as a reference point for those who are interested in practical strategies that can be used when multiple data sources are available for active or sequential learning campaigns. |
format | Online Article Text |
id | pubmed-8933401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-89334012022-03-28 Agents for sequential learning using multiple-fidelity data Palizhati, Aini Torrisi, Steven B. Aykol, Muratahan Suram, Santosh K. Hummelshøj, Jens S. Montoya, Joseph H. Sci Rep Article Sequential learning for materials discovery is a paradigm where a computational agent solicits new data to simultaneously update a model in service of exploration (finding the largest number of materials that meet some criteria) or exploitation (finding materials with an ideal figure of merit). In real-world discovery campaigns, new data acquisition may be costly and an optimal strategy may involve using and acquiring data with different levels of fidelity, such as first-principles calculation to supplement an experiment. In this work, we introduce agents which can operate on multiple data fidelities, and benchmark their performance on an emulated discovery campaign to find materials with desired band gap values. The fidelities of data come from the results of DFT calculations as low fidelity and experimental results as high fidelity. We demonstrate performance gains of agents which incorporate multi-fidelity data in two contexts: either using a large body of low fidelity data as a prior knowledge base or acquiring low fidelity data in-tandem with experimental data. This advance provides a tool that enables materials scientists to test various acquisition and model hyperparameters to maximize the discovery rate of their own multi-fidelity sequential learning campaigns for materials discovery. This may also serve as a reference point for those who are interested in practical strategies that can be used when multiple data sources are available for active or sequential learning campaigns. Nature Publishing Group UK 2022-03-18 /pmc/articles/PMC8933401/ /pubmed/35304496 http://dx.doi.org/10.1038/s41598-022-08413-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Palizhati, Aini Torrisi, Steven B. Aykol, Muratahan Suram, Santosh K. Hummelshøj, Jens S. Montoya, Joseph H. Agents for sequential learning using multiple-fidelity data |
title | Agents for sequential learning using multiple-fidelity data |
title_full | Agents for sequential learning using multiple-fidelity data |
title_fullStr | Agents for sequential learning using multiple-fidelity data |
title_full_unstemmed | Agents for sequential learning using multiple-fidelity data |
title_short | Agents for sequential learning using multiple-fidelity data |
title_sort | agents for sequential learning using multiple-fidelity data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933401/ https://www.ncbi.nlm.nih.gov/pubmed/35304496 http://dx.doi.org/10.1038/s41598-022-08413-8 |
work_keys_str_mv | AT palizhatiaini agentsforsequentiallearningusingmultiplefidelitydata AT torrisistevenb agentsforsequentiallearningusingmultiplefidelitydata AT aykolmuratahan agentsforsequentiallearningusingmultiplefidelitydata AT suramsantoshk agentsforsequentiallearningusingmultiplefidelitydata AT hummelshøjjenss agentsforsequentiallearningusingmultiplefidelitydata AT montoyajosephh agentsforsequentiallearningusingmultiplefidelitydata |