Cargando…

Data-Driven Strategies for Accelerated Materials Design

[Image: see text] The ongoing revolution of the natural sciences by the advent of machine learning and artificial intelligence sparked significant interest in the material science community in recent years. The intrinsically high dimensionality of the space of realizable materials makes traditional...

Descripción completa

Detalles Bibliográficos
Autores principales: Pollice, Robert, dos Passos Gomes, Gabriel, Aldeghi, Matteo, Hickman, Riley J., Krenn, Mario, Lavigne, Cyrille, Lindner-D’Addario, Michael, Nigam, AkshatKumar, Ser, Cher Tian, Yao, Zhenpeng, Aspuru-Guzik, Alán
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7893702/
https://www.ncbi.nlm.nih.gov/pubmed/33528245
http://dx.doi.org/10.1021/acs.accounts.0c00785
_version_ 1783653099585404928
author Pollice, Robert
dos Passos Gomes, Gabriel
Aldeghi, Matteo
Hickman, Riley J.
Krenn, Mario
Lavigne, Cyrille
Lindner-D’Addario, Michael
Nigam, AkshatKumar
Ser, Cher Tian
Yao, Zhenpeng
Aspuru-Guzik, Alán
author_facet Pollice, Robert
dos Passos Gomes, Gabriel
Aldeghi, Matteo
Hickman, Riley J.
Krenn, Mario
Lavigne, Cyrille
Lindner-D’Addario, Michael
Nigam, AkshatKumar
Ser, Cher Tian
Yao, Zhenpeng
Aspuru-Guzik, Alán
author_sort Pollice, Robert
collection PubMed
description [Image: see text] The ongoing revolution of the natural sciences by the advent of machine learning and artificial intelligence sparked significant interest in the material science community in recent years. The intrinsically high dimensionality of the space of realizable materials makes traditional approaches ineffective for large-scale explorations. Modern data science and machine learning tools developed for increasingly complicated problems are an attractive alternative. An imminent climate catastrophe calls for a clean energy transformation by overhauling current technologies within only several years of possible action available. Tackling this crisis requires the development of new materials at an unprecedented pace and scale. For example, organic photovoltaics have the potential to replace existing silicon-based materials to a large extent and open up new fields of application. In recent years, organic light-emitting diodes have emerged as state-of-the-art technology for digital screens and portable devices and are enabling new applications with flexible displays. Reticular frameworks allow the atom-precise synthesis of nanomaterials and promise to revolutionize the field by the potential to realize multifunctional nanoparticles with applications from gas storage, gas separation, and electrochemical energy storage to nanomedicine. In the recent decade, significant advances in all these fields have been facilitated by the comprehensive application of simulation and machine learning for property prediction, property optimization, and chemical space exploration enabled by considerable advances in computing power and algorithmic efficiency. In this Account, we review the most recent contributions of our group in this thriving field of machine learning for material science. We start with a summary of the most important material classes our group has been involved in, focusing on small molecules as organic electronic materials and crystalline materials. Specifically, we highlight the data-driven approaches we employed to speed up discovery and derive material design strategies. Subsequently, our focus lies on the data-driven methodologies our group has developed and employed, elaborating on high-throughput virtual screening, inverse molecular design, Bayesian optimization, and supervised learning. We discuss the general ideas, their working principles, and their use cases with examples of successful implementations in data-driven material discovery and design efforts. Furthermore, we elaborate on potential pitfalls and remaining challenges of these methods. Finally, we provide a brief outlook for the field as we foresee increasing adaptation and implementation of large scale data-driven approaches in material discovery and design campaigns.
format Online
Article
Text
id pubmed-7893702
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-78937022021-02-22 Data-Driven Strategies for Accelerated Materials Design Pollice, Robert dos Passos Gomes, Gabriel Aldeghi, Matteo Hickman, Riley J. Krenn, Mario Lavigne, Cyrille Lindner-D’Addario, Michael Nigam, AkshatKumar Ser, Cher Tian Yao, Zhenpeng Aspuru-Guzik, Alán Acc Chem Res [Image: see text] The ongoing revolution of the natural sciences by the advent of machine learning and artificial intelligence sparked significant interest in the material science community in recent years. The intrinsically high dimensionality of the space of realizable materials makes traditional approaches ineffective for large-scale explorations. Modern data science and machine learning tools developed for increasingly complicated problems are an attractive alternative. An imminent climate catastrophe calls for a clean energy transformation by overhauling current technologies within only several years of possible action available. Tackling this crisis requires the development of new materials at an unprecedented pace and scale. For example, organic photovoltaics have the potential to replace existing silicon-based materials to a large extent and open up new fields of application. In recent years, organic light-emitting diodes have emerged as state-of-the-art technology for digital screens and portable devices and are enabling new applications with flexible displays. Reticular frameworks allow the atom-precise synthesis of nanomaterials and promise to revolutionize the field by the potential to realize multifunctional nanoparticles with applications from gas storage, gas separation, and electrochemical energy storage to nanomedicine. In the recent decade, significant advances in all these fields have been facilitated by the comprehensive application of simulation and machine learning for property prediction, property optimization, and chemical space exploration enabled by considerable advances in computing power and algorithmic efficiency. In this Account, we review the most recent contributions of our group in this thriving field of machine learning for material science. We start with a summary of the most important material classes our group has been involved in, focusing on small molecules as organic electronic materials and crystalline materials. Specifically, we highlight the data-driven approaches we employed to speed up discovery and derive material design strategies. Subsequently, our focus lies on the data-driven methodologies our group has developed and employed, elaborating on high-throughput virtual screening, inverse molecular design, Bayesian optimization, and supervised learning. We discuss the general ideas, their working principles, and their use cases with examples of successful implementations in data-driven material discovery and design efforts. Furthermore, we elaborate on potential pitfalls and remaining challenges of these methods. Finally, we provide a brief outlook for the field as we foresee increasing adaptation and implementation of large scale data-driven approaches in material discovery and design campaigns. American Chemical Society 2021-02-02 2021-02-16 /pmc/articles/PMC7893702/ /pubmed/33528245 http://dx.doi.org/10.1021/acs.accounts.0c00785 Text en © 2021 American Chemical Society This is an open access article published under a Creative Commons Attribution (CC-BY) License (http://pubs.acs.org/page/policy/authorchoice_ccby_termsofuse.html) , which permits unrestricted use, distribution and reproduction in any medium, provided the author and source are cited.
spellingShingle Pollice, Robert
dos Passos Gomes, Gabriel
Aldeghi, Matteo
Hickman, Riley J.
Krenn, Mario
Lavigne, Cyrille
Lindner-D’Addario, Michael
Nigam, AkshatKumar
Ser, Cher Tian
Yao, Zhenpeng
Aspuru-Guzik, Alán
Data-Driven Strategies for Accelerated Materials Design
title Data-Driven Strategies for Accelerated Materials Design
title_full Data-Driven Strategies for Accelerated Materials Design
title_fullStr Data-Driven Strategies for Accelerated Materials Design
title_full_unstemmed Data-Driven Strategies for Accelerated Materials Design
title_short Data-Driven Strategies for Accelerated Materials Design
title_sort data-driven strategies for accelerated materials design
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7893702/
https://www.ncbi.nlm.nih.gov/pubmed/33528245
http://dx.doi.org/10.1021/acs.accounts.0c00785
work_keys_str_mv AT pollicerobert datadrivenstrategiesforacceleratedmaterialsdesign
AT dospassosgomesgabriel datadrivenstrategiesforacceleratedmaterialsdesign
AT aldeghimatteo datadrivenstrategiesforacceleratedmaterialsdesign
AT hickmanrileyj datadrivenstrategiesforacceleratedmaterialsdesign
AT krennmario datadrivenstrategiesforacceleratedmaterialsdesign
AT lavignecyrille datadrivenstrategiesforacceleratedmaterialsdesign
AT lindnerdaddariomichael datadrivenstrategiesforacceleratedmaterialsdesign
AT nigamakshatkumar datadrivenstrategiesforacceleratedmaterialsdesign
AT serchertian datadrivenstrategiesforacceleratedmaterialsdesign
AT yaozhenpeng datadrivenstrategiesforacceleratedmaterialsdesign
AT aspuruguzikalan datadrivenstrategiesforacceleratedmaterialsdesign