Cargando…
Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computat...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9769113/ https://www.ncbi.nlm.nih.gov/pubmed/36605753 http://dx.doi.org/10.1039/d2sc04676h |
_version_ | 1784854317621051392 |
---|---|
author | Bhat, Vinayak Sornberger, Parker Pokuri, Balaji Sesha Sarath Duke, Rebekah Ganapathysubramanian, Baskar Risko, Chad |
author_facet | Bhat, Vinayak Sornberger, Parker Pokuri, Balaji Sesha Sarath Duke, Rebekah Ganapathysubramanian, Baskar Risko, Chad |
author_sort | Bhat, Vinayak |
collection | PubMed |
description | Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computational screening has proven to be a tremendous aid in this regard, machine learning (ML) and other data-driven methods can further enable orders of magnitude reduction in time while at the same time providing dramatic increases in the chemical space that is explored. However, the lack of benchmark datasets containing the electronic, redox, and optical properties that characterize the diverse, known chemical space of organic π-conjugated molecules limits ML model development. Here, we present a curated dataset containing 25k molecules with density functional theory (DFT) and time-dependent DFT (TDDFT) evaluated properties that include frontier molecular orbitals, ionization energies, relaxation energies, and low-lying optical excitation energies. Using the dataset, we train a hierarchy of ML models, ranging from classical models such as ridge regression to sophisticated graph neural networks, with molecular SMILES representation as input. We observe that graph neural networks augmented with contextual information allow for significantly better predictions across a wide array of properties. Our best-performing models also provide an uncertainty quantification for the predictions. To democratize access to the data and trained models, an interactive web platform has been developed and deployed. |
format | Online Article Text |
id | pubmed-9769113 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-97691132023-01-04 Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches Bhat, Vinayak Sornberger, Parker Pokuri, Balaji Sesha Sarath Duke, Rebekah Ganapathysubramanian, Baskar Risko, Chad Chem Sci Chemistry Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computational screening has proven to be a tremendous aid in this regard, machine learning (ML) and other data-driven methods can further enable orders of magnitude reduction in time while at the same time providing dramatic increases in the chemical space that is explored. However, the lack of benchmark datasets containing the electronic, redox, and optical properties that characterize the diverse, known chemical space of organic π-conjugated molecules limits ML model development. Here, we present a curated dataset containing 25k molecules with density functional theory (DFT) and time-dependent DFT (TDDFT) evaluated properties that include frontier molecular orbitals, ionization energies, relaxation energies, and low-lying optical excitation energies. Using the dataset, we train a hierarchy of ML models, ranging from classical models such as ridge regression to sophisticated graph neural networks, with molecular SMILES representation as input. We observe that graph neural networks augmented with contextual information allow for significantly better predictions across a wide array of properties. Our best-performing models also provide an uncertainty quantification for the predictions. To democratize access to the data and trained models, an interactive web platform has been developed and deployed. The Royal Society of Chemistry 2022-11-17 /pmc/articles/PMC9769113/ /pubmed/36605753 http://dx.doi.org/10.1039/d2sc04676h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Bhat, Vinayak Sornberger, Parker Pokuri, Balaji Sesha Sarath Duke, Rebekah Ganapathysubramanian, Baskar Risko, Chad Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title | Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title_full | Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title_fullStr | Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title_full_unstemmed | Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title_short | Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
title_sort | electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9769113/ https://www.ncbi.nlm.nih.gov/pubmed/36605753 http://dx.doi.org/10.1039/d2sc04676h |
work_keys_str_mv | AT bhatvinayak electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches AT sornbergerparker electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches AT pokuribalajiseshasarath electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches AT dukerebekah electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches AT ganapathysubramanianbaskar electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches AT riskochad electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches |