Cargando…

Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches

Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computat...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhat, Vinayak, Sornberger, Parker, Pokuri, Balaji Sesha Sarath, Duke, Rebekah, Ganapathysubramanian, Baskar, Risko, Chad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9769113/
https://www.ncbi.nlm.nih.gov/pubmed/36605753
http://dx.doi.org/10.1039/d2sc04676h
_version_ 1784854317621051392
author Bhat, Vinayak
Sornberger, Parker
Pokuri, Balaji Sesha Sarath
Duke, Rebekah
Ganapathysubramanian, Baskar
Risko, Chad
author_facet Bhat, Vinayak
Sornberger, Parker
Pokuri, Balaji Sesha Sarath
Duke, Rebekah
Ganapathysubramanian, Baskar
Risko, Chad
author_sort Bhat, Vinayak
collection PubMed
description Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computational screening has proven to be a tremendous aid in this regard, machine learning (ML) and other data-driven methods can further enable orders of magnitude reduction in time while at the same time providing dramatic increases in the chemical space that is explored. However, the lack of benchmark datasets containing the electronic, redox, and optical properties that characterize the diverse, known chemical space of organic π-conjugated molecules limits ML model development. Here, we present a curated dataset containing 25k molecules with density functional theory (DFT) and time-dependent DFT (TDDFT) evaluated properties that include frontier molecular orbitals, ionization energies, relaxation energies, and low-lying optical excitation energies. Using the dataset, we train a hierarchy of ML models, ranging from classical models such as ridge regression to sophisticated graph neural networks, with molecular SMILES representation as input. We observe that graph neural networks augmented with contextual information allow for significantly better predictions across a wide array of properties. Our best-performing models also provide an uncertainty quantification for the predictions. To democratize access to the data and trained models, an interactive web platform has been developed and deployed.
format Online
Article
Text
id pubmed-9769113
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-97691132023-01-04 Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches Bhat, Vinayak Sornberger, Parker Pokuri, Balaji Sesha Sarath Duke, Rebekah Ganapathysubramanian, Baskar Risko, Chad Chem Sci Chemistry Accelerating the development of π-conjugated molecules for applications such as energy generation and storage, catalysis, sensing, pharmaceuticals, and (semi)conducting technologies requires rapid and accurate evaluation of the electronic, redox, or optical properties. While high-throughput computational screening has proven to be a tremendous aid in this regard, machine learning (ML) and other data-driven methods can further enable orders of magnitude reduction in time while at the same time providing dramatic increases in the chemical space that is explored. However, the lack of benchmark datasets containing the electronic, redox, and optical properties that characterize the diverse, known chemical space of organic π-conjugated molecules limits ML model development. Here, we present a curated dataset containing 25k molecules with density functional theory (DFT) and time-dependent DFT (TDDFT) evaluated properties that include frontier molecular orbitals, ionization energies, relaxation energies, and low-lying optical excitation energies. Using the dataset, we train a hierarchy of ML models, ranging from classical models such as ridge regression to sophisticated graph neural networks, with molecular SMILES representation as input. We observe that graph neural networks augmented with contextual information allow for significantly better predictions across a wide array of properties. Our best-performing models also provide an uncertainty quantification for the predictions. To democratize access to the data and trained models, an interactive web platform has been developed and deployed. The Royal Society of Chemistry 2022-11-17 /pmc/articles/PMC9769113/ /pubmed/36605753 http://dx.doi.org/10.1039/d2sc04676h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Bhat, Vinayak
Sornberger, Parker
Pokuri, Balaji Sesha Sarath
Duke, Rebekah
Ganapathysubramanian, Baskar
Risko, Chad
Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title_full Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title_fullStr Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title_full_unstemmed Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title_short Electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
title_sort electronic, redox, and optical property prediction of organic π-conjugated molecules through a hierarchy of machine learning approaches
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9769113/
https://www.ncbi.nlm.nih.gov/pubmed/36605753
http://dx.doi.org/10.1039/d2sc04676h
work_keys_str_mv AT bhatvinayak electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches
AT sornbergerparker electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches
AT pokuribalajiseshasarath electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches
AT dukerebekah electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches
AT ganapathysubramanianbaskar electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches
AT riskochad electronicredoxandopticalpropertypredictionoforganicpconjugatedmoleculesthroughahierarchyofmachinelearningapproaches