Cargando…

Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost

Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnos...

Descripción completa

Detalles Bibliográficos
Autores principales: Duan, Chenru, Chu, Daniel B. K., Nandy, Aditya, Kulik, Heather J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9067623/
https://www.ncbi.nlm.nih.gov/pubmed/35655882
http://dx.doi.org/10.1039/d2sc00393g
_version_ 1784700044150046720
author Duan, Chenru
Chu, Daniel B. K.
Nandy, Aditya
Kulik, Heather J.
author_facet Duan, Chenru
Chu, Daniel B. K.
Nandy, Aditya
Kulik, Heather J.
author_sort Duan, Chenru
collection PubMed
description Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnostic indicates the MR effect on a chemical property prediction is not well established. We evaluate MR diagnostics for over 10 000 transition-metal complexes (TMCs) and compare to those for organic molecules. We observe that only some MR diagnostics are transferable from one chemical space to another. By studying the influence of MR character on chemical properties (i.e., MR effect) that involve multiple potential energy surfaces (i.e., adiabatic spin splitting, ΔE(H–L), and ionization potential, IP), we show that differences in MR character are more important than the cumulative degree of MR character in predicting the magnitude of an MR effect. Motivated by this observation, we build transfer learning models to predict CCSD(T)-level adiabatic ΔE(H–L) and IP from lower levels of theory. By combining these models with uncertainty quantification and multi-level modeling, we introduce a multi-pronged strategy that accelerates data acquisition by at least a factor of three while achieving coupled cluster accuracy (i.e., to within 1 kcal mol(−1) MAE) for robust VHTS.
format Online
Article
Text
id pubmed-9067623
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90676232022-06-01 Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost Duan, Chenru Chu, Daniel B. K. Nandy, Aditya Kulik, Heather J. Chem Sci Chemistry Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnostic indicates the MR effect on a chemical property prediction is not well established. We evaluate MR diagnostics for over 10 000 transition-metal complexes (TMCs) and compare to those for organic molecules. We observe that only some MR diagnostics are transferable from one chemical space to another. By studying the influence of MR character on chemical properties (i.e., MR effect) that involve multiple potential energy surfaces (i.e., adiabatic spin splitting, ΔE(H–L), and ionization potential, IP), we show that differences in MR character are more important than the cumulative degree of MR character in predicting the magnitude of an MR effect. Motivated by this observation, we build transfer learning models to predict CCSD(T)-level adiabatic ΔE(H–L) and IP from lower levels of theory. By combining these models with uncertainty quantification and multi-level modeling, we introduce a multi-pronged strategy that accelerates data acquisition by at least a factor of three while achieving coupled cluster accuracy (i.e., to within 1 kcal mol(−1) MAE) for robust VHTS. The Royal Society of Chemistry 2022-04-05 /pmc/articles/PMC9067623/ /pubmed/35655882 http://dx.doi.org/10.1039/d2sc00393g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Duan, Chenru
Chu, Daniel B. K.
Nandy, Aditya
Kulik, Heather J.
Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title_full Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title_fullStr Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title_full_unstemmed Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title_short Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
title_sort detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at dft cost
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9067623/
https://www.ncbi.nlm.nih.gov/pubmed/35655882
http://dx.doi.org/10.1039/d2sc00393g
work_keys_str_mv AT duanchenru detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost
AT chudanielbk detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost
AT nandyaditya detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost
AT kulikheatherj detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost