Cargando…
Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost
Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnos...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9067623/ https://www.ncbi.nlm.nih.gov/pubmed/35655882 http://dx.doi.org/10.1039/d2sc00393g |
_version_ | 1784700044150046720 |
---|---|
author | Duan, Chenru Chu, Daniel B. K. Nandy, Aditya Kulik, Heather J. |
author_facet | Duan, Chenru Chu, Daniel B. K. Nandy, Aditya Kulik, Heather J. |
author_sort | Duan, Chenru |
collection | PubMed |
description | Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnostic indicates the MR effect on a chemical property prediction is not well established. We evaluate MR diagnostics for over 10 000 transition-metal complexes (TMCs) and compare to those for organic molecules. We observe that only some MR diagnostics are transferable from one chemical space to another. By studying the influence of MR character on chemical properties (i.e., MR effect) that involve multiple potential energy surfaces (i.e., adiabatic spin splitting, ΔE(H–L), and ionization potential, IP), we show that differences in MR character are more important than the cumulative degree of MR character in predicting the magnitude of an MR effect. Motivated by this observation, we build transfer learning models to predict CCSD(T)-level adiabatic ΔE(H–L) and IP from lower levels of theory. By combining these models with uncertainty quantification and multi-level modeling, we introduce a multi-pronged strategy that accelerates data acquisition by at least a factor of three while achieving coupled cluster accuracy (i.e., to within 1 kcal mol(−1) MAE) for robust VHTS. |
format | Online Article Text |
id | pubmed-9067623 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90676232022-06-01 Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost Duan, Chenru Chu, Daniel B. K. Nandy, Aditya Kulik, Heather J. Chem Sci Chemistry Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high-throughput screening (VHTS). Despite development of numerous MR diagnostics, the extent to which a single value of such a diagnostic indicates the MR effect on a chemical property prediction is not well established. We evaluate MR diagnostics for over 10 000 transition-metal complexes (TMCs) and compare to those for organic molecules. We observe that only some MR diagnostics are transferable from one chemical space to another. By studying the influence of MR character on chemical properties (i.e., MR effect) that involve multiple potential energy surfaces (i.e., adiabatic spin splitting, ΔE(H–L), and ionization potential, IP), we show that differences in MR character are more important than the cumulative degree of MR character in predicting the magnitude of an MR effect. Motivated by this observation, we build transfer learning models to predict CCSD(T)-level adiabatic ΔE(H–L) and IP from lower levels of theory. By combining these models with uncertainty quantification and multi-level modeling, we introduce a multi-pronged strategy that accelerates data acquisition by at least a factor of three while achieving coupled cluster accuracy (i.e., to within 1 kcal mol(−1) MAE) for robust VHTS. The Royal Society of Chemistry 2022-04-05 /pmc/articles/PMC9067623/ /pubmed/35655882 http://dx.doi.org/10.1039/d2sc00393g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Duan, Chenru Chu, Daniel B. K. Nandy, Aditya Kulik, Heather J. Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title | Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title_full | Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title_fullStr | Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title_full_unstemmed | Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title_short | Detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at DFT cost |
title_sort | detection of multi-reference character imbalances enables a transfer learning approach for virtual high throughput screening with coupled cluster accuracy at dft cost |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9067623/ https://www.ncbi.nlm.nih.gov/pubmed/35655882 http://dx.doi.org/10.1039/d2sc00393g |
work_keys_str_mv | AT duanchenru detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost AT chudanielbk detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost AT nandyaditya detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost AT kulikheatherj detectionofmultireferencecharacterimbalancesenablesatransferlearningapproachforvirtualhighthroughputscreeningwithcoupledclusteraccuracyatdftcost |