Cargando…
Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the comp...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619240/ https://www.ncbi.nlm.nih.gov/pubmed/36349007 http://dx.doi.org/10.1039/d2ra05643g |
_version_ | 1784821232160473088 |
---|---|
author | Tu, Chunyun Huang, Weijiang Liang, Sheng Wang, Kui Tian, Qin Yan, Wei |
author_facet | Tu, Chunyun Huang, Weijiang Liang, Sheng Wang, Kui Tian, Qin Yan, Wei |
author_sort | Tu, Chunyun |
collection | PubMed |
description | In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the computational design of pure organic TADF molecules. By combining machine learning and quantum chemical calculations, using cheminformatics tools, and introducing the concept of selection and mutation from evolutionary theory, we have designed a computational program for HTVS of TADF molecular materials, especially the impact of selection strategy and structural mutations on the results of HTVS was explored. An initial compound library (size = 10(3)) constructed by enumeration of typical donors and acceptors was used to evolve by successively applying selection and 10 different structural mutations. And a group fingerprint similarity (Δ(MSPR)) index was proposed to account for the similarity between two compound libraries with comparable sizes. Based on the computed data, we have found that the mix of selection and mutations into the evolution map does have great impact on the HTVS results: (a) except the fast mutation Sub2, all the rest of the mutations can effectively concentrate ‘good’ molecules in a compound library, and hence give large material abundance (typically >0.8) for high mutation generations (n(g) ≥ 6). (b) The mean energy gap can exhibit a fast convergent trend toward very low values, hence the studied mutations (except Sub2) can cooperate very well with the studied DA substrates to generate optimal molecules, and the group fingerprint similarity can retain high enough values for large n(g), which can be associated with the apparent convergence in molecular skeletons as n(g) increases. (c) The distribution of skeleton frequencies for a specific mutation is generally uneven with one dominant skeleton. The overall numbers of common and generic cores for all mutations are 11 and 7 as n(g) = 9. Hence, in a sense, the ‘optimal’ skeletons seem unique and useful in realizing low energy gaps. With these observations and the development of related HTVS software, we expect to provide insight and tools to the research community of HTVS of molecular (TADF) materials. |
format | Online Article Text |
id | pubmed-9619240 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-96192402022-11-07 Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations Tu, Chunyun Huang, Weijiang Liang, Sheng Wang, Kui Tian, Qin Yan, Wei RSC Adv Chemistry In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the computational design of pure organic TADF molecules. By combining machine learning and quantum chemical calculations, using cheminformatics tools, and introducing the concept of selection and mutation from evolutionary theory, we have designed a computational program for HTVS of TADF molecular materials, especially the impact of selection strategy and structural mutations on the results of HTVS was explored. An initial compound library (size = 10(3)) constructed by enumeration of typical donors and acceptors was used to evolve by successively applying selection and 10 different structural mutations. And a group fingerprint similarity (Δ(MSPR)) index was proposed to account for the similarity between two compound libraries with comparable sizes. Based on the computed data, we have found that the mix of selection and mutations into the evolution map does have great impact on the HTVS results: (a) except the fast mutation Sub2, all the rest of the mutations can effectively concentrate ‘good’ molecules in a compound library, and hence give large material abundance (typically >0.8) for high mutation generations (n(g) ≥ 6). (b) The mean energy gap can exhibit a fast convergent trend toward very low values, hence the studied mutations (except Sub2) can cooperate very well with the studied DA substrates to generate optimal molecules, and the group fingerprint similarity can retain high enough values for large n(g), which can be associated with the apparent convergence in molecular skeletons as n(g) increases. (c) The distribution of skeleton frequencies for a specific mutation is generally uneven with one dominant skeleton. The overall numbers of common and generic cores for all mutations are 11 and 7 as n(g) = 9. Hence, in a sense, the ‘optimal’ skeletons seem unique and useful in realizing low energy gaps. With these observations and the development of related HTVS software, we expect to provide insight and tools to the research community of HTVS of molecular (TADF) materials. The Royal Society of Chemistry 2022-10-31 /pmc/articles/PMC9619240/ /pubmed/36349007 http://dx.doi.org/10.1039/d2ra05643g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/ |
spellingShingle | Chemistry Tu, Chunyun Huang, Weijiang Liang, Sheng Wang, Kui Tian, Qin Yan, Wei Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title | Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title_full | Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title_fullStr | Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title_full_unstemmed | Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title_short | Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
title_sort | combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619240/ https://www.ncbi.nlm.nih.gov/pubmed/36349007 http://dx.doi.org/10.1039/d2ra05643g |
work_keys_str_mv | AT tuchunyun combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations AT huangweijiang combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations AT liangsheng combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations AT wangkui combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations AT tianqin combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations AT yanwei combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations |