Cargando…

Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations

In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Tu, Chunyun, Huang, Weijiang, Liang, Sheng, Wang, Kui, Tian, Qin, Yan, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619240/
https://www.ncbi.nlm.nih.gov/pubmed/36349007
http://dx.doi.org/10.1039/d2ra05643g
_version_ 1784821232160473088
author Tu, Chunyun
Huang, Weijiang
Liang, Sheng
Wang, Kui
Tian, Qin
Yan, Wei
author_facet Tu, Chunyun
Huang, Weijiang
Liang, Sheng
Wang, Kui
Tian, Qin
Yan, Wei
author_sort Tu, Chunyun
collection PubMed
description In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the computational design of pure organic TADF molecules. By combining machine learning and quantum chemical calculations, using cheminformatics tools, and introducing the concept of selection and mutation from evolutionary theory, we have designed a computational program for HTVS of TADF molecular materials, especially the impact of selection strategy and structural mutations on the results of HTVS was explored. An initial compound library (size = 10(3)) constructed by enumeration of typical donors and acceptors was used to evolve by successively applying selection and 10 different structural mutations. And a group fingerprint similarity (Δ(MSPR)) index was proposed to account for the similarity between two compound libraries with comparable sizes. Based on the computed data, we have found that the mix of selection and mutations into the evolution map does have great impact on the HTVS results: (a) except the fast mutation Sub2, all the rest of the mutations can effectively concentrate ‘good’ molecules in a compound library, and hence give large material abundance (typically >0.8) for high mutation generations (n(g) ≥ 6). (b) The mean energy gap can exhibit a fast convergent trend toward very low values, hence the studied mutations (except Sub2) can cooperate very well with the studied DA substrates to generate optimal molecules, and the group fingerprint similarity can retain high enough values for large n(g), which can be associated with the apparent convergence in molecular skeletons as n(g) increases. (c) The distribution of skeleton frequencies for a specific mutation is generally uneven with one dominant skeleton. The overall numbers of common and generic cores for all mutations are 11 and 7 as n(g) = 9. Hence, in a sense, the ‘optimal’ skeletons seem unique and useful in realizing low energy gaps. With these observations and the development of related HTVS software, we expect to provide insight and tools to the research community of HTVS of molecular (TADF) materials.
format Online
Article
Text
id pubmed-9619240
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-96192402022-11-07 Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations Tu, Chunyun Huang, Weijiang Liang, Sheng Wang, Kui Tian, Qin Yan, Wei RSC Adv Chemistry In view of the theoretical importance and huge application potential of Thermally Activated Delayed Fluorescence (TADF) materials, it is of great significance to conduct High-Throughput Virtual Screening (HTVS) on compound libraries to find TADF candidate molecules. This research focuses on the computational design of pure organic TADF molecules. By combining machine learning and quantum chemical calculations, using cheminformatics tools, and introducing the concept of selection and mutation from evolutionary theory, we have designed a computational program for HTVS of TADF molecular materials, especially the impact of selection strategy and structural mutations on the results of HTVS was explored. An initial compound library (size = 10(3)) constructed by enumeration of typical donors and acceptors was used to evolve by successively applying selection and 10 different structural mutations. And a group fingerprint similarity (Δ(MSPR)) index was proposed to account for the similarity between two compound libraries with comparable sizes. Based on the computed data, we have found that the mix of selection and mutations into the evolution map does have great impact on the HTVS results: (a) except the fast mutation Sub2, all the rest of the mutations can effectively concentrate ‘good’ molecules in a compound library, and hence give large material abundance (typically >0.8) for high mutation generations (n(g) ≥ 6). (b) The mean energy gap can exhibit a fast convergent trend toward very low values, hence the studied mutations (except Sub2) can cooperate very well with the studied DA substrates to generate optimal molecules, and the group fingerprint similarity can retain high enough values for large n(g), which can be associated with the apparent convergence in molecular skeletons as n(g) increases. (c) The distribution of skeleton frequencies for a specific mutation is generally uneven with one dominant skeleton. The overall numbers of common and generic cores for all mutations are 11 and 7 as n(g) = 9. Hence, in a sense, the ‘optimal’ skeletons seem unique and useful in realizing low energy gaps. With these observations and the development of related HTVS software, we expect to provide insight and tools to the research community of HTVS of molecular (TADF) materials. The Royal Society of Chemistry 2022-10-31 /pmc/articles/PMC9619240/ /pubmed/36349007 http://dx.doi.org/10.1039/d2ra05643g Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by/3.0/
spellingShingle Chemistry
Tu, Chunyun
Huang, Weijiang
Liang, Sheng
Wang, Kui
Tian, Qin
Yan, Wei
Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title_full Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title_fullStr Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title_full_unstemmed Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title_short Combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
title_sort combining machine learning and quantum chemical calculations for high-throughput virtual screening of thermally activated delayed fluorescence molecular materials: the impact of selection strategy and structural mutations
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9619240/
https://www.ncbi.nlm.nih.gov/pubmed/36349007
http://dx.doi.org/10.1039/d2ra05643g
work_keys_str_mv AT tuchunyun combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations
AT huangweijiang combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations
AT liangsheng combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations
AT wangkui combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations
AT tianqin combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations
AT yanwei combiningmachinelearningandquantumchemicalcalculationsforhighthroughputvirtualscreeningofthermallyactivateddelayedfluorescencemolecularmaterialstheimpactofselectionstrategyandstructuralmutations