Cargando…

Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds

[Image: see text] The degree of π orbital overlap (DPO) model has been demonstrated to be an excellent quantitative structure–property relationship (QSPR) that can map two-dimensional structural information of polycyclic aromatic hydrocarbons (PAHs) and thienoacenes to their electronic properties, n...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Tuan H., Nguyen, Lam H., Truong, Thanh N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9261278/
https://www.ncbi.nlm.nih.gov/pubmed/35811887
http://dx.doi.org/10.1021/acsomega.2c02650
_version_ 1784742238113234944
author Nguyen, Tuan H.
Nguyen, Lam H.
Truong, Thanh N.
author_facet Nguyen, Tuan H.
Nguyen, Lam H.
Truong, Thanh N.
author_sort Nguyen, Tuan H.
collection PubMed
description [Image: see text] The degree of π orbital overlap (DPO) model has been demonstrated to be an excellent quantitative structure–property relationship (QSPR) that can map two-dimensional structural information of polycyclic aromatic hydrocarbons (PAHs) and thienoacenes to their electronic properties, namely, band gaps, electron affinities, and ionization potentials. However, the model suffers from significant limitations that narrow its applications due to inefficient manual procedures in parameter optimization and descriptor formulation. In this work, we developed a machine learning (ML)-based method for efficiently optimizing DPO parameters and proposed a truncated DPO descriptor, which is simple enough that can be automatically extracted from simplified molecular-input line-entry system strings of PAHs and thienoacenes. Compared with the result from our previous studies, the ML-based methodology can optimize DPO parameters with four times fewer data, while it can achieve the same level of accuracy in predictions of the mentioned electronic properties to within 0.1 eV. The truncated DPO model also has similar accuracy to the full DPO model. Consequently, the ML-based DPO approach coupled with the truncated DPO model enables new possibilities for developing automatic pipelines for high-throughput screening and investigating new QSPR for new chemical classes.
format Online
Article
Text
id pubmed-9261278
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-92612782022-07-08 Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds Nguyen, Tuan H. Nguyen, Lam H. Truong, Thanh N. ACS Omega [Image: see text] The degree of π orbital overlap (DPO) model has been demonstrated to be an excellent quantitative structure–property relationship (QSPR) that can map two-dimensional structural information of polycyclic aromatic hydrocarbons (PAHs) and thienoacenes to their electronic properties, namely, band gaps, electron affinities, and ionization potentials. However, the model suffers from significant limitations that narrow its applications due to inefficient manual procedures in parameter optimization and descriptor formulation. In this work, we developed a machine learning (ML)-based method for efficiently optimizing DPO parameters and proposed a truncated DPO descriptor, which is simple enough that can be automatically extracted from simplified molecular-input line-entry system strings of PAHs and thienoacenes. Compared with the result from our previous studies, the ML-based methodology can optimize DPO parameters with four times fewer data, while it can achieve the same level of accuracy in predictions of the mentioned electronic properties to within 0.1 eV. The truncated DPO model also has similar accuracy to the full DPO model. Consequently, the ML-based DPO approach coupled with the truncated DPO model enables new possibilities for developing automatic pipelines for high-throughput screening and investigating new QSPR for new chemical classes. American Chemical Society 2022-06-17 /pmc/articles/PMC9261278/ /pubmed/35811887 http://dx.doi.org/10.1021/acsomega.2c02650 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by-nc-nd/4.0/Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Nguyen, Tuan H.
Nguyen, Lam H.
Truong, Thanh N.
Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title_full Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title_fullStr Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title_full_unstemmed Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title_short Application of Machine Learning in Developing Quantitative Structure–Property Relationship for Electronic Properties of Polyaromatic Compounds
title_sort application of machine learning in developing quantitative structure–property relationship for electronic properties of polyaromatic compounds
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9261278/
https://www.ncbi.nlm.nih.gov/pubmed/35811887
http://dx.doi.org/10.1021/acsomega.2c02650
work_keys_str_mv AT nguyentuanh applicationofmachinelearningindevelopingquantitativestructurepropertyrelationshipforelectronicpropertiesofpolyaromaticcompounds
AT nguyenlamh applicationofmachinelearningindevelopingquantitativestructurepropertyrelationshipforelectronicpropertiesofpolyaromaticcompounds
AT truongthanhn applicationofmachinelearningindevelopingquantitativestructurepropertyrelationshipforelectronicpropertiesofpolyaromaticcompounds