Cargando…

Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach

Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteri...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Zong-Rong, Huang, I.-Shou, Chan, Yu-Te, Li, Zhong-Ji, Liao, Chen-Cheng, Tsai, Hao-Rong, Hsieh, Meng-Chi, Chang, Chun-Chih, Tsai, Ming-Kang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society of Chemistry 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9054811/
https://www.ncbi.nlm.nih.gov/pubmed/35517310
http://dx.doi.org/10.1039/d0ra05014h
_version_ 1784697275333738496
author Ye, Zong-Rong
Huang, I.-Shou
Chan, Yu-Te
Li, Zhong-Ji
Liao, Chen-Cheng
Tsai, Hao-Rong
Hsieh, Meng-Chi
Chang, Chun-Chih
Tsai, Ming-Kang
author_facet Ye, Zong-Rong
Huang, I.-Shou
Chan, Yu-Te
Li, Zhong-Ji
Liao, Chen-Cheng
Tsai, Hao-Rong
Hsieh, Meng-Chi
Chang, Chun-Chih
Tsai, Ming-Kang
author_sort Ye, Zong-Rong
collection PubMed
description Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteristics that were found to correlate with the fluorescence emission. A systematic informatics procedure was introduced, starting from descriptor cleaning, descriptor space reduction, and statistical-meaningful regression to build a broad and valid model for estimating the fluorescence emission wavelength. The least absolute shrinkage and selection operator (Lasso) regression coupling with the random forest model was finally reported as the numerical predictor as well as being fulfilled with the statistical criteria. Such an informatics model appeared to bring comparable predictive ability, being complementary to the conventional time-dependent density functional theory method in emission wavelength prediction, however, with a fractional computational expense.
format Online
Article
Text
id pubmed-9054811
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Royal Society of Chemistry
record_format MEDLINE/PubMed
spelling pubmed-90548112022-05-04 Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach Ye, Zong-Rong Huang, I.-Shou Chan, Yu-Te Li, Zhong-Ji Liao, Chen-Cheng Tsai, Hao-Rong Hsieh, Meng-Chi Chang, Chun-Chih Tsai, Ming-Kang RSC Adv Chemistry Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteristics that were found to correlate with the fluorescence emission. A systematic informatics procedure was introduced, starting from descriptor cleaning, descriptor space reduction, and statistical-meaningful regression to build a broad and valid model for estimating the fluorescence emission wavelength. The least absolute shrinkage and selection operator (Lasso) regression coupling with the random forest model was finally reported as the numerical predictor as well as being fulfilled with the statistical criteria. Such an informatics model appeared to bring comparable predictive ability, being complementary to the conventional time-dependent density functional theory method in emission wavelength prediction, however, with a fractional computational expense. The Royal Society of Chemistry 2020-06-23 /pmc/articles/PMC9054811/ /pubmed/35517310 http://dx.doi.org/10.1039/d0ra05014h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/
spellingShingle Chemistry
Ye, Zong-Rong
Huang, I.-Shou
Chan, Yu-Te
Li, Zhong-Ji
Liao, Chen-Cheng
Tsai, Hao-Rong
Hsieh, Meng-Chi
Chang, Chun-Chih
Tsai, Ming-Kang
Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title_full Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title_fullStr Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title_full_unstemmed Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title_short Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
title_sort predicting the emission wavelength of organic molecules using a combinatorial qsar and machine learning approach
topic Chemistry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9054811/
https://www.ncbi.nlm.nih.gov/pubmed/35517310
http://dx.doi.org/10.1039/d0ra05014h
work_keys_str_mv AT yezongrong predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT huangishou predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT chanyute predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT lizhongji predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT liaochencheng predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT tsaihaorong predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT hsiehmengchi predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT changchunchih predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach
AT tsaimingkang predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach