Cargando…
Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach
Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteri...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society of Chemistry
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9054811/ https://www.ncbi.nlm.nih.gov/pubmed/35517310 http://dx.doi.org/10.1039/d0ra05014h |
_version_ | 1784697275333738496 |
---|---|
author | Ye, Zong-Rong Huang, I.-Shou Chan, Yu-Te Li, Zhong-Ji Liao, Chen-Cheng Tsai, Hao-Rong Hsieh, Meng-Chi Chang, Chun-Chih Tsai, Ming-Kang |
author_facet | Ye, Zong-Rong Huang, I.-Shou Chan, Yu-Te Li, Zhong-Ji Liao, Chen-Cheng Tsai, Hao-Rong Hsieh, Meng-Chi Chang, Chun-Chih Tsai, Ming-Kang |
author_sort | Ye, Zong-Rong |
collection | PubMed |
description | Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteristics that were found to correlate with the fluorescence emission. A systematic informatics procedure was introduced, starting from descriptor cleaning, descriptor space reduction, and statistical-meaningful regression to build a broad and valid model for estimating the fluorescence emission wavelength. The least absolute shrinkage and selection operator (Lasso) regression coupling with the random forest model was finally reported as the numerical predictor as well as being fulfilled with the statistical criteria. Such an informatics model appeared to bring comparable predictive ability, being complementary to the conventional time-dependent density functional theory method in emission wavelength prediction, however, with a fractional computational expense. |
format | Online Article Text |
id | pubmed-9054811 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Royal Society of Chemistry |
record_format | MEDLINE/PubMed |
spelling | pubmed-90548112022-05-04 Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach Ye, Zong-Rong Huang, I.-Shou Chan, Yu-Te Li, Zhong-Ji Liao, Chen-Cheng Tsai, Hao-Rong Hsieh, Meng-Chi Chang, Chun-Chih Tsai, Ming-Kang RSC Adv Chemistry Organic fluorescent molecules play critical roles in fluorescence inspection, biological probes, and labeling indicators. More than ten thousand organic fluorescent molecules were imported in this study, followed by a machine learning based approach for extracting the intrinsic structural characteristics that were found to correlate with the fluorescence emission. A systematic informatics procedure was introduced, starting from descriptor cleaning, descriptor space reduction, and statistical-meaningful regression to build a broad and valid model for estimating the fluorescence emission wavelength. The least absolute shrinkage and selection operator (Lasso) regression coupling with the random forest model was finally reported as the numerical predictor as well as being fulfilled with the statistical criteria. Such an informatics model appeared to bring comparable predictive ability, being complementary to the conventional time-dependent density functional theory method in emission wavelength prediction, however, with a fractional computational expense. The Royal Society of Chemistry 2020-06-23 /pmc/articles/PMC9054811/ /pubmed/35517310 http://dx.doi.org/10.1039/d0ra05014h Text en This journal is © The Royal Society of Chemistry https://creativecommons.org/licenses/by-nc/3.0/ |
spellingShingle | Chemistry Ye, Zong-Rong Huang, I.-Shou Chan, Yu-Te Li, Zhong-Ji Liao, Chen-Cheng Tsai, Hao-Rong Hsieh, Meng-Chi Chang, Chun-Chih Tsai, Ming-Kang Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title | Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title_full | Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title_fullStr | Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title_full_unstemmed | Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title_short | Predicting the emission wavelength of organic molecules using a combinatorial QSAR and machine learning approach |
title_sort | predicting the emission wavelength of organic molecules using a combinatorial qsar and machine learning approach |
topic | Chemistry |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9054811/ https://www.ncbi.nlm.nih.gov/pubmed/35517310 http://dx.doi.org/10.1039/d0ra05014h |
work_keys_str_mv | AT yezongrong predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT huangishou predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT chanyute predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT lizhongji predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT liaochencheng predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT tsaihaorong predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT hsiehmengchi predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT changchunchih predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach AT tsaimingkang predictingtheemissionwavelengthoforganicmoleculesusingacombinatorialqsarandmachinelearningapproach |