Cargando…

Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential

Machine learning (ML) algorithms were explored for the classification of the UV–Vis absorption spectrum of organic molecules based on molecular descriptors and fingerprints generated from 2D chemical structures. Training and test data (~ 75 k molecules and associated UV–Vis data) were assembled from...

Descripción completa

Detalles Bibliográficos
Autores principales: Mamede, Rafael, Pereira, Florbela, Aires-de-Sousa, João
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660842/
https://www.ncbi.nlm.nih.gov/pubmed/34887473
http://dx.doi.org/10.1038/s41598-021-03070-9
_version_ 1784613277012066304
author Mamede, Rafael
Pereira, Florbela
Aires-de-Sousa, João
author_facet Mamede, Rafael
Pereira, Florbela
Aires-de-Sousa, João
author_sort Mamede, Rafael
collection PubMed
description Machine learning (ML) algorithms were explored for the classification of the UV–Vis absorption spectrum of organic molecules based on molecular descriptors and fingerprints generated from 2D chemical structures. Training and test data (~ 75 k molecules and associated UV–Vis data) were assembled from a database with lists of experimental absorption maxima. They were labeled with positive class (related to photoreactive potential) if an absorption maximum is reported in the range between 290 and 700 nm (UV/Vis) with molar extinction coefficient (MEC) above 1000 Lmol(−1) cm(−1), and as negative if no such a peak is in the list. Random forests were selected among several algorithms. The models were validated with two external test sets comprising 998 organic molecules, obtaining a global accuracy up to 0.89, sensitivity of 0.90 and specificity of 0.88. The ML output (UV–Vis spectrum class) was explored as a predictor of the 3T3 NRU phototoxicity in vitro assay for a set of 43 molecules. Comparable results were observed with the classification directly based on experimental UV–Vis data in the same format.
format Online
Article
Text
id pubmed-8660842
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86608422021-12-13 Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential Mamede, Rafael Pereira, Florbela Aires-de-Sousa, João Sci Rep Article Machine learning (ML) algorithms were explored for the classification of the UV–Vis absorption spectrum of organic molecules based on molecular descriptors and fingerprints generated from 2D chemical structures. Training and test data (~ 75 k molecules and associated UV–Vis data) were assembled from a database with lists of experimental absorption maxima. They were labeled with positive class (related to photoreactive potential) if an absorption maximum is reported in the range between 290 and 700 nm (UV/Vis) with molar extinction coefficient (MEC) above 1000 Lmol(−1) cm(−1), and as negative if no such a peak is in the list. Random forests were selected among several algorithms. The models were validated with two external test sets comprising 998 organic molecules, obtaining a global accuracy up to 0.89, sensitivity of 0.90 and specificity of 0.88. The ML output (UV–Vis spectrum class) was explored as a predictor of the 3T3 NRU phototoxicity in vitro assay for a set of 43 molecules. Comparable results were observed with the classification directly based on experimental UV–Vis data in the same format. Nature Publishing Group UK 2021-12-09 /pmc/articles/PMC8660842/ /pubmed/34887473 http://dx.doi.org/10.1038/s41598-021-03070-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Mamede, Rafael
Pereira, Florbela
Aires-de-Sousa, João
Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title_full Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title_fullStr Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title_full_unstemmed Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title_short Machine learning prediction of UV–Vis spectra features of organic compounds related to photoreactive potential
title_sort machine learning prediction of uv–vis spectra features of organic compounds related to photoreactive potential
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8660842/
https://www.ncbi.nlm.nih.gov/pubmed/34887473
http://dx.doi.org/10.1038/s41598-021-03070-9
work_keys_str_mv AT mamederafael machinelearningpredictionofuvvisspectrafeaturesoforganiccompoundsrelatedtophotoreactivepotential
AT pereiraflorbela machinelearningpredictionofuvvisspectrafeaturesoforganiccompoundsrelatedtophotoreactivepotential
AT airesdesousajoao machinelearningpredictionofuvvisspectrafeaturesoforganiccompoundsrelatedtophotoreactivepotential