Cargando…

Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri

Vibrio fischeri is widely used as the model species in toxicity and risk assessment. For the first time, a global classification model was proposed in this paper for a two-class problem (Class − 1 with log1/IBC(50) ≤ 4.2 and Class + 1 with log1/IBC(50) > 4.2, the unit of IBC(50): mol/L) by utiliz...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Feng, Zhang, Xinhua, Fang, Zhengjun, Yu, Xinliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10057455/
https://www.ncbi.nlm.nih.gov/pubmed/36985675
http://dx.doi.org/10.3390/molecules28062703
_version_ 1785016371626639360
author Wu, Feng
Zhang, Xinhua
Fang, Zhengjun
Yu, Xinliang
author_facet Wu, Feng
Zhang, Xinhua
Fang, Zhengjun
Yu, Xinliang
author_sort Wu, Feng
collection PubMed
description Vibrio fischeri is widely used as the model species in toxicity and risk assessment. For the first time, a global classification model was proposed in this paper for a two-class problem (Class − 1 with log1/IBC(50) ≤ 4.2 and Class + 1 with log1/IBC(50) > 4.2, the unit of IBC(50): mol/L) by utilizing a large data set of 601 toxicity log1/IBC(50) of organic compounds to Vibrio fischeri. Dragon software was used to calculate 4885 molecular descriptors for each compound. Stepwise multiple linear regression (MLR) analysis was used to select the descriptor subset for the models. The ten molecular descriptors used in the classification model reflect the structural information on the Michael-type addition of nucleophiles, molecular branching, molecular size, polarizability, hydrophobic, and so on. Furthermore, these descriptors were interpreted from the point of view of toxicity mechanisms. The optimal support vector machine (SVM) model (C = 253.8 and γ = 0.009) was obtained with the genetic algorithm. The SVM classification model produced a prediction accuracy of 89.1% for the training set (451 log1/IBC(50)), of 80.0% for the test set (150 log1/IBC(50)), and of 86.9% for the total data set (601 log1/IBC(50)), which are higher than that (80.5%, 76%, and 79.4%, respectively) from the binary logistic regression (BLR) model. The global SVM classification model is successful, although it deals with a large data set in relation to the toxicity of organics to Vibrio fischeri.
format Online
Article
Text
id pubmed-10057455
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100574552023-03-30 Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri Wu, Feng Zhang, Xinhua Fang, Zhengjun Yu, Xinliang Molecules Article Vibrio fischeri is widely used as the model species in toxicity and risk assessment. For the first time, a global classification model was proposed in this paper for a two-class problem (Class − 1 with log1/IBC(50) ≤ 4.2 and Class + 1 with log1/IBC(50) > 4.2, the unit of IBC(50): mol/L) by utilizing a large data set of 601 toxicity log1/IBC(50) of organic compounds to Vibrio fischeri. Dragon software was used to calculate 4885 molecular descriptors for each compound. Stepwise multiple linear regression (MLR) analysis was used to select the descriptor subset for the models. The ten molecular descriptors used in the classification model reflect the structural information on the Michael-type addition of nucleophiles, molecular branching, molecular size, polarizability, hydrophobic, and so on. Furthermore, these descriptors were interpreted from the point of view of toxicity mechanisms. The optimal support vector machine (SVM) model (C = 253.8 and γ = 0.009) was obtained with the genetic algorithm. The SVM classification model produced a prediction accuracy of 89.1% for the training set (451 log1/IBC(50)), of 80.0% for the test set (150 log1/IBC(50)), and of 86.9% for the total data set (601 log1/IBC(50)), which are higher than that (80.5%, 76%, and 79.4%, respectively) from the binary logistic regression (BLR) model. The global SVM classification model is successful, although it deals with a large data set in relation to the toxicity of organics to Vibrio fischeri. MDPI 2023-03-16 /pmc/articles/PMC10057455/ /pubmed/36985675 http://dx.doi.org/10.3390/molecules28062703 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wu, Feng
Zhang, Xinhua
Fang, Zhengjun
Yu, Xinliang
Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title_full Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title_fullStr Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title_full_unstemmed Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title_short Support Vector Machine-Based Global Classification Model of the Toxicity of Organic Compounds to Vibrio fischeri
title_sort support vector machine-based global classification model of the toxicity of organic compounds to vibrio fischeri
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10057455/
https://www.ncbi.nlm.nih.gov/pubmed/36985675
http://dx.doi.org/10.3390/molecules28062703
work_keys_str_mv AT wufeng supportvectormachinebasedglobalclassificationmodelofthetoxicityoforganiccompoundstovibriofischeri
AT zhangxinhua supportvectormachinebasedglobalclassificationmodelofthetoxicityoforganiccompoundstovibriofischeri
AT fangzhengjun supportvectormachinebasedglobalclassificationmodelofthetoxicityoforganiccompoundstovibriofischeri
AT yuxinliang supportvectormachinebasedglobalclassificationmodelofthetoxicityoforganiccompoundstovibriofischeri