Cargando…
Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets
Drug-induced liver toxicity is one of the significant safety challenges for the patient’s health and the pharmaceutical industry. It causes termination of drug candidates in clinical trials and also the retractions of approved drugs from the market. Thus, it is essential to identify hepatotoxic comp...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8348336/ https://www.ncbi.nlm.nih.gov/pubmed/34360838 http://dx.doi.org/10.3390/ijms22158073 |
_version_ | 1783735314518376448 |
---|---|
author | Jaganathan, Keerthana Tayara, Hilal Chong, Kil To |
author_facet | Jaganathan, Keerthana Tayara, Hilal Chong, Kil To |
author_sort | Jaganathan, Keerthana |
collection | PubMed |
description | Drug-induced liver toxicity is one of the significant safety challenges for the patient’s health and the pharmaceutical industry. It causes termination of drug candidates in clinical trials and also the retractions of approved drugs from the market. Thus, it is essential to identify hepatotoxic compounds in the initial stages of drug development process. The purpose of this study is to construct quantitative structure activity relationship models using machine learning algorithms and systematical feature selection methods for molecular descriptor sets. The models were built from a large and diverse set of 1253 drug compounds and were validated internally with 10-fold cross-validation. In this study, we applied a variety of feature selection techniques to extract the optimal subset of descriptors as modeling features to improve the prediction performance. Experimental results suggested that the support vector machine-based classifier had achieved a better classification accuracy with reduced molecular descriptors. The final optimal model provides an accuracy of 0.811, a sensitivity of 0.840, a specificity of 0.783 and Mathew’s correlation coefficient of 0.623 with an internal validation set. Furthermore, this model outperformed the prior studies while evaluated in both the internal and external test sets. The utilization of distinct optimal molecular descriptors as modeling features produce an in silico model with a superior performance. |
format | Online Article Text |
id | pubmed-8348336 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83483362021-08-08 Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets Jaganathan, Keerthana Tayara, Hilal Chong, Kil To Int J Mol Sci Article Drug-induced liver toxicity is one of the significant safety challenges for the patient’s health and the pharmaceutical industry. It causes termination of drug candidates in clinical trials and also the retractions of approved drugs from the market. Thus, it is essential to identify hepatotoxic compounds in the initial stages of drug development process. The purpose of this study is to construct quantitative structure activity relationship models using machine learning algorithms and systematical feature selection methods for molecular descriptor sets. The models were built from a large and diverse set of 1253 drug compounds and were validated internally with 10-fold cross-validation. In this study, we applied a variety of feature selection techniques to extract the optimal subset of descriptors as modeling features to improve the prediction performance. Experimental results suggested that the support vector machine-based classifier had achieved a better classification accuracy with reduced molecular descriptors. The final optimal model provides an accuracy of 0.811, a sensitivity of 0.840, a specificity of 0.783 and Mathew’s correlation coefficient of 0.623 with an internal validation set. Furthermore, this model outperformed the prior studies while evaluated in both the internal and external test sets. The utilization of distinct optimal molecular descriptors as modeling features produce an in silico model with a superior performance. MDPI 2021-07-28 /pmc/articles/PMC8348336/ /pubmed/34360838 http://dx.doi.org/10.3390/ijms22158073 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jaganathan, Keerthana Tayara, Hilal Chong, Kil To Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title | Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title_full | Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title_fullStr | Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title_full_unstemmed | Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title_short | Prediction of Drug-Induced Liver Toxicity Using SVM and Optimal Descriptor Sets |
title_sort | prediction of drug-induced liver toxicity using svm and optimal descriptor sets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8348336/ https://www.ncbi.nlm.nih.gov/pubmed/34360838 http://dx.doi.org/10.3390/ijms22158073 |
work_keys_str_mv | AT jaganathankeerthana predictionofdruginducedlivertoxicityusingsvmandoptimaldescriptorsets AT tayarahilal predictionofdruginducedlivertoxicityusingsvmandoptimaldescriptorsets AT chongkilto predictionofdruginducedlivertoxicityusingsvmandoptimaldescriptorsets |