Cargando…

Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds

[Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Diéguez-Santana, Karel, Casañola-Martin, Gerardo M., Torres, Roldan, Rasulev, Bakhtiyor, Green, James R., González-Díaz, Humbert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9986951/
https://www.ncbi.nlm.nih.gov/pubmed/35671399
http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029
_version_ 1784901275155955712
author Diéguez-Santana, Karel
Casañola-Martin, Gerardo M.
Torres, Roldan
Rasulev, Bakhtiyor
Green, James R.
González-Díaz, Humbert
author_facet Diéguez-Santana, Karel
Casañola-Martin, Gerardo M.
Torres, Roldan
Rasulev, Bakhtiyor
Green, James R.
González-Díaz, Humbert
author_sort Diéguez-Santana, Karel
collection PubMed
description [Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research.
format Online
Article
Text
id pubmed-9986951
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-99869512023-03-07 Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds Diéguez-Santana, Karel Casañola-Martin, Gerardo M. Torres, Roldan Rasulev, Bakhtiyor Green, James R. González-Díaz, Humbert Mol Pharm [Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research. American Chemical Society 2022-06-07 /pmc/articles/PMC9986951/ /pubmed/35671399 http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Diéguez-Santana, Karel
Casañola-Martin, Gerardo M.
Torres, Roldan
Rasulev, Bakhtiyor
Green, James R.
González-Díaz, Humbert
Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title_full Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title_fullStr Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title_full_unstemmed Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title_short Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
title_sort machine learning study of metabolic networks vs chembl data of antibacterial compounds
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9986951/
https://www.ncbi.nlm.nih.gov/pubmed/35671399
http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029
work_keys_str_mv AT dieguezsantanakarel machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds
AT casanolamartingerardom machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds
AT torresroldan machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds
AT rasulevbakhtiyor machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds
AT greenjamesr machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds
AT gonzalezdiazhumbert machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds