Cargando…
Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds
[Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. I...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9986951/ https://www.ncbi.nlm.nih.gov/pubmed/35671399 http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029 |
_version_ | 1784901275155955712 |
---|---|
author | Diéguez-Santana, Karel Casañola-Martin, Gerardo M. Torres, Roldan Rasulev, Bakhtiyor Green, James R. González-Díaz, Humbert |
author_facet | Diéguez-Santana, Karel Casañola-Martin, Gerardo M. Torres, Roldan Rasulev, Bakhtiyor Green, James R. González-Díaz, Humbert |
author_sort | Diéguez-Santana, Karel |
collection | PubMed |
description | [Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research. |
format | Online Article Text |
id | pubmed-9986951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-99869512023-03-07 Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds Diéguez-Santana, Karel Casañola-Martin, Gerardo M. Torres, Roldan Rasulev, Bakhtiyor Green, James R. González-Díaz, Humbert Mol Pharm [Image: see text] Antibacterial drugs (AD) change the metabolic status of bacteria, contributing to bacterial death. However, antibiotic resistance and the emergence of multidrug-resistant bacteria increase interest in understanding metabolic network (MN) mutations and the interaction of AD vs MN. In this study, we employed the IFPTML = Information Fusion (IF) + Perturbation Theory (PT) + Machine Learning (ML) algorithm on a huge dataset from the ChEMBL database, which contains >155,000 AD assays vs >40 MNs of multiple bacteria species. We built a linear discriminant analysis (LDA) and 17 ML models centered on the linear index and based on atoms to predict antibacterial compounds. The IFPTML-LDA model presented the following results for the training subset: specificity (Sp) = 76% out of 70,000 cases, sensitivity (Sn) = 70%, and Accuracy (Acc) = 73%. The same model also presented the following results for the validation subsets: Sp = 76%, Sn = 70%, and Acc = 73.1%. Among the IFPTML nonlinear models, the k nearest neighbors (KNN) showed the best results with Sn = 99.2%, Sp = 95.5%, Acc = 97.4%, and Area Under Receiver Operating Characteristic (AUROC) = 0.998 in training sets. In the validation series, the Random Forest had the best results: Sn = 93.96% and Sp = 87.02% (AUROC = 0.945). The IFPTML linear and nonlinear models regarding the ADs vs MNs have good statistical parameters, and they could contribute toward finding new metabolic mutations in antibiotic resistance and reducing time/costs in antibacterial drug research. American Chemical Society 2022-06-07 /pmc/articles/PMC9986951/ /pubmed/35671399 http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029 Text en © 2022 American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Diéguez-Santana, Karel Casañola-Martin, Gerardo M. Torres, Roldan Rasulev, Bakhtiyor Green, James R. González-Díaz, Humbert Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title | Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title_full | Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title_fullStr | Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title_full_unstemmed | Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title_short | Machine Learning Study of Metabolic Networks vs ChEMBL Data of Antibacterial Compounds |
title_sort | machine learning study of metabolic networks vs chembl data of antibacterial compounds |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9986951/ https://www.ncbi.nlm.nih.gov/pubmed/35671399 http://dx.doi.org/10.1021/acs.molpharmaceut.2c00029 |
work_keys_str_mv | AT dieguezsantanakarel machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds AT casanolamartingerardom machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds AT torresroldan machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds AT rasulevbakhtiyor machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds AT greenjamesr machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds AT gonzalezdiazhumbert machinelearningstudyofmetabolicnetworksvschembldataofantibacterialcompounds |