Cargando…

Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models

The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, re...

Descripción completa

Detalles Bibliográficos
Autores principales: Yasir, Muhammad, Karim, Asad Mustafa, Malik, Sumera Kausar, Bajaffer, Amal A., Azhar, Esam I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9280306/
https://www.ncbi.nlm.nih.gov/pubmed/35844400
http://dx.doi.org/10.1016/j.sjbs.2022.02.047
_version_ 1784746610771623936
author Yasir, Muhammad
Karim, Asad Mustafa
Malik, Sumera Kausar
Bajaffer, Amal A.
Azhar, Esam I.
author_facet Yasir, Muhammad
Karim, Asad Mustafa
Malik, Sumera Kausar
Bajaffer, Amal A.
Azhar, Esam I.
author_sort Yasir, Muhammad
collection PubMed
description The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, reliable antimicrobial susceptibility data is crucial, and it will help clinicians about which drug to prescribe. Although few prediction studies based on strategies have been conducted, however, no single machine learning (ML) modelling has been carried out to predict MICs in N. gonorrhoeae. In this study, we propose a ML based approach that can predict MICs of a specific antibiotic using unitigs sequences data. We retrieved N. gonorrhoeae genomes from European Nucleotide Archive and NCBI and analysed them combined with their respective MIC data for cefixime, ciprofloxacin, and azithromycin and then we constructed unitigs by using de Brujin graphs. We built and compared 35 different ML regression models to predict MICs. Our results demonstrate that RandomForest and CATBoost models showed best performance in predicting MICs of the three antibiotics. The coefficient of determination, R(2), (a statistical measure of how well the regression predictions approximate the real data points) for cefixime, ciprofloxacin, and azithromycin was 0.75787, 0.77241, and 0.79009 respectively using RandomForest. For CATBoost model, the R(2) value was 0.74570, 0.77393, and 0.79317 for cefixime, ciprofloxacin, and azithromycin respectively. Lastly, using feature importance, we explore the important genomic regions identified by the models for predicting MICs. The major mutations which are responsible for resistance against these three antibiotics were chosen by ML models as a top feature in case of each antibiotics. CATBoost, DecisionTree, GradientBoosting, and RandomForest regression models chose the same unitigs which are responsible for resistance. This unitigs-based strategy for developing models for MIC prediction, clinical diagnostics, and surveillance can be applicable for other critical bacterial pathogens.
format Online
Article
Text
id pubmed-9280306
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-92803062022-07-15 Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models Yasir, Muhammad Karim, Asad Mustafa Malik, Sumera Kausar Bajaffer, Amal A. Azhar, Esam I. Saudi J Biol Sci Original Article The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, reliable antimicrobial susceptibility data is crucial, and it will help clinicians about which drug to prescribe. Although few prediction studies based on strategies have been conducted, however, no single machine learning (ML) modelling has been carried out to predict MICs in N. gonorrhoeae. In this study, we propose a ML based approach that can predict MICs of a specific antibiotic using unitigs sequences data. We retrieved N. gonorrhoeae genomes from European Nucleotide Archive and NCBI and analysed them combined with their respective MIC data for cefixime, ciprofloxacin, and azithromycin and then we constructed unitigs by using de Brujin graphs. We built and compared 35 different ML regression models to predict MICs. Our results demonstrate that RandomForest and CATBoost models showed best performance in predicting MICs of the three antibiotics. The coefficient of determination, R(2), (a statistical measure of how well the regression predictions approximate the real data points) for cefixime, ciprofloxacin, and azithromycin was 0.75787, 0.77241, and 0.79009 respectively using RandomForest. For CATBoost model, the R(2) value was 0.74570, 0.77393, and 0.79317 for cefixime, ciprofloxacin, and azithromycin respectively. Lastly, using feature importance, we explore the important genomic regions identified by the models for predicting MICs. The major mutations which are responsible for resistance against these three antibiotics were chosen by ML models as a top feature in case of each antibiotics. CATBoost, DecisionTree, GradientBoosting, and RandomForest regression models chose the same unitigs which are responsible for resistance. This unitigs-based strategy for developing models for MIC prediction, clinical diagnostics, and surveillance can be applicable for other critical bacterial pathogens. Elsevier 2022-05 2022-03-04 /pmc/articles/PMC9280306/ /pubmed/35844400 http://dx.doi.org/10.1016/j.sjbs.2022.02.047 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Yasir, Muhammad
Karim, Asad Mustafa
Malik, Sumera Kausar
Bajaffer, Amal A.
Azhar, Esam I.
Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title_full Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title_fullStr Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title_full_unstemmed Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title_short Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
title_sort prediction of antimicrobial minimal inhibitory concentrations for neisseria gonorrhoeae using machine learning models
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9280306/
https://www.ncbi.nlm.nih.gov/pubmed/35844400
http://dx.doi.org/10.1016/j.sjbs.2022.02.047
work_keys_str_mv AT yasirmuhammad predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels
AT karimasadmustafa predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels
AT maliksumerakausar predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels
AT bajafferamala predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels
AT azharesami predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels