Cargando…
Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models
The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, re...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9280306/ https://www.ncbi.nlm.nih.gov/pubmed/35844400 http://dx.doi.org/10.1016/j.sjbs.2022.02.047 |
_version_ | 1784746610771623936 |
---|---|
author | Yasir, Muhammad Karim, Asad Mustafa Malik, Sumera Kausar Bajaffer, Amal A. Azhar, Esam I. |
author_facet | Yasir, Muhammad Karim, Asad Mustafa Malik, Sumera Kausar Bajaffer, Amal A. Azhar, Esam I. |
author_sort | Yasir, Muhammad |
collection | PubMed |
description | The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, reliable antimicrobial susceptibility data is crucial, and it will help clinicians about which drug to prescribe. Although few prediction studies based on strategies have been conducted, however, no single machine learning (ML) modelling has been carried out to predict MICs in N. gonorrhoeae. In this study, we propose a ML based approach that can predict MICs of a specific antibiotic using unitigs sequences data. We retrieved N. gonorrhoeae genomes from European Nucleotide Archive and NCBI and analysed them combined with their respective MIC data for cefixime, ciprofloxacin, and azithromycin and then we constructed unitigs by using de Brujin graphs. We built and compared 35 different ML regression models to predict MICs. Our results demonstrate that RandomForest and CATBoost models showed best performance in predicting MICs of the three antibiotics. The coefficient of determination, R(2), (a statistical measure of how well the regression predictions approximate the real data points) for cefixime, ciprofloxacin, and azithromycin was 0.75787, 0.77241, and 0.79009 respectively using RandomForest. For CATBoost model, the R(2) value was 0.74570, 0.77393, and 0.79317 for cefixime, ciprofloxacin, and azithromycin respectively. Lastly, using feature importance, we explore the important genomic regions identified by the models for predicting MICs. The major mutations which are responsible for resistance against these three antibiotics were chosen by ML models as a top feature in case of each antibiotics. CATBoost, DecisionTree, GradientBoosting, and RandomForest regression models chose the same unitigs which are responsible for resistance. This unitigs-based strategy for developing models for MIC prediction, clinical diagnostics, and surveillance can be applicable for other critical bacterial pathogens. |
format | Online Article Text |
id | pubmed-9280306 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-92803062022-07-15 Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models Yasir, Muhammad Karim, Asad Mustafa Malik, Sumera Kausar Bajaffer, Amal A. Azhar, Esam I. Saudi J Biol Sci Original Article The lowest concentration of an antimicrobial agent that can inhibit the visible growth of a microorganism after overnight incubation is called as minimum inhibitory concentration (MIC) and the drug prescriptions are made on the basis of MIC data to ensure successful treatment outcomes. Therefore, reliable antimicrobial susceptibility data is crucial, and it will help clinicians about which drug to prescribe. Although few prediction studies based on strategies have been conducted, however, no single machine learning (ML) modelling has been carried out to predict MICs in N. gonorrhoeae. In this study, we propose a ML based approach that can predict MICs of a specific antibiotic using unitigs sequences data. We retrieved N. gonorrhoeae genomes from European Nucleotide Archive and NCBI and analysed them combined with their respective MIC data for cefixime, ciprofloxacin, and azithromycin and then we constructed unitigs by using de Brujin graphs. We built and compared 35 different ML regression models to predict MICs. Our results demonstrate that RandomForest and CATBoost models showed best performance in predicting MICs of the three antibiotics. The coefficient of determination, R(2), (a statistical measure of how well the regression predictions approximate the real data points) for cefixime, ciprofloxacin, and azithromycin was 0.75787, 0.77241, and 0.79009 respectively using RandomForest. For CATBoost model, the R(2) value was 0.74570, 0.77393, and 0.79317 for cefixime, ciprofloxacin, and azithromycin respectively. Lastly, using feature importance, we explore the important genomic regions identified by the models for predicting MICs. The major mutations which are responsible for resistance against these three antibiotics were chosen by ML models as a top feature in case of each antibiotics. CATBoost, DecisionTree, GradientBoosting, and RandomForest regression models chose the same unitigs which are responsible for resistance. This unitigs-based strategy for developing models for MIC prediction, clinical diagnostics, and surveillance can be applicable for other critical bacterial pathogens. Elsevier 2022-05 2022-03-04 /pmc/articles/PMC9280306/ /pubmed/35844400 http://dx.doi.org/10.1016/j.sjbs.2022.02.047 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Article Yasir, Muhammad Karim, Asad Mustafa Malik, Sumera Kausar Bajaffer, Amal A. Azhar, Esam I. Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title | Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title_full | Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title_fullStr | Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title_full_unstemmed | Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title_short | Prediction of antimicrobial minimal inhibitory concentrations for Neisseria gonorrhoeae using machine learning models |
title_sort | prediction of antimicrobial minimal inhibitory concentrations for neisseria gonorrhoeae using machine learning models |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9280306/ https://www.ncbi.nlm.nih.gov/pubmed/35844400 http://dx.doi.org/10.1016/j.sjbs.2022.02.047 |
work_keys_str_mv | AT yasirmuhammad predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels AT karimasadmustafa predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels AT maliksumerakausar predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels AT bajafferamala predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels AT azharesami predictionofantimicrobialminimalinhibitoryconcentrationsforneisseriagonorrhoeaeusingmachinelearningmodels |