Cargando…

Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes

Escherichia coli is an important cause of bacterial infections worldwide, with multidrug-resistant strains incurring substantial costs on human lives. Besides therapeutic concentrations of antimicrobials in health care settings, the presence of subinhibitory antimicrobial residues in the environment...

Descripción completa

Detalles Bibliográficos
Autores principales: Benkwitz-Bedford, Sam, Palm, Martin, Demirtas, Talip Yasir, Mustonen, Ville, Farewell, Anne, Warringer, Jonas, Parts, Leopold, Moradigaravand, Danesh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8407197/
https://www.ncbi.nlm.nih.gov/pubmed/34427505
http://dx.doi.org/10.1128/mSystems.00346-21
_version_ 1783746602980081664
author Benkwitz-Bedford, Sam
Palm, Martin
Demirtas, Talip Yasir
Mustonen, Ville
Farewell, Anne
Warringer, Jonas
Parts, Leopold
Moradigaravand, Danesh
author_facet Benkwitz-Bedford, Sam
Palm, Martin
Demirtas, Talip Yasir
Mustonen, Ville
Farewell, Anne
Warringer, Jonas
Parts, Leopold
Moradigaravand, Danesh
author_sort Benkwitz-Bedford, Sam
collection PubMed
description Escherichia coli is an important cause of bacterial infections worldwide, with multidrug-resistant strains incurring substantial costs on human lives. Besides therapeutic concentrations of antimicrobials in health care settings, the presence of subinhibitory antimicrobial residues in the environment and in clinics selects for antimicrobial resistance (AMR), but the underlying genetic repertoire is less well understood. Here, we used machine learning to predict the population doubling time and cell growth yield of 1,407 genetically diverse E. coli strains expanding under exposure to three subinhibitory concentrations of six classes of antimicrobials from single-nucleotide genetic variants, accessory gene variation, and the presence of known AMR genes. We predicted cell growth yields in the held-out test data with an average correlation (Spearman’s ρ) of 0.63 (0.36 to 0.81 across concentrations) and cell doubling times with an average correlation of 0.59 (0.32 to 0.92 across concentrations), with moderate increases in sample size unlikely to improve predictions further. This finding points to the remaining missing heritability of growth under antimicrobial exposure being explained by effects that are too rare or weak to be captured unless sample size is dramatically increased, or by effects other than those conferred by the presence of individual single-nucleotide polymorphisms (SNPs) and genes. Predictions based on whole-genome information were generally superior to those based only on known AMR genes and were accurate for AMR resistance at therapeutic concentrations. We pinpointed genes and SNPs determining the predicted growth and thereby recapitulated many known AMR determinants. Finally, we estimated the effect sizes of resistance genes across the entire collection of strains, disclosing the growth effects for known resistance genes in each individual strain. Our results underscore the potential of predictive modeling of growth patterns from genomic data under subinhibitory concentrations of antimicrobials, although the remaining missing heritability poses a challenge for achieving the accuracy and precision required for clinical use. IMPORTANCE Predicting bacterial growth from genome sequences is important for a rapid characterization of strains in clinical diagnostics and to disclose candidate novel targets for anti-infective drugs. Previous studies have dissected the relationship between bacterial growth and genotype in mutant libraries for laboratory strains, yet no study so far has examined the predictive power of genome sequence in natural strains. In this study, we used a high-throughput phenotypic assay to measure the growth of a systematic collection of natural Escherichia coli strains and then employed machine learning models to predict bacterial growth from genomic data under nontherapeutic subinhibitory concentrations of antimicrobials that are common in nonclinical settings. We found a moderate to strong correlation between predicted and actual values for the different collected data sets. Moreover, we observed that the known resistance genes are still effective at sublethal concentrations, pointing to clinical implications of these concentrations.
format Online
Article
Text
id pubmed-8407197
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-84071972021-09-09 Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes Benkwitz-Bedford, Sam Palm, Martin Demirtas, Talip Yasir Mustonen, Ville Farewell, Anne Warringer, Jonas Parts, Leopold Moradigaravand, Danesh mSystems Research Article Escherichia coli is an important cause of bacterial infections worldwide, with multidrug-resistant strains incurring substantial costs on human lives. Besides therapeutic concentrations of antimicrobials in health care settings, the presence of subinhibitory antimicrobial residues in the environment and in clinics selects for antimicrobial resistance (AMR), but the underlying genetic repertoire is less well understood. Here, we used machine learning to predict the population doubling time and cell growth yield of 1,407 genetically diverse E. coli strains expanding under exposure to three subinhibitory concentrations of six classes of antimicrobials from single-nucleotide genetic variants, accessory gene variation, and the presence of known AMR genes. We predicted cell growth yields in the held-out test data with an average correlation (Spearman’s ρ) of 0.63 (0.36 to 0.81 across concentrations) and cell doubling times with an average correlation of 0.59 (0.32 to 0.92 across concentrations), with moderate increases in sample size unlikely to improve predictions further. This finding points to the remaining missing heritability of growth under antimicrobial exposure being explained by effects that are too rare or weak to be captured unless sample size is dramatically increased, or by effects other than those conferred by the presence of individual single-nucleotide polymorphisms (SNPs) and genes. Predictions based on whole-genome information were generally superior to those based only on known AMR genes and were accurate for AMR resistance at therapeutic concentrations. We pinpointed genes and SNPs determining the predicted growth and thereby recapitulated many known AMR determinants. Finally, we estimated the effect sizes of resistance genes across the entire collection of strains, disclosing the growth effects for known resistance genes in each individual strain. Our results underscore the potential of predictive modeling of growth patterns from genomic data under subinhibitory concentrations of antimicrobials, although the remaining missing heritability poses a challenge for achieving the accuracy and precision required for clinical use. IMPORTANCE Predicting bacterial growth from genome sequences is important for a rapid characterization of strains in clinical diagnostics and to disclose candidate novel targets for anti-infective drugs. Previous studies have dissected the relationship between bacterial growth and genotype in mutant libraries for laboratory strains, yet no study so far has examined the predictive power of genome sequence in natural strains. In this study, we used a high-throughput phenotypic assay to measure the growth of a systematic collection of natural Escherichia coli strains and then employed machine learning models to predict bacterial growth from genomic data under nontherapeutic subinhibitory concentrations of antimicrobials that are common in nonclinical settings. We found a moderate to strong correlation between predicted and actual values for the different collected data sets. Moreover, we observed that the known resistance genes are still effective at sublethal concentrations, pointing to clinical implications of these concentrations. American Society for Microbiology 2021-08-24 /pmc/articles/PMC8407197/ /pubmed/34427505 http://dx.doi.org/10.1128/mSystems.00346-21 Text en Copyright © 2021 Benkwitz-Bedford et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Benkwitz-Bedford, Sam
Palm, Martin
Demirtas, Talip Yasir
Mustonen, Ville
Farewell, Anne
Warringer, Jonas
Parts, Leopold
Moradigaravand, Danesh
Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title_full Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title_fullStr Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title_full_unstemmed Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title_short Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes
title_sort machine learning prediction of resistance to subinhibitory antimicrobial concentrations from escherichia coli genomes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8407197/
https://www.ncbi.nlm.nih.gov/pubmed/34427505
http://dx.doi.org/10.1128/mSystems.00346-21
work_keys_str_mv AT benkwitzbedfordsam machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT palmmartin machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT demirtastalipyasir machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT mustonenville machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT farewellanne machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT warringerjonas machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT partsleopold machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes
AT moradigaravanddanesh machinelearningpredictionofresistancetosubinhibitoryantimicrobialconcentrationsfromescherichiacoligenomes