Cargando…

Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data

Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic natu...

Descripción completa

Detalles Bibliográficos
Autores principales: Macesic, Nenad, Bear Don’t Walk, Oliver J., Pe’er, Itsik, Tatonetti, Nicholas P., Peleg, Anton Y., Uhlemann, Anne-Catrin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253370/
https://www.ncbi.nlm.nih.gov/pubmed/32457240
http://dx.doi.org/10.1128/mSystems.00656-19
_version_ 1783539322603962368
author Macesic, Nenad
Bear Don’t Walk, Oliver J.
Pe’er, Itsik
Tatonetti, Nicholas P.
Peleg, Anton Y.
Uhlemann, Anne-Catrin
author_facet Macesic, Nenad
Bear Don’t Walk, Oliver J.
Pe’er, Itsik
Tatonetti, Nicholas P.
Peleg, Anton Y.
Uhlemann, Anne-Catrin
author_sort Macesic, Nenad
collection PubMed
description Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics.
format Online
Article
Text
id pubmed-7253370
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-72533702020-06-08 Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data Macesic, Nenad Bear Don’t Walk, Oliver J. Pe’er, Itsik Tatonetti, Nicholas P. Peleg, Anton Y. Uhlemann, Anne-Catrin mSystems Research Article Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics. American Society for Microbiology 2020-05-26 /pmc/articles/PMC7253370/ /pubmed/32457240 http://dx.doi.org/10.1128/mSystems.00656-19 Text en Copyright © 2020 Macesic et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Macesic, Nenad
Bear Don’t Walk, Oliver J.
Pe’er, Itsik
Tatonetti, Nicholas P.
Peleg, Anton Y.
Uhlemann, Anne-Catrin
Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title_full Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title_fullStr Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title_full_unstemmed Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title_short Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
title_sort predicting phenotypic polymyxin resistance in klebsiella pneumoniae through machine learning analysis of genomic data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253370/
https://www.ncbi.nlm.nih.gov/pubmed/32457240
http://dx.doi.org/10.1128/mSystems.00656-19
work_keys_str_mv AT macesicnenad predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata
AT beardontwalkoliverj predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata
AT peeritsik predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata
AT tatonettinicholasp predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata
AT pelegantony predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata
AT uhlemannannecatrin predictingphenotypicpolymyxinresistanceinklebsiellapneumoniaethroughmachinelearninganalysisofgenomicdata