Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data
Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic natu...
Main authors: | Macesic, Nenad; Bear Don’t Walk, Oliver J.; Pe’er, Itsik; Tatonetti, Nicholas P.; Peleg, Anton Y.; Uhlemann, Anne-Catrin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | American Society for Microbiology, 2020 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253370/ https://www.ncbi.nlm.nih.gov/pubmed/32457240 http://dx.doi.org/10.1128/mSystems.00656-19 |
_version_ | 1783539322603962368 |
---|---|
author | Macesic, Nenad; Bear Don’t Walk, Oliver J.; Pe’er, Itsik; Tatonetti, Nicholas P.; Peleg, Anton Y.; Uhlemann, Anne-Catrin |
author_facet | Macesic, Nenad; Bear Don’t Walk, Oliver J.; Pe’er, Itsik; Tatonetti, Nicholas P.; Peleg, Anton Y.; Uhlemann, Anne-Catrin |
author_sort | Macesic, Nenad |
collection | PubMed |
description | Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics. |
format | Online Article Text |
id | pubmed-7253370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-7253370 2020-06-08 Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data Macesic, Nenad; Bear Don’t Walk, Oliver J.; Pe’er, Itsik; Tatonetti, Nicholas P.; Peleg, Anton Y.; Uhlemann, Anne-Catrin mSystems Research Article American Society for Microbiology 2020-05-26 /pmc/articles/PMC7253370/ /pubmed/32457240 http://dx.doi.org/10.1128/mSystems.00656-19 Text en Copyright © 2020 Macesic et al. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/). |
title | Predicting Phenotypic Polymyxin Resistance in Klebsiella pneumoniae through Machine Learning Analysis of Genomic Data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7253370/ https://www.ncbi.nlm.nih.gov/pubmed/32457240 http://dx.doi.org/10.1128/mSystems.00656-19 |
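
The abstract in this record contrasts a reference-free k-mer representation of genomes with a reference-based one and reports model performance as AUROC. As a rough illustration of those two ideas only, and not the authors' pipeline, the Python sketch below counts overlapping k-mers in a toy sequence and computes AUROC directly from its pairwise-ranking definition. The function names (`kmer_counts`, `auroc`), the toy sequence, the labels, and the scores are all hypothetical and chosen for the example.

```python
from collections import Counter
from typing import Sequence


def kmer_counts(sequence: str, k: int) -> Counter:
    """Count every overlapping substring of length k in a DNA sequence."""
    sequence = sequence.upper()
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def auroc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a randomly chosen positive (label 1)
    receives a higher score than a randomly chosen negative (label 0).
    Tied scores count as half a win."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical toy inputs, not data from the study.
toy_counts = kmer_counts("ACGTACGA", 3)      # 3-mers of a made-up sequence
toy_labels = [1, 1, 0, 0, 1, 0]              # 1 = resistant, 0 = susceptible
toy_scores = [0.9, 0.4, 0.5, 0.2, 0.8, 0.1]  # made-up model scores
toy_auroc = auroc(toy_labels, toy_scores)

print(dict(toy_counts))
print(round(toy_auroc, 3))
```

The pairwise loop is quadratic in the number of isolates, which is acceptable for a demonstration; for real data sets a library routine such as scikit-learn's `roc_auc_score` would be the more usual choice.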