Cargando…

Predicting Antimicrobial Resistance Using Partial Genome Alignments

Antimicrobial resistance (AMR) is an important global health threat that impacts millions of people worldwide each year. Developing methods that can detect and predict AMR phenotypes can help to mitigate the spread of AMR by informing clinical decision making and appropriate mitigation strategies. M...

Descripción completa

Detalles Bibliográficos
Autores principales: Aytan-Aktug, D., Nguyen, M., Clausen, P. T. L. C., Stevens, R. L., Aarestrup, F. M., Lund, O., Davis, J. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8269213/
https://www.ncbi.nlm.nih.gov/pubmed/34128695
http://dx.doi.org/10.1128/mSystems.00185-21
_version_ 1783720527871868928
author Aytan-Aktug, D.
Nguyen, M.
Clausen, P. T. L. C.
Stevens, R. L.
Aarestrup, F. M.
Lund, O.
Davis, J. J.
author_facet Aytan-Aktug, D.
Nguyen, M.
Clausen, P. T. L. C.
Stevens, R. L.
Aarestrup, F. M.
Lund, O.
Davis, J. J.
author_sort Aytan-Aktug, D.
collection PubMed
description Antimicrobial resistance (AMR) is an important global health threat that impacts millions of people worldwide each year. Developing methods that can detect and predict AMR phenotypes can help to mitigate the spread of AMR by informing clinical decision making and appropriate mitigation strategies. Many bioinformatic methods have been developed for predicting AMR phenotypes from whole-genome sequences and AMR genes, but recent studies have indicated that predictions can be made from incomplete genome sequence data. In order to more systematically understand this, we built random forest-based machine learning classifiers for predicting susceptible and resistant phenotypes for Klebsiella pneumoniae (1,640 strains), Mycobacterium tuberculosis (2,497 strains), and Salmonella enterica (1,981 strains). We started by building models from alignments that were based on a reference chromosome for each species. We then subsampled each chromosomal alignment and built models for the resulting subalignments, finding that very small regions, representing approximately 0.1 to 0.2% of the chromosome, are predictive. In K. pneumoniae, M. tuberculosis, and S. enterica, the subalignments are able to predict multiple AMR phenotypes with at least 70% accuracy, even though most do not encode an AMR-related function. We used these models to identify regions of the chromosome with high and low predictive signals. Finally, subalignments that retain high accuracy across larger phylogenetic distances were examined in greater detail, revealing genes and intergenic regions with potential links to AMR, virulence, transport, and survival under stress conditions. IMPORTANCE Antimicrobial resistance causes thousands of deaths annually worldwide. Understanding the regions of the genome that are involved in antimicrobial resistance is important for developing mitigation strategies and preventing transmission. Machine learning models are capable of predicting antimicrobial resistance phenotypes from bacterial genome sequence data by identifying resistance genes, mutations, and other correlated features. They are also capable of implicating regions of the genome that have not been previously characterized as being involved in resistance. In this study, we generated global chromosomal alignments for Klebsiella pneumoniae, Mycobacterium tuberculosis, and Salmonella enterica and systematically searched them for small conserved regions of the genome that enable the prediction of antimicrobial resistance phenotypes. In addition to known antimicrobial resistance genes, this analysis identified genes involved in virulence and transport functions, as well as many genes with no previous implication in antimicrobial resistance.
format Online
Article
Text
id pubmed-8269213
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-82692132021-08-02 Predicting Antimicrobial Resistance Using Partial Genome Alignments Aytan-Aktug, D. Nguyen, M. Clausen, P. T. L. C. Stevens, R. L. Aarestrup, F. M. Lund, O. Davis, J. J. mSystems Research Article Antimicrobial resistance (AMR) is an important global health threat that impacts millions of people worldwide each year. Developing methods that can detect and predict AMR phenotypes can help to mitigate the spread of AMR by informing clinical decision making and appropriate mitigation strategies. Many bioinformatic methods have been developed for predicting AMR phenotypes from whole-genome sequences and AMR genes, but recent studies have indicated that predictions can be made from incomplete genome sequence data. In order to more systematically understand this, we built random forest-based machine learning classifiers for predicting susceptible and resistant phenotypes for Klebsiella pneumoniae (1,640 strains), Mycobacterium tuberculosis (2,497 strains), and Salmonella enterica (1,981 strains). We started by building models from alignments that were based on a reference chromosome for each species. We then subsampled each chromosomal alignment and built models for the resulting subalignments, finding that very small regions, representing approximately 0.1 to 0.2% of the chromosome, are predictive. In K. pneumoniae, M. tuberculosis, and S. enterica, the subalignments are able to predict multiple AMR phenotypes with at least 70% accuracy, even though most do not encode an AMR-related function. We used these models to identify regions of the chromosome with high and low predictive signals. Finally, subalignments that retain high accuracy across larger phylogenetic distances were examined in greater detail, revealing genes and intergenic regions with potential links to AMR, virulence, transport, and survival under stress conditions. IMPORTANCE Antimicrobial resistance causes thousands of deaths annually worldwide. Understanding the regions of the genome that are involved in antimicrobial resistance is important for developing mitigation strategies and preventing transmission. Machine learning models are capable of predicting antimicrobial resistance phenotypes from bacterial genome sequence data by identifying resistance genes, mutations, and other correlated features. They are also capable of implicating regions of the genome that have not been previously characterized as being involved in resistance. In this study, we generated global chromosomal alignments for Klebsiella pneumoniae, Mycobacterium tuberculosis, and Salmonella enterica and systematically searched them for small conserved regions of the genome that enable the prediction of antimicrobial resistance phenotypes. In addition to known antimicrobial resistance genes, this analysis identified genes involved in virulence and transport functions, as well as many genes with no previous implication in antimicrobial resistance. American Society for Microbiology 2021-06-15 /pmc/articles/PMC8269213/ /pubmed/34128695 http://dx.doi.org/10.1128/mSystems.00185-21 Text en https://doi.org/10.1128/AuthorWarrantyLicense.v1This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.
spellingShingle Research Article
Aytan-Aktug, D.
Nguyen, M.
Clausen, P. T. L. C.
Stevens, R. L.
Aarestrup, F. M.
Lund, O.
Davis, J. J.
Predicting Antimicrobial Resistance Using Partial Genome Alignments
title Predicting Antimicrobial Resistance Using Partial Genome Alignments
title_full Predicting Antimicrobial Resistance Using Partial Genome Alignments
title_fullStr Predicting Antimicrobial Resistance Using Partial Genome Alignments
title_full_unstemmed Predicting Antimicrobial Resistance Using Partial Genome Alignments
title_short Predicting Antimicrobial Resistance Using Partial Genome Alignments
title_sort predicting antimicrobial resistance using partial genome alignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8269213/
https://www.ncbi.nlm.nih.gov/pubmed/34128695
http://dx.doi.org/10.1128/mSystems.00185-21
work_keys_str_mv AT aytanaktugd predictingantimicrobialresistanceusingpartialgenomealignments
AT nguyenm predictingantimicrobialresistanceusingpartialgenomealignments
AT clausenptlc predictingantimicrobialresistanceusingpartialgenomealignments
AT stevensrl predictingantimicrobialresistanceusingpartialgenomealignments
AT aarestrupfm predictingantimicrobialresistanceusingpartialgenomealignments
AT lundo predictingantimicrobialresistanceusingpartialgenomealignments
AT davisjj predictingantimicrobialresistanceusingpartialgenomealignments