Cargando…
A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7448275/ https://www.ncbi.nlm.nih.gov/pubmed/32843552 http://dx.doi.org/10.1128/mBio.01527-20 |
_version_ | 1783574469698125824 |
---|---|
author | Pincus, Nathan B. Ozer, Egon A. Allen, Jonathan P. Nguyen, Marcus Davis, James J. Winter, Deborah R. Chuang, Chih-Hsien Chiu, Cheng-Hsun Zamorano, Laura Oliver, Antonio Hauser, Alan R. |
author_facet | Pincus, Nathan B. Ozer, Egon A. Allen, Jonathan P. Nguyen, Marcus Davis, James J. Winter, Deborah R. Chuang, Chih-Hsien Chiu, Cheng-Hsun Zamorano, Laura Oliver, Antonio Hauser, Alan R. |
author_sort | Pincus, Nathan B. |
collection | PubMed |
description | Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning approach to a genetically and phenotypically diverse collection of 115 clinical P. aeruginosa isolates using genomic information and corresponding virulence phenotypes in a mouse model of bacteremia. We defined the accessory genome of these isolates through the presence or absence of accessory genomic elements (AGEs), sequences present in some strains but not others. Machine learning models trained using AGEs were predictive of virulence, with a mean nested cross-validation accuracy of 75% using the random forest algorithm. However, individual AGEs did not have a large influence on the algorithm’s performance, suggesting instead that virulence predictions are derived from a diffuse genomic signature. These results were validated with an independent test set of 25 P. aeruginosa isolates whose virulence was predicted with 72% accuracy. Machine learning models trained using core genome single-nucleotide variants and whole-genome k-mers also predicted virulence. Our findings are a proof of concept for the use of bacterial genomes to predict pathogenicity in P. aeruginosa and highlight the potential of this approach for predicting patient outcomes. |
format | Online Article Text |
id | pubmed-7448275 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-74482752020-09-02 A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates Pincus, Nathan B. Ozer, Egon A. Allen, Jonathan P. Nguyen, Marcus Davis, James J. Winter, Deborah R. Chuang, Chih-Hsien Chiu, Cheng-Hsun Zamorano, Laura Oliver, Antonio Hauser, Alan R. mBio Research Article Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning approach to a genetically and phenotypically diverse collection of 115 clinical P. aeruginosa isolates using genomic information and corresponding virulence phenotypes in a mouse model of bacteremia. We defined the accessory genome of these isolates through the presence or absence of accessory genomic elements (AGEs), sequences present in some strains but not others. Machine learning models trained using AGEs were predictive of virulence, with a mean nested cross-validation accuracy of 75% using the random forest algorithm. However, individual AGEs did not have a large influence on the algorithm’s performance, suggesting instead that virulence predictions are derived from a diffuse genomic signature. These results were validated with an independent test set of 25 P. aeruginosa isolates whose virulence was predicted with 72% accuracy. Machine learning models trained using core genome single-nucleotide variants and whole-genome k-mers also predicted virulence. Our findings are a proof of concept for the use of bacterial genomes to predict pathogenicity in P. aeruginosa and highlight the potential of this approach for predicting patient outcomes. American Society for Microbiology 2020-08-25 /pmc/articles/PMC7448275/ /pubmed/32843552 http://dx.doi.org/10.1128/mBio.01527-20 Text en Copyright © 2020 Pincus et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Pincus, Nathan B. Ozer, Egon A. Allen, Jonathan P. Nguyen, Marcus Davis, James J. Winter, Deborah R. Chuang, Chih-Hsien Chiu, Cheng-Hsun Zamorano, Laura Oliver, Antonio Hauser, Alan R. A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title | A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title_full | A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title_fullStr | A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title_full_unstemmed | A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title_short | A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates |
title_sort | genome-based model to predict the virulence of pseudomonas aeruginosa isolates |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7448275/ https://www.ncbi.nlm.nih.gov/pubmed/32843552 http://dx.doi.org/10.1128/mBio.01527-20 |
work_keys_str_mv | AT pincusnathanb agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT ozeregona agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT allenjonathanp agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT nguyenmarcus agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT davisjamesj agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT winterdeborahr agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT chuangchihhsien agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT chiuchenghsun agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT zamoranolaura agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT oliverantonio agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT hauseralanr agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT pincusnathanb genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT ozeregona genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT allenjonathanp genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT nguyenmarcus genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT davisjamesj genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT winterdeborahr genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT chuangchihhsien genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT chiuchenghsun genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT zamoranolaura genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT oliverantonio genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates AT hauseralanr genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates |