Cargando…

A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates

Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Pincus, Nathan B., Ozer, Egon A., Allen, Jonathan P., Nguyen, Marcus, Davis, James J., Winter, Deborah R., Chuang, Chih-Hsien, Chiu, Cheng-Hsun, Zamorano, Laura, Oliver, Antonio, Hauser, Alan R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7448275/
https://www.ncbi.nlm.nih.gov/pubmed/32843552
http://dx.doi.org/10.1128/mBio.01527-20
_version_ 1783574469698125824
author Pincus, Nathan B.
Ozer, Egon A.
Allen, Jonathan P.
Nguyen, Marcus
Davis, James J.
Winter, Deborah R.
Chuang, Chih-Hsien
Chiu, Cheng-Hsun
Zamorano, Laura
Oliver, Antonio
Hauser, Alan R.
author_facet Pincus, Nathan B.
Ozer, Egon A.
Allen, Jonathan P.
Nguyen, Marcus
Davis, James J.
Winter, Deborah R.
Chuang, Chih-Hsien
Chiu, Cheng-Hsun
Zamorano, Laura
Oliver, Antonio
Hauser, Alan R.
author_sort Pincus, Nathan B.
collection PubMed
description Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning approach to a genetically and phenotypically diverse collection of 115 clinical P. aeruginosa isolates using genomic information and corresponding virulence phenotypes in a mouse model of bacteremia. We defined the accessory genome of these isolates through the presence or absence of accessory genomic elements (AGEs), sequences present in some strains but not others. Machine learning models trained using AGEs were predictive of virulence, with a mean nested cross-validation accuracy of 75% using the random forest algorithm. However, individual AGEs did not have a large influence on the algorithm’s performance, suggesting instead that virulence predictions are derived from a diffuse genomic signature. These results were validated with an independent test set of 25 P. aeruginosa isolates whose virulence was predicted with 72% accuracy. Machine learning models trained using core genome single-nucleotide variants and whole-genome k-mers also predicted virulence. Our findings are a proof of concept for the use of bacterial genomes to predict pathogenicity in P. aeruginosa and highlight the potential of this approach for predicting patient outcomes.
format Online
Article
Text
id pubmed-7448275
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-74482752020-09-02 A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates Pincus, Nathan B. Ozer, Egon A. Allen, Jonathan P. Nguyen, Marcus Davis, James J. Winter, Deborah R. Chuang, Chih-Hsien Chiu, Cheng-Hsun Zamorano, Laura Oliver, Antonio Hauser, Alan R. mBio Research Article Variation in the genome of Pseudomonas aeruginosa, an important pathogen, can have dramatic impacts on the bacterium’s ability to cause disease. We therefore asked whether it was possible to predict the virulence of P. aeruginosa isolates based on their genomic content. We applied a machine learning approach to a genetically and phenotypically diverse collection of 115 clinical P. aeruginosa isolates using genomic information and corresponding virulence phenotypes in a mouse model of bacteremia. We defined the accessory genome of these isolates through the presence or absence of accessory genomic elements (AGEs), sequences present in some strains but not others. Machine learning models trained using AGEs were predictive of virulence, with a mean nested cross-validation accuracy of 75% using the random forest algorithm. However, individual AGEs did not have a large influence on the algorithm’s performance, suggesting instead that virulence predictions are derived from a diffuse genomic signature. These results were validated with an independent test set of 25 P. aeruginosa isolates whose virulence was predicted with 72% accuracy. Machine learning models trained using core genome single-nucleotide variants and whole-genome k-mers also predicted virulence. Our findings are a proof of concept for the use of bacterial genomes to predict pathogenicity in P. aeruginosa and highlight the potential of this approach for predicting patient outcomes. American Society for Microbiology 2020-08-25 /pmc/articles/PMC7448275/ /pubmed/32843552 http://dx.doi.org/10.1128/mBio.01527-20 Text en Copyright © 2020 Pincus et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Pincus, Nathan B.
Ozer, Egon A.
Allen, Jonathan P.
Nguyen, Marcus
Davis, James J.
Winter, Deborah R.
Chuang, Chih-Hsien
Chiu, Cheng-Hsun
Zamorano, Laura
Oliver, Antonio
Hauser, Alan R.
A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title_full A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title_fullStr A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title_full_unstemmed A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title_short A Genome-Based Model to Predict the Virulence of Pseudomonas aeruginosa Isolates
title_sort genome-based model to predict the virulence of pseudomonas aeruginosa isolates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7448275/
https://www.ncbi.nlm.nih.gov/pubmed/32843552
http://dx.doi.org/10.1128/mBio.01527-20
work_keys_str_mv AT pincusnathanb agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT ozeregona agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT allenjonathanp agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT nguyenmarcus agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT davisjamesj agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT winterdeborahr agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT chuangchihhsien agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT chiuchenghsun agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT zamoranolaura agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT oliverantonio agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT hauseralanr agenomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT pincusnathanb genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT ozeregona genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT allenjonathanp genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT nguyenmarcus genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT davisjamesj genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT winterdeborahr genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT chuangchihhsien genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT chiuchenghsun genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT zamoranolaura genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT oliverantonio genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates
AT hauseralanr genomebasedmodeltopredictthevirulenceofpseudomonasaeruginosaisolates