Cargando…

Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches

Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of...

Descripción completa

Detalles Bibliográficos
Autores principales: Sassi, Mohamed, Bronsard, Julie, Pascreau, Gaetan, Emily, Mathieu, Donnio, Pierre-Yves, Revest, Matthieu, Felden, Brice, Wirth, Thierry, Augagneur, Yoann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426533/
https://www.ncbi.nlm.nih.gov/pubmed/35862809
http://dx.doi.org/10.1128/msystems.00378-22
_version_ 1784778702729510912
author Sassi, Mohamed
Bronsard, Julie
Pascreau, Gaetan
Emily, Mathieu
Donnio, Pierre-Yves
Revest, Matthieu
Felden, Brice
Wirth, Thierry
Augagneur, Yoann
author_facet Sassi, Mohamed
Bronsard, Julie
Pascreau, Gaetan
Emily, Mathieu
Donnio, Pierre-Yves
Revest, Matthieu
Felden, Brice
Wirth, Thierry
Augagneur, Yoann
author_sort Sassi, Mohamed
collection PubMed
description Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of 356 strains, spanning a wide range of hosts, origins, and clinical display and antibiotic resistance profiles. We used genome-wide association study (GWAS) and random forest (RF) algorithms to discriminate strains based on their origins and clinical sources. Here, we show that the presence of sak and scn can discriminate strains based on their host specificity, while other genes such as mecA are often associated with virulent outcomes. Both GWAS and RF indicated the importance of intergenic regions (IGRs) and coding DNA sequence (CDS) but not sRNAs in forecasting an outcome. Additional transcriptomic analyses performed on the most prevalent clonal complex 8 (CC8) clonal types, in media mimicking nasal colonization or bacteremia, indicated three RNAs as potential RNA markers to forecast infection, followed by 30 others that could serve as infection severity predictors. Our report shows that genetic association and transcriptomics are complementary approaches that will be combined in a single analytical framework to improve our understanding of bacterial pathogenesis and ultimately identify potential predictive molecular markers. IMPORTANCE Predicting the outcome of bacterial colonization and infections, based on extensive genomic and transcriptomic data from a given pathogen, would be of substantial help for clinicians in treating and curing patients. In this report, genome-wide association studies and random forest algorithms have defined gene combinations that differentiate human from animal strains, colonization from diseases, and nonsevere from severe diseases, while it revealed the importance of IGRs and CDS, but not small RNAs (sRNAs), in anticipating an outcome. In addition, transcriptomic analyses performed on the most prevalent clonal types, in media mimicking either nasal colonization or bacteremia, revealed significant differences and therefore potent RNA markers. Overall, the use of both genomic and transcriptomic data in a single analytical framework can enhance our understanding of bacterial pathogenesis.
format Online
Article
Text
id pubmed-9426533
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-94265332022-08-31 Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches Sassi, Mohamed Bronsard, Julie Pascreau, Gaetan Emily, Mathieu Donnio, Pierre-Yves Revest, Matthieu Felden, Brice Wirth, Thierry Augagneur, Yoann mSystems Research Article Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of 356 strains, spanning a wide range of hosts, origins, and clinical display and antibiotic resistance profiles. We used genome-wide association study (GWAS) and random forest (RF) algorithms to discriminate strains based on their origins and clinical sources. Here, we show that the presence of sak and scn can discriminate strains based on their host specificity, while other genes such as mecA are often associated with virulent outcomes. Both GWAS and RF indicated the importance of intergenic regions (IGRs) and coding DNA sequence (CDS) but not sRNAs in forecasting an outcome. Additional transcriptomic analyses performed on the most prevalent clonal complex 8 (CC8) clonal types, in media mimicking nasal colonization or bacteremia, indicated three RNAs as potential RNA markers to forecast infection, followed by 30 others that could serve as infection severity predictors. Our report shows that genetic association and transcriptomics are complementary approaches that will be combined in a single analytical framework to improve our understanding of bacterial pathogenesis and ultimately identify potential predictive molecular markers. IMPORTANCE Predicting the outcome of bacterial colonization and infections, based on extensive genomic and transcriptomic data from a given pathogen, would be of substantial help for clinicians in treating and curing patients. In this report, genome-wide association studies and random forest algorithms have defined gene combinations that differentiate human from animal strains, colonization from diseases, and nonsevere from severe diseases, while it revealed the importance of IGRs and CDS, but not small RNAs (sRNAs), in anticipating an outcome. In addition, transcriptomic analyses performed on the most prevalent clonal types, in media mimicking either nasal colonization or bacteremia, revealed significant differences and therefore potent RNA markers. Overall, the use of both genomic and transcriptomic data in a single analytical framework can enhance our understanding of bacterial pathogenesis. American Society for Microbiology 2022-07-05 /pmc/articles/PMC9426533/ /pubmed/35862809 http://dx.doi.org/10.1128/msystems.00378-22 Text en Copyright © 2022 Sassi et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Sassi, Mohamed
Bronsard, Julie
Pascreau, Gaetan
Emily, Mathieu
Donnio, Pierre-Yves
Revest, Matthieu
Felden, Brice
Wirth, Thierry
Augagneur, Yoann
Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title_full Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title_fullStr Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title_full_unstemmed Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title_short Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
title_sort forecasting staphylococcus aureus infections using genome-wide association studies, machine learning, and transcriptomic approaches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426533/
https://www.ncbi.nlm.nih.gov/pubmed/35862809
http://dx.doi.org/10.1128/msystems.00378-22
work_keys_str_mv AT sassimohamed forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT bronsardjulie forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT pascreaugaetan forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT emilymathieu forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT donniopierreyves forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT revestmatthieu forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT feldenbrice forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT wirththierry forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches
AT augagneuryoann forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches