Cargando…
Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches
Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426533/ https://www.ncbi.nlm.nih.gov/pubmed/35862809 http://dx.doi.org/10.1128/msystems.00378-22 |
_version_ | 1784778702729510912 |
---|---|
author | Sassi, Mohamed Bronsard, Julie Pascreau, Gaetan Emily, Mathieu Donnio, Pierre-Yves Revest, Matthieu Felden, Brice Wirth, Thierry Augagneur, Yoann |
author_facet | Sassi, Mohamed Bronsard, Julie Pascreau, Gaetan Emily, Mathieu Donnio, Pierre-Yves Revest, Matthieu Felden, Brice Wirth, Thierry Augagneur, Yoann |
author_sort | Sassi, Mohamed |
collection | PubMed |
description | Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of 356 strains, spanning a wide range of hosts, origins, and clinical display and antibiotic resistance profiles. We used genome-wide association study (GWAS) and random forest (RF) algorithms to discriminate strains based on their origins and clinical sources. Here, we show that the presence of sak and scn can discriminate strains based on their host specificity, while other genes such as mecA are often associated with virulent outcomes. Both GWAS and RF indicated the importance of intergenic regions (IGRs) and coding DNA sequence (CDS) but not sRNAs in forecasting an outcome. Additional transcriptomic analyses performed on the most prevalent clonal complex 8 (CC8) clonal types, in media mimicking nasal colonization or bacteremia, indicated three RNAs as potential RNA markers to forecast infection, followed by 30 others that could serve as infection severity predictors. Our report shows that genetic association and transcriptomics are complementary approaches that will be combined in a single analytical framework to improve our understanding of bacterial pathogenesis and ultimately identify potential predictive molecular markers. IMPORTANCE Predicting the outcome of bacterial colonization and infections, based on extensive genomic and transcriptomic data from a given pathogen, would be of substantial help for clinicians in treating and curing patients. In this report, genome-wide association studies and random forest algorithms have defined gene combinations that differentiate human from animal strains, colonization from diseases, and nonsevere from severe diseases, while it revealed the importance of IGRs and CDS, but not small RNAs (sRNAs), in anticipating an outcome. In addition, transcriptomic analyses performed on the most prevalent clonal types, in media mimicking either nasal colonization or bacteremia, revealed significant differences and therefore potent RNA markers. Overall, the use of both genomic and transcriptomic data in a single analytical framework can enhance our understanding of bacterial pathogenesis. |
format | Online Article Text |
id | pubmed-9426533 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-94265332022-08-31 Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches Sassi, Mohamed Bronsard, Julie Pascreau, Gaetan Emily, Mathieu Donnio, Pierre-Yves Revest, Matthieu Felden, Brice Wirth, Thierry Augagneur, Yoann mSystems Research Article Staphylococcus aureus is a major human and animal pathogen, colonizing diverse ecological niches within its hosts. Predicting whether an isolate will infect a specific host and its subsequent clinical fate remains unknown. In this study, we investigated the S. aureus pangenome using a curated set of 356 strains, spanning a wide range of hosts, origins, and clinical display and antibiotic resistance profiles. We used genome-wide association study (GWAS) and random forest (RF) algorithms to discriminate strains based on their origins and clinical sources. Here, we show that the presence of sak and scn can discriminate strains based on their host specificity, while other genes such as mecA are often associated with virulent outcomes. Both GWAS and RF indicated the importance of intergenic regions (IGRs) and coding DNA sequence (CDS) but not sRNAs in forecasting an outcome. Additional transcriptomic analyses performed on the most prevalent clonal complex 8 (CC8) clonal types, in media mimicking nasal colonization or bacteremia, indicated three RNAs as potential RNA markers to forecast infection, followed by 30 others that could serve as infection severity predictors. Our report shows that genetic association and transcriptomics are complementary approaches that will be combined in a single analytical framework to improve our understanding of bacterial pathogenesis and ultimately identify potential predictive molecular markers. IMPORTANCE Predicting the outcome of bacterial colonization and infections, based on extensive genomic and transcriptomic data from a given pathogen, would be of substantial help for clinicians in treating and curing patients. In this report, genome-wide association studies and random forest algorithms have defined gene combinations that differentiate human from animal strains, colonization from diseases, and nonsevere from severe diseases, while it revealed the importance of IGRs and CDS, but not small RNAs (sRNAs), in anticipating an outcome. In addition, transcriptomic analyses performed on the most prevalent clonal types, in media mimicking either nasal colonization or bacteremia, revealed significant differences and therefore potent RNA markers. Overall, the use of both genomic and transcriptomic data in a single analytical framework can enhance our understanding of bacterial pathogenesis. American Society for Microbiology 2022-07-05 /pmc/articles/PMC9426533/ /pubmed/35862809 http://dx.doi.org/10.1128/msystems.00378-22 Text en Copyright © 2022 Sassi et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Sassi, Mohamed Bronsard, Julie Pascreau, Gaetan Emily, Mathieu Donnio, Pierre-Yves Revest, Matthieu Felden, Brice Wirth, Thierry Augagneur, Yoann Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title | Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title_full | Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title_fullStr | Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title_full_unstemmed | Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title_short | Forecasting Staphylococcus aureus Infections Using Genome-Wide Association Studies, Machine Learning, and Transcriptomic Approaches |
title_sort | forecasting staphylococcus aureus infections using genome-wide association studies, machine learning, and transcriptomic approaches |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9426533/ https://www.ncbi.nlm.nih.gov/pubmed/35862809 http://dx.doi.org/10.1128/msystems.00378-22 |
work_keys_str_mv | AT sassimohamed forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT bronsardjulie forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT pascreaugaetan forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT emilymathieu forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT donniopierreyves forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT revestmatthieu forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT feldenbrice forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT wirththierry forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches AT augagneuryoann forecastingstaphylococcusaureusinfectionsusinggenomewideassociationstudiesmachinelearningandtranscriptomicapproaches |