Cargando…

AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains

The partitioning of pathogenic strains isolated in environmental or human cases to their sources is challenging. The pathogens usually colonize multiple animal hosts, including livestock, which contaminate the food-production chain and the environment (e.g. soil and water), posing an additional publ...

Descripción completa

Detalles Bibliográficos
Autores principales: Guillier, Laurent, Gourmelon, Michèle, Lozach, Solen, Cadel-Six, Sabrina, Vignaud, Marie-Léone, Munck, Nanna, Hald, Tine, Palma, Federica
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Microbiology Society 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7478624/
https://www.ncbi.nlm.nih.gov/pubmed/32320376
http://dx.doi.org/10.1099/mgen.0.000366
_version_ 1783580093857136640
author Guillier, Laurent
Gourmelon, Michèle
Lozach, Solen
Cadel-Six, Sabrina
Vignaud, Marie-Léone
Munck, Nanna
Hald, Tine
Palma, Federica
author_facet Guillier, Laurent
Gourmelon, Michèle
Lozach, Solen
Cadel-Six, Sabrina
Vignaud, Marie-Léone
Munck, Nanna
Hald, Tine
Palma, Federica
author_sort Guillier, Laurent
collection PubMed
description The partitioning of pathogenic strains isolated in environmental or human cases to their sources is challenging. The pathogens usually colonize multiple animal hosts, including livestock, which contaminate the food-production chain and the environment (e.g. soil and water), posing an additional public-health burden and major challenges in the identification of the source. Genomic data opens up new opportunities for the development of statistical models aiming to indicate the likely source of pathogen contamination. Here, we propose a computationally fast and efficient multinomial logistic regression source-attribution classifier to predict the animal source of bacterial isolates based on ‘source-enriched’ loci extracted from the accessory-genome profiles of a pangenomic dataset. Depending on the accuracy of the model’s self-attribution step, the modeller selects the number of candidate accessory genes that best fit the model for calculating the likelihood of (source) category membership. The Accessory genes-Based Source Attribution (AB_SA) method was applied to a dataset of strains of Salmonella enterica Typhimurium and its monophasic variant ( S . enterica 1,4,[5],12:i:-). The model was trained on 69 strains with known animal-source categories (i.e. poultry, ruminant and pig). The AB_SA method helped to identify 8 genes as predictors among the 2802 accessory genes. The self-attribution accuracy was 80 %. The AB_SA model was then able to classify 25 of the 29 S . enterica Typhimurium and S . enterica 1,4,[5],12:i:- isolates collected from the environment (considered to be of unknown source) into a specific category (i.e. animal source), with more than 85 % of probability. The AB_SA method herein described provides a user-friendly and valuable tool for performing source-attribution studies in only a few steps. AB_SA is written in R and freely available at https://github.com/lguillier/AB_SA.
format Online
Article
Text
id pubmed-7478624
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Microbiology Society
record_format MEDLINE/PubMed
spelling pubmed-74786242020-09-09 AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains Guillier, Laurent Gourmelon, Michèle Lozach, Solen Cadel-Six, Sabrina Vignaud, Marie-Léone Munck, Nanna Hald, Tine Palma, Federica Microb Genom Method The partitioning of pathogenic strains isolated in environmental or human cases to their sources is challenging. The pathogens usually colonize multiple animal hosts, including livestock, which contaminate the food-production chain and the environment (e.g. soil and water), posing an additional public-health burden and major challenges in the identification of the source. Genomic data opens up new opportunities for the development of statistical models aiming to indicate the likely source of pathogen contamination. Here, we propose a computationally fast and efficient multinomial logistic regression source-attribution classifier to predict the animal source of bacterial isolates based on ‘source-enriched’ loci extracted from the accessory-genome profiles of a pangenomic dataset. Depending on the accuracy of the model’s self-attribution step, the modeller selects the number of candidate accessory genes that best fit the model for calculating the likelihood of (source) category membership. The Accessory genes-Based Source Attribution (AB_SA) method was applied to a dataset of strains of Salmonella enterica Typhimurium and its monophasic variant ( S . enterica 1,4,[5],12:i:-). The model was trained on 69 strains with known animal-source categories (i.e. poultry, ruminant and pig). The AB_SA method helped to identify 8 genes as predictors among the 2802 accessory genes. The self-attribution accuracy was 80 %. The AB_SA model was then able to classify 25 of the 29 S . enterica Typhimurium and S . enterica 1,4,[5],12:i:- isolates collected from the environment (considered to be of unknown source) into a specific category (i.e. animal source), with more than 85 % of probability. The AB_SA method herein described provides a user-friendly and valuable tool for performing source-attribution studies in only a few steps. AB_SA is written in R and freely available at https://github.com/lguillier/AB_SA. Microbiology Society 2020-04-22 /pmc/articles/PMC7478624/ /pubmed/32320376 http://dx.doi.org/10.1099/mgen.0.000366 Text en © 2020 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License.
spellingShingle Method
Guillier, Laurent
Gourmelon, Michèle
Lozach, Solen
Cadel-Six, Sabrina
Vignaud, Marie-Léone
Munck, Nanna
Hald, Tine
Palma, Federica
AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title_full AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title_fullStr AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title_full_unstemmed AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title_short AB_SA: Accessory genes-Based Source Attribution – tracing the source of Salmonella enterica Typhimurium environmental strains
title_sort ab_sa: accessory genes-based source attribution – tracing the source of salmonella enterica typhimurium environmental strains
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7478624/
https://www.ncbi.nlm.nih.gov/pubmed/32320376
http://dx.doi.org/10.1099/mgen.0.000366
work_keys_str_mv AT guillierlaurent absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT gourmelonmichele absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT lozachsolen absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT cadelsixsabrina absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT vignaudmarieleone absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT muncknanna absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT haldtine absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains
AT palmafederica absaaccessorygenesbasedsourceattributiontracingthesourceofsalmonellaentericatyphimuriumenvironmentalstrains