Cargando…

Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends

Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-...

Descripción completa

Detalles Bibliográficos
Autores principales: Karanth, Shraddha, Patel, Jitendra, Shirmohammadi, Adel, Pradhan, Abani K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290999/
https://www.ncbi.nlm.nih.gov/pubmed/37377491
http://dx.doi.org/10.1016/j.crfs.2023.100525
_version_ 1785062605637812224
author Karanth, Shraddha
Patel, Jitendra
Shirmohammadi, Adel
Pradhan, Abani K.
author_facet Karanth, Shraddha
Patel, Jitendra
Shirmohammadi, Adel
Pradhan, Abani K.
author_sort Karanth, Shraddha
collection PubMed
description Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-species and genetic heterogeneity. In this study, we analyzed the effect of differential gene expression and a suite of meteorological factors on salmonellosis outbreak scale (typified by case numbers) using a combination of machine learning and count-based modeling methods. Elastic Net regularization model was used to identify significant genes from a Salmonella pan-genome, and a multi-variable Poisson regression developed to fit the individual and mixed effects data. The best-fit Elastic Net model (α = 0.50; λ = 2.18) identified 53 significant gene features. The final multi-variable Poisson regression model (χ(2) = 5748.22; pseudo R(2) = 0.669; probability > χ(2) = 0) identified 127 significant predictor terms (p < 0.10), comprising 45 gene-only predictors, average temperature, average precipitation, and average snowfall, and 79 gene-meteorological interaction terms. The significant genes ranged in functionality from cellular signaling and transport, virulence, metabolism, and stress response, and included gene variables not considered as significant by the baseline model. This study presents a holistic approach towards evaluating multiple data sources (such as genomic and environmental data) to predict outbreak scale, which could help in revising the estimates for human health risk.
format Online
Article
Text
id pubmed-10290999
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-102909992023-06-27 Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends Karanth, Shraddha Patel, Jitendra Shirmohammadi, Adel Pradhan, Abani K. Curr Res Food Sci Research Article Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-species and genetic heterogeneity. In this study, we analyzed the effect of differential gene expression and a suite of meteorological factors on salmonellosis outbreak scale (typified by case numbers) using a combination of machine learning and count-based modeling methods. Elastic Net regularization model was used to identify significant genes from a Salmonella pan-genome, and a multi-variable Poisson regression developed to fit the individual and mixed effects data. The best-fit Elastic Net model (α = 0.50; λ = 2.18) identified 53 significant gene features. The final multi-variable Poisson regression model (χ(2) = 5748.22; pseudo R(2) = 0.669; probability > χ(2) = 0) identified 127 significant predictor terms (p < 0.10), comprising 45 gene-only predictors, average temperature, average precipitation, and average snowfall, and 79 gene-meteorological interaction terms. The significant genes ranged in functionality from cellular signaling and transport, virulence, metabolism, and stress response, and included gene variables not considered as significant by the baseline model. This study presents a holistic approach towards evaluating multiple data sources (such as genomic and environmental data) to predict outbreak scale, which could help in revising the estimates for human health risk. Elsevier 2023-05-28 /pmc/articles/PMC10290999/ /pubmed/37377491 http://dx.doi.org/10.1016/j.crfs.2023.100525 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Karanth, Shraddha
Patel, Jitendra
Shirmohammadi, Adel
Pradhan, Abani K.
Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title_full Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title_fullStr Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title_full_unstemmed Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title_short Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
title_sort machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290999/
https://www.ncbi.nlm.nih.gov/pubmed/37377491
http://dx.doi.org/10.1016/j.crfs.2023.100525
work_keys_str_mv AT karanthshraddha machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends
AT pateljitendra machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends
AT shirmohammadiadel machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends
AT pradhanabanik machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends