Cargando…
Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends
Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290999/ https://www.ncbi.nlm.nih.gov/pubmed/37377491 http://dx.doi.org/10.1016/j.crfs.2023.100525 |
_version_ | 1785062605637812224 |
---|---|
author | Karanth, Shraddha Patel, Jitendra Shirmohammadi, Adel Pradhan, Abani K. |
author_facet | Karanth, Shraddha Patel, Jitendra Shirmohammadi, Adel Pradhan, Abani K. |
author_sort | Karanth, Shraddha |
collection | PubMed |
description | Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-species and genetic heterogeneity. In this study, we analyzed the effect of differential gene expression and a suite of meteorological factors on salmonellosis outbreak scale (typified by case numbers) using a combination of machine learning and count-based modeling methods. Elastic Net regularization model was used to identify significant genes from a Salmonella pan-genome, and a multi-variable Poisson regression developed to fit the individual and mixed effects data. The best-fit Elastic Net model (α = 0.50; λ = 2.18) identified 53 significant gene features. The final multi-variable Poisson regression model (χ(2) = 5748.22; pseudo R(2) = 0.669; probability > χ(2) = 0) identified 127 significant predictor terms (p < 0.10), comprising 45 gene-only predictors, average temperature, average precipitation, and average snowfall, and 79 gene-meteorological interaction terms. The significant genes ranged in functionality from cellular signaling and transport, virulence, metabolism, and stress response, and included gene variables not considered as significant by the baseline model. This study presents a holistic approach towards evaluating multiple data sources (such as genomic and environmental data) to predict outbreak scale, which could help in revising the estimates for human health risk. |
format | Online Article Text |
id | pubmed-10290999 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-102909992023-06-27 Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends Karanth, Shraddha Patel, Jitendra Shirmohammadi, Adel Pradhan, Abani K. Curr Res Food Sci Research Article Several studies have shown a correlation between outbreaks of Salmonella enterica and meteorological trends, especially related to temperature and precipitation. Additionally, current studies based on outbreaks are performed on data for the species Salmonella enterica, without considering its intra-species and genetic heterogeneity. In this study, we analyzed the effect of differential gene expression and a suite of meteorological factors on salmonellosis outbreak scale (typified by case numbers) using a combination of machine learning and count-based modeling methods. Elastic Net regularization model was used to identify significant genes from a Salmonella pan-genome, and a multi-variable Poisson regression developed to fit the individual and mixed effects data. The best-fit Elastic Net model (α = 0.50; λ = 2.18) identified 53 significant gene features. The final multi-variable Poisson regression model (χ(2) = 5748.22; pseudo R(2) = 0.669; probability > χ(2) = 0) identified 127 significant predictor terms (p < 0.10), comprising 45 gene-only predictors, average temperature, average precipitation, and average snowfall, and 79 gene-meteorological interaction terms. The significant genes ranged in functionality from cellular signaling and transport, virulence, metabolism, and stress response, and included gene variables not considered as significant by the baseline model. This study presents a holistic approach towards evaluating multiple data sources (such as genomic and environmental data) to predict outbreak scale, which could help in revising the estimates for human health risk. Elsevier 2023-05-28 /pmc/articles/PMC10290999/ /pubmed/37377491 http://dx.doi.org/10.1016/j.crfs.2023.100525 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Karanth, Shraddha Patel, Jitendra Shirmohammadi, Adel Pradhan, Abani K. Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title | Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title_full | Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title_fullStr | Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title_full_unstemmed | Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title_short | Machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
title_sort | machine learning to predict foodborne salmonellosis outbreaks based on genome characteristics and meteorological trends |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10290999/ https://www.ncbi.nlm.nih.gov/pubmed/37377491 http://dx.doi.org/10.1016/j.crfs.2023.100525 |
work_keys_str_mv | AT karanthshraddha machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends AT pateljitendra machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends AT shirmohammadiadel machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends AT pradhanabanik machinelearningtopredictfoodbornesalmonellosisoutbreaksbasedongenomecharacteristicsandmeteorologicaltrends |