Cargando…
A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9230378/ https://www.ncbi.nlm.nih.gov/pubmed/35745545 http://dx.doi.org/10.3390/pathogens11060691 |
_version_ | 1784735043106635776 |
---|---|
author | Tanui, Collins K. Benefo, Edmund O. Karanth, Shraddha Pradhan, Abani K. |
author_facet | Tanui, Collins K. Benefo, Edmund O. Karanth, Shraddha Pradhan, Abani K. |
author_sort | Tanui, Collins K. |
collection | PubMed |
description | Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5 % of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources. |
format | Online Article Text |
id | pubmed-9230378 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-92303782022-06-25 A Machine Learning Model for Food Source Attribution of Listeria monocytogenes Tanui, Collins K. Benefo, Edmund O. Karanth, Shraddha Pradhan, Abani K. Pathogens Article Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5 % of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources. MDPI 2022-06-16 /pmc/articles/PMC9230378/ /pubmed/35745545 http://dx.doi.org/10.3390/pathogens11060691 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tanui, Collins K. Benefo, Edmund O. Karanth, Shraddha Pradhan, Abani K. A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title | A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title_full | A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title_fullStr | A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title_full_unstemmed | A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title_short | A Machine Learning Model for Food Source Attribution of Listeria monocytogenes |
title_sort | machine learning model for food source attribution of listeria monocytogenes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9230378/ https://www.ncbi.nlm.nih.gov/pubmed/35745545 http://dx.doi.org/10.3390/pathogens11060691 |
work_keys_str_mv | AT tanuicollinsk amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT benefoedmundo amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT karanthshraddha amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT pradhanabanik amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT tanuicollinsk machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT benefoedmundo machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT karanthshraddha machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes AT pradhanabanik machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes |