Cargando…

A Machine Learning Model for Food Source Attribution of Listeria monocytogenes

Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes i...

Descripción completa

Detalles Bibliográficos
Autores principales: Tanui, Collins K., Benefo, Edmund O., Karanth, Shraddha, Pradhan, Abani K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9230378/
https://www.ncbi.nlm.nih.gov/pubmed/35745545
http://dx.doi.org/10.3390/pathogens11060691
_version_ 1784735043106635776
author Tanui, Collins K.
Benefo, Edmund O.
Karanth, Shraddha
Pradhan, Abani K.
author_facet Tanui, Collins K.
Benefo, Edmund O.
Karanth, Shraddha
Pradhan, Abani K.
author_sort Tanui, Collins K.
collection PubMed
description Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5 % of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources.
format Online
Article
Text
id pubmed-9230378
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-92303782022-06-25 A Machine Learning Model for Food Source Attribution of Listeria monocytogenes Tanui, Collins K. Benefo, Edmund O. Karanth, Shraddha Pradhan, Abani K. Pathogens Article Despite its low morbidity, listeriosis has a high mortality rate due to the severity of its clinical manifestations. The source of human listeriosis is often unclear. In this study, we investigate the ability of machine learning to predict the food source from which clinical Listeria monocytogenes isolates originated. Four machine learning classification algorithms were trained on core genome multilocus sequence typing data of 1212 L. monocytogenes isolates from various food sources. The average accuracies of random forest, support vector machine radial kernel, stochastic gradient boosting, and logit boost were found to be 0.72, 0.61, 0.7, and 0.73, respectively. Logit boost showed the best performance and was used in model testing on 154 L. monocytogenes clinical isolates. The model attributed 17.5 % of human clinical cases to dairy, 32.5% to fruits, 14.3% to leafy greens, 9.7% to meat, 4.6% to poultry, and 18.8% to vegetables. The final model also provided us with genetic features that were predictive of specific sources. Thus, this combination of genomic data and machine learning-based models can greatly enhance our ability to track L. monocytogenes from different food sources. MDPI 2022-06-16 /pmc/articles/PMC9230378/ /pubmed/35745545 http://dx.doi.org/10.3390/pathogens11060691 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tanui, Collins K.
Benefo, Edmund O.
Karanth, Shraddha
Pradhan, Abani K.
A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title_full A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title_fullStr A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title_full_unstemmed A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title_short A Machine Learning Model for Food Source Attribution of Listeria monocytogenes
title_sort machine learning model for food source attribution of listeria monocytogenes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9230378/
https://www.ncbi.nlm.nih.gov/pubmed/35745545
http://dx.doi.org/10.3390/pathogens11060691
work_keys_str_mv AT tanuicollinsk amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT benefoedmundo amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT karanthshraddha amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT pradhanabanik amachinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT tanuicollinsk machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT benefoedmundo machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT karanthshraddha machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes
AT pradhanabanik machinelearningmodelforfoodsourceattributionoflisteriamonocytogenes