Cargando…
Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods
Machine learning approaches are alternative modelling techniques to traditional modelling equations used in predictive food microbiology and utilise algorithms to analyse large datasets that contain information about microbial growth or survival in various food matrices. These approaches leverage th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381478/ https://www.ncbi.nlm.nih.gov/pubmed/37511805 http://dx.doi.org/10.3390/life13071430 |
_version_ | 1785080454143016960 |
---|---|
author | Tarlak, Fatih Yücel, Özgün |
author_facet | Tarlak, Fatih Yücel, Özgün |
author_sort | Tarlak, Fatih |
collection | PubMed |
description | Machine learning approaches are alternative modelling techniques to traditional modelling equations used in predictive food microbiology and utilise algorithms to analyse large datasets that contain information about microbial growth or survival in various food matrices. These approaches leverage the power of algorithms to extract insights from the data and make predictions regarding the behaviour of microorganisms in different food environments. The objective of this study was to apply various machine learning-based regression methods, including support vector regression (SVR), Gaussian process regression (GPR), decision tree regression (DTR), and random forest regression (RFR), to estimate bacterial populations. In order to achieve this, a total of 5618 data points for Pseudomonas spp. present in food products (beef, pork, and poultry) and culture media were gathered from the ComBase database. The machine learning algorithms were applied to predict the growth or survival behaviour of Pseudomonas spp. in food products and culture media by considering predictor variables such as temperature, salt concentration, water activity, and acidity. The suitability of the algorithms was assessed using statistical measures such as coefficient of determination (R(2)), root mean square error (RMSE), bias factor (Bf), and accuracy (A(f)). Each of the regression algorithms showed appropriate estimation capabilities with R(2) ranging from 0.886 to 0.913, RMSE from 0.724 to 0.899, B(f) from 1.012 to 1.020, and A(f) from 1.086 to 1.101 for each food product and culture medium. Since the predictive capability of RFR was the best among the algorithms, externally collected data from the literature were used for RFR. The external validation process showed statistical indices of B(f) ranging from 0.951 to 1.040 and A(f) ranging from 1.091 to 1.130, indicating that RFR can be used for predicting the survival and growth of microorganisms in food products. Therefore, machine learning approaches can be considered as an alternative to conventional modelling methods in predictive microbiology. However, it is important to highlight that the prediction power of the machine learning regression method directly depends on the dataset size, and it requires a large dataset to be employed for modelling. Therefore, the modelling work of this study can only be used for the prediction of Pseudomonas spp. in specific food products (beef, pork, and poultry) and culture medium with certain conditions where a large dataset is available. |
format | Online Article Text |
id | pubmed-10381478 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-103814782023-07-29 Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods Tarlak, Fatih Yücel, Özgün Life (Basel) Article Machine learning approaches are alternative modelling techniques to traditional modelling equations used in predictive food microbiology and utilise algorithms to analyse large datasets that contain information about microbial growth or survival in various food matrices. These approaches leverage the power of algorithms to extract insights from the data and make predictions regarding the behaviour of microorganisms in different food environments. The objective of this study was to apply various machine learning-based regression methods, including support vector regression (SVR), Gaussian process regression (GPR), decision tree regression (DTR), and random forest regression (RFR), to estimate bacterial populations. In order to achieve this, a total of 5618 data points for Pseudomonas spp. present in food products (beef, pork, and poultry) and culture media were gathered from the ComBase database. The machine learning algorithms were applied to predict the growth or survival behaviour of Pseudomonas spp. in food products and culture media by considering predictor variables such as temperature, salt concentration, water activity, and acidity. The suitability of the algorithms was assessed using statistical measures such as coefficient of determination (R(2)), root mean square error (RMSE), bias factor (Bf), and accuracy (A(f)). Each of the regression algorithms showed appropriate estimation capabilities with R(2) ranging from 0.886 to 0.913, RMSE from 0.724 to 0.899, B(f) from 1.012 to 1.020, and A(f) from 1.086 to 1.101 for each food product and culture medium. Since the predictive capability of RFR was the best among the algorithms, externally collected data from the literature were used for RFR. The external validation process showed statistical indices of B(f) ranging from 0.951 to 1.040 and A(f) ranging from 1.091 to 1.130, indicating that RFR can be used for predicting the survival and growth of microorganisms in food products. Therefore, machine learning approaches can be considered as an alternative to conventional modelling methods in predictive microbiology. However, it is important to highlight that the prediction power of the machine learning regression method directly depends on the dataset size, and it requires a large dataset to be employed for modelling. Therefore, the modelling work of this study can only be used for the prediction of Pseudomonas spp. in specific food products (beef, pork, and poultry) and culture medium with certain conditions where a large dataset is available. MDPI 2023-06-22 /pmc/articles/PMC10381478/ /pubmed/37511805 http://dx.doi.org/10.3390/life13071430 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tarlak, Fatih Yücel, Özgün Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title | Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title_full | Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title_fullStr | Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title_full_unstemmed | Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title_short | Prediction of Pseudomonas spp. Population in Food Products and Culture Media Using Machine Learning-Based Regression Methods |
title_sort | prediction of pseudomonas spp. population in food products and culture media using machine learning-based regression methods |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10381478/ https://www.ncbi.nlm.nih.gov/pubmed/37511805 http://dx.doi.org/10.3390/life13071430 |
work_keys_str_mv | AT tarlakfatih predictionofpseudomonasspppopulationinfoodproductsandculturemediausingmachinelearningbasedregressionmethods AT yucelozgun predictionofpseudomonasspppopulationinfoodproductsandculturemediausingmachinelearningbasedregressionmethods |