Cargando…

Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis

Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) con...

Descripción completa

Detalles Bibliográficos
Autores principales:	Côté, Mélina, Osseni, Mazid Abiodoun, Brassard, Didier, Carbonneau, Élise, Robitaille, Julie, Vohl, Marie-Claude, Lemieux, Simone, Laviolette, François, Lamarche, Benoît
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Nutrition
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8891134/ https://www.ncbi.nlm.nih.gov/pubmed/35252288 http://dx.doi.org/10.3389/fnut.2022.740898

_version_	1784661803012194304
author	Côté, Mélina Osseni, Mazid Abiodoun Brassard, Didier Carbonneau, Élise Robitaille, Julie Vohl, Marie-Claude Lemieux, Simone Laviolette, François Lamarche, Benoît
author_facet	Côté, Mélina Osseni, Mazid Abiodoun Brassard, Didier Carbonneau, Élise Robitaille, Julie Vohl, Marie-Claude Lemieux, Simone Laviolette, François Lamarche, Benoît
author_sort	Côté, Mélina
collection	PubMed
description	Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors.
format	Online Article Text
id	pubmed-8891134
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-88911342022-03-04 Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis Côté, Mélina Osseni, Mazid Abiodoun Brassard, Didier Carbonneau, Élise Robitaille, Julie Vohl, Marie-Claude Lemieux, Simone Laviolette, François Lamarche, Benoît Front Nutr Nutrition Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors. Frontiers Media S.A. 2022-02-17 /pmc/articles/PMC8891134/ /pubmed/35252288 http://dx.doi.org/10.3389/fnut.2022.740898 Text en Copyright © 2022 Côté, Osseni, Brassard, Carbonneau, Robitaille, Vohl, Lemieux, Laviolette and Lamarche. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Nutrition Côté, Mélina Osseni, Mazid Abiodoun Brassard, Didier Carbonneau, Élise Robitaille, Julie Vohl, Marie-Claude Lemieux, Simone Laviolette, François Lamarche, Benoît Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title	Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_full	Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_fullStr	Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_full_unstemmed	Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_short	Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_sort	are machine learning algorithms more accurate in predicting vegetable and fruit consumption than traditional statistical models? an exploratory analysis
topic	Nutrition
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8891134/ https://www.ncbi.nlm.nih.gov/pubmed/35252288 http://dx.doi.org/10.3389/fnut.2022.740898
work_keys_str_mv	AT cotemelina aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT ossenimazidabiodoun aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT brassarddidier aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT carbonneauelise aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT robitaillejulie aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT vohlmarieclaude aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT lemieuxsimone aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT laviolettefrancois aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis AT lamarchebenoit aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis

Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis

Ejemplares similares