Cargando…

Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis

Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) con...

Descripción completa

Detalles Bibliográficos
Autores principales: Côté, Mélina, Osseni, Mazid Abiodoun, Brassard, Didier, Carbonneau, Élise, Robitaille, Julie, Vohl, Marie-Claude, Lemieux, Simone, Laviolette, François, Lamarche, Benoît
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8891134/
https://www.ncbi.nlm.nih.gov/pubmed/35252288
http://dx.doi.org/10.3389/fnut.2022.740898
_version_ 1784661803012194304
author Côté, Mélina
Osseni, Mazid Abiodoun
Brassard, Didier
Carbonneau, Élise
Robitaille, Julie
Vohl, Marie-Claude
Lemieux, Simone
Laviolette, François
Lamarche, Benoît
author_facet Côté, Mélina
Osseni, Mazid Abiodoun
Brassard, Didier
Carbonneau, Élise
Robitaille, Julie
Vohl, Marie-Claude
Lemieux, Simone
Laviolette, François
Lamarche, Benoît
author_sort Côté, Mélina
collection PubMed
description Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors.
format Online
Article
Text
id pubmed-8891134
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88911342022-03-04 Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis Côté, Mélina Osseni, Mazid Abiodoun Brassard, Didier Carbonneau, Élise Robitaille, Julie Vohl, Marie-Claude Lemieux, Simone Laviolette, François Lamarche, Benoît Front Nutr Nutrition Machine learning (ML) algorithms may help better understand the complex interactions among factors that influence dietary choices and behaviors. The aim of this study was to explore whether ML algorithms are more accurate than traditional statistical models in predicting vegetable and fruit (VF) consumption. A large array of features (2,452 features from 525 variables) encompassing individual and environmental information related to dietary habits and food choices in a sample of 1,147 French-speaking adult men and women was used for the purpose of this study. Adequate VF consumption, which was defined as 5 servings/d or more, was measured by averaging data from three web-based 24 h recalls and used as the outcome to predict. Nine classification ML algorithms were compared to two traditional statistical predictive models, logistic regression and penalized regression (Lasso). The performance of the predictive ML algorithms was tested after the implementation of adjustments, including normalizing the data, as well as in a series of sensitivity analyses such as using VF consumption obtained from a web-based food frequency questionnaire (wFFQ) and applying a feature selection algorithm in an attempt to reduce overfitting. Logistic regression and Lasso predicted adequate VF consumption with an accuracy of 0.64 (95% confidence interval [CI]: 0.58–0.70) and 0.64 (95%CI: 0.60–0.68) respectively. Among the ML algorithms tested, the most accurate algorithms to predict adequate VF consumption were the support vector machine (SVM) with either a radial basis kernel or a sigmoid kernel, both with an accuracy of 0.65 (95%CI: 0.59–0.71). The least accurate ML algorithm was the SVM with a linear kernel with an accuracy of 0.55 (95%CI: 0.49–0.61). Using dietary intake data from the wFFQ and applying a feature selection algorithm had little to no impact on the performance of the algorithms. In summary, ML algorithms and traditional statistical models predicted adequate VF consumption with similar accuracies among adults. These results suggest that additional research is needed to explore further the true potential of ML in predicting dietary behaviours that are determined by complex interactions among several individual, social and environmental factors. Frontiers Media S.A. 2022-02-17 /pmc/articles/PMC8891134/ /pubmed/35252288 http://dx.doi.org/10.3389/fnut.2022.740898 Text en Copyright © 2022 Côté, Osseni, Brassard, Carbonneau, Robitaille, Vohl, Lemieux, Laviolette and Lamarche. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Nutrition
Côté, Mélina
Osseni, Mazid Abiodoun
Brassard, Didier
Carbonneau, Élise
Robitaille, Julie
Vohl, Marie-Claude
Lemieux, Simone
Laviolette, François
Lamarche, Benoît
Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_full Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_fullStr Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_full_unstemmed Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_short Are Machine Learning Algorithms More Accurate in Predicting Vegetable and Fruit Consumption Than Traditional Statistical Models? An Exploratory Analysis
title_sort are machine learning algorithms more accurate in predicting vegetable and fruit consumption than traditional statistical models? an exploratory analysis
topic Nutrition
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8891134/
https://www.ncbi.nlm.nih.gov/pubmed/35252288
http://dx.doi.org/10.3389/fnut.2022.740898
work_keys_str_mv AT cotemelina aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT ossenimazidabiodoun aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT brassarddidier aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT carbonneauelise aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT robitaillejulie aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT vohlmarieclaude aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT lemieuxsimone aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT laviolettefrancois aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis
AT lamarchebenoit aremachinelearningalgorithmsmoreaccurateinpredictingvegetableandfruitconsumptionthantraditionalstatisticalmodelsanexploratoryanalysis