Cargando…
Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model
Food frequency questionnaires (FFQs) are the most commonly selected tools in nutrition monitoring, as they are inexpensive, easily implemented and provide useful information regarding dietary intake. They are usually carefully drafted by experts from nutritional and/or medical fields and can be vali...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7764455/ https://www.ncbi.nlm.nih.gov/pubmed/33321959 http://dx.doi.org/10.3390/nu12123789 |
_version_ | 1783628260964302848 |
---|---|
author | Reščič, Nina Eftimov, Tome Koroušić Seljak, Barbara Luštrek, Mitja |
author_facet | Reščič, Nina Eftimov, Tome Koroušić Seljak, Barbara Luštrek, Mitja |
author_sort | Reščič, Nina |
collection | PubMed |
description | Food frequency questionnaires (FFQs) are the most commonly selected tools in nutrition monitoring, as they are inexpensive, easily implemented and provide useful information regarding dietary intake. They are usually carefully drafted by experts from nutritional and/or medical fields and can be validated by using other dietary monitoring techniques. FFQs can get very extensive, which could indicate that some of the questions are less significant than others and could be omitted without losing too much information. In this paper, machine learning is used to explore how reducing the number of questions affects the predicted nutrient values and diet quality score. The paper addresses the problem of removing redundant questions and finding the best subset of questions in the Extended Short Form Food Frequency Questionnaire (ESFFFQ), developed as part of the H2020 project WellCo. Eight common machine-learning algorithms were compared on different subsets of questions by using the PROMETHEE method, which compares methods and subsets via multiple performance measures. According to the results, for some of the targets, specifically sugar intake, fiber intake and protein intake, a smaller subset of questions are sufficient to predict diet quality scores. Additionally, for smaller subsets of questions, machine-learning algorithms generally perform better than statistical methods for predicting intake and diet quality scores. The proposed method could therefore be useful for finding the most informative subsets of questions in other FFQs as well. This could help experts develop FFQs that provide the necessary information and are not overbearing for those answering. |
format | Online Article Text |
id | pubmed-7764455 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-77644552020-12-27 Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model Reščič, Nina Eftimov, Tome Koroušić Seljak, Barbara Luštrek, Mitja Nutrients Article Food frequency questionnaires (FFQs) are the most commonly selected tools in nutrition monitoring, as they are inexpensive, easily implemented and provide useful information regarding dietary intake. They are usually carefully drafted by experts from nutritional and/or medical fields and can be validated by using other dietary monitoring techniques. FFQs can get very extensive, which could indicate that some of the questions are less significant than others and could be omitted without losing too much information. In this paper, machine learning is used to explore how reducing the number of questions affects the predicted nutrient values and diet quality score. The paper addresses the problem of removing redundant questions and finding the best subset of questions in the Extended Short Form Food Frequency Questionnaire (ESFFFQ), developed as part of the H2020 project WellCo. Eight common machine-learning algorithms were compared on different subsets of questions by using the PROMETHEE method, which compares methods and subsets via multiple performance measures. According to the results, for some of the targets, specifically sugar intake, fiber intake and protein intake, a smaller subset of questions are sufficient to predict diet quality scores. Additionally, for smaller subsets of questions, machine-learning algorithms generally perform better than statistical methods for predicting intake and diet quality scores. The proposed method could therefore be useful for finding the most informative subsets of questions in other FFQs as well. This could help experts develop FFQs that provide the necessary information and are not overbearing for those answering. MDPI 2020-12-10 /pmc/articles/PMC7764455/ /pubmed/33321959 http://dx.doi.org/10.3390/nu12123789 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Reščič, Nina Eftimov, Tome Koroušić Seljak, Barbara Luštrek, Mitja Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title | Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title_full | Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title_fullStr | Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title_full_unstemmed | Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title_short | Optimising an FFQ Using a Machine Learning Pipeline to teach an Efficient Nutrient Intake Predictive Model |
title_sort | optimising an ffq using a machine learning pipeline to teach an efficient nutrient intake predictive model |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7764455/ https://www.ncbi.nlm.nih.gov/pubmed/33321959 http://dx.doi.org/10.3390/nu12123789 |
work_keys_str_mv | AT rescicnina optimisinganffqusingamachinelearningpipelinetoteachanefficientnutrientintakepredictivemodel AT eftimovtome optimisinganffqusingamachinelearningpipelinetoteachanefficientnutrientintakepredictivemodel AT korousicseljakbarbara optimisinganffqusingamachinelearningpipelinetoteachanefficientnutrientintakepredictivemodel AT lustrekmitja optimisinganffqusingamachinelearningpipelinetoteachanefficientnutrientintakepredictivemodel |