Cargando…
Application of a new dietary pattern analysis method in nutritional epidemiology
BACKGROUND: Diet plays an important role in chronic disease, and the use of dietary pattern analysis has grown rapidly as a way of deconstructing the complexity of nutritional intake and its relation to health. Pattern analysis methods, such as principal component analysis (PCA), have been used to i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6206725/ https://www.ncbi.nlm.nih.gov/pubmed/30373530 http://dx.doi.org/10.1186/s12874-018-0585-8 |
_version_ | 1783366408438022144 |
---|---|
author | Zhang, Fengqing Tapera, Tinashe M. Gou, Jiangtao |
author_facet | Zhang, Fengqing Tapera, Tinashe M. Gou, Jiangtao |
author_sort | Zhang, Fengqing |
collection | PubMed |
description | BACKGROUND: Diet plays an important role in chronic disease, and the use of dietary pattern analysis has grown rapidly as a way of deconstructing the complexity of nutritional intake and its relation to health. Pattern analysis methods, such as principal component analysis (PCA), have been used to investigate various dimensions of diet. Existing analytic methods, however, do not fully utilize the predictive potential of dietary assessment data. In particular, these methods are often suboptimal at predicting clinically important variables. METHODS: We propose a new dietary pattern analysis method using the advanced LASSO (Least Absolute Shrinkage and Selection Operator) model to improve the prediction of disease-related risk factors. Despite the potential advantages of LASSO, this is the first time that the model has been adapted for dietary pattern analysis. Hence, the systematic evaluation of the LASSO model as applied to dietary data and health outcomes is highly innovative and novel. Using Food Frequency Questionnaire data from NHANES 2005–2006, we apply PCA and LASSO to identify dietary patterns related to cardiovascular disease risk factors in healthy US adults (n = 2609) after controlling for confounding variables (e.g., age and BMI). Both analyses account for the sampling weights. Model performance in terms of prediction accuracy is evaluated using an independent test set. RESULTS: PCA yields 10 principal components (PCs) that together account for 65% of the variation in the data set and represent distinct dietary patterns. These PCs are then used as predictors in a regression model to predict cardiovascular disease risk factors. We find that LASSO better predicts levels of triglycerides, LDL cholesterol, HDL cholesterol, and total cholesterol (adjusted R(2) = 0.861, 0.899, 0.890, and 0.935 respectively) than does the traditional, linear-regression-based, dietary pattern analysis method (adjusted R(2) = 0.163, 0.005, 0.235, and 0.024 respectively) when the latter is applied to components derived from PCA. CONCLUSIONS: The proposed method is shown to be an appropriate and promising statistical means of deriving dietary patterns predictive of cardiovascular disease risk. Future studies, involving different diseases and risk factors, will be necessary before LASSO’s broader usefulness in nutritional epidemiology can be established. |
format | Online Article Text |
id | pubmed-6206725 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62067252018-10-31 Application of a new dietary pattern analysis method in nutritional epidemiology Zhang, Fengqing Tapera, Tinashe M. Gou, Jiangtao BMC Med Res Methodol Research Article BACKGROUND: Diet plays an important role in chronic disease, and the use of dietary pattern analysis has grown rapidly as a way of deconstructing the complexity of nutritional intake and its relation to health. Pattern analysis methods, such as principal component analysis (PCA), have been used to investigate various dimensions of diet. Existing analytic methods, however, do not fully utilize the predictive potential of dietary assessment data. In particular, these methods are often suboptimal at predicting clinically important variables. METHODS: We propose a new dietary pattern analysis method using the advanced LASSO (Least Absolute Shrinkage and Selection Operator) model to improve the prediction of disease-related risk factors. Despite the potential advantages of LASSO, this is the first time that the model has been adapted for dietary pattern analysis. Hence, the systematic evaluation of the LASSO model as applied to dietary data and health outcomes is highly innovative and novel. Using Food Frequency Questionnaire data from NHANES 2005–2006, we apply PCA and LASSO to identify dietary patterns related to cardiovascular disease risk factors in healthy US adults (n = 2609) after controlling for confounding variables (e.g., age and BMI). Both analyses account for the sampling weights. Model performance in terms of prediction accuracy is evaluated using an independent test set. RESULTS: PCA yields 10 principal components (PCs) that together account for 65% of the variation in the data set and represent distinct dietary patterns. These PCs are then used as predictors in a regression model to predict cardiovascular disease risk factors. We find that LASSO better predicts levels of triglycerides, LDL cholesterol, HDL cholesterol, and total cholesterol (adjusted R(2) = 0.861, 0.899, 0.890, and 0.935 respectively) than does the traditional, linear-regression-based, dietary pattern analysis method (adjusted R(2) = 0.163, 0.005, 0.235, and 0.024 respectively) when the latter is applied to components derived from PCA. CONCLUSIONS: The proposed method is shown to be an appropriate and promising statistical means of deriving dietary patterns predictive of cardiovascular disease risk. Future studies, involving different diseases and risk factors, will be necessary before LASSO’s broader usefulness in nutritional epidemiology can be established. BioMed Central 2018-10-29 /pmc/articles/PMC6206725/ /pubmed/30373530 http://dx.doi.org/10.1186/s12874-018-0585-8 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Zhang, Fengqing Tapera, Tinashe M. Gou, Jiangtao Application of a new dietary pattern analysis method in nutritional epidemiology |
title | Application of a new dietary pattern analysis method in nutritional epidemiology |
title_full | Application of a new dietary pattern analysis method in nutritional epidemiology |
title_fullStr | Application of a new dietary pattern analysis method in nutritional epidemiology |
title_full_unstemmed | Application of a new dietary pattern analysis method in nutritional epidemiology |
title_short | Application of a new dietary pattern analysis method in nutritional epidemiology |
title_sort | application of a new dietary pattern analysis method in nutritional epidemiology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6206725/ https://www.ncbi.nlm.nih.gov/pubmed/30373530 http://dx.doi.org/10.1186/s12874-018-0585-8 |
work_keys_str_mv | AT zhangfengqing applicationofanewdietarypatternanalysismethodinnutritionalepidemiology AT taperatinashem applicationofanewdietarypatternanalysismethodinnutritionalepidemiology AT goujiangtao applicationofanewdietarypatternanalysismethodinnutritionalepidemiology |