Cargando…
The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, eith...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824338/ https://www.ncbi.nlm.nih.gov/pubmed/36615850 http://dx.doi.org/10.3390/nu15010195 |
_version_ | 1784866385237639168 |
---|---|
author | Maugeri, Andrea Barchitta, Martina Favara, Giuliana La Mastra, Claudia La Rosa, Maria Clara Magnano San Lio, Roberta Agodi, Antonella |
author_facet | Maugeri, Andrea Barchitta, Martina Favara, Giuliana La Mastra, Claudia La Rosa, Maria Clara Magnano San Lio, Roberta Agodi, Antonella |
author_sort | Maugeri, Andrea |
collection | PubMed |
description | In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, either applied separately or in parallel. Here, we propose a workflow to combine PCA, hierarchical clustering, and a K-means algorithm in a novel approach for dietary pattern derivation. Since the workflow presents certain subjective decisions that might affect the final clustering solution, we also provide some alternatives in relation to different dietary data used. For example, we used the dietary data of 855 women from Catania, Italy. Our approach—defined as clustering on principal components—could be useful to leverage the strengths of each method and to obtain a better cluster solution. In fact, it seemed to disentangle dietary data better than simple clustering algorithms. However, before choosing between the alternatives proposed, it is suggested to consider the nature of dietary data and the main questions raised by the research. |
format | Online Article Text |
id | pubmed-9824338 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-98243382023-01-08 The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns Maugeri, Andrea Barchitta, Martina Favara, Giuliana La Mastra, Claudia La Rosa, Maria Clara Magnano San Lio, Roberta Agodi, Antonella Nutrients Article In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, either applied separately or in parallel. Here, we propose a workflow to combine PCA, hierarchical clustering, and a K-means algorithm in a novel approach for dietary pattern derivation. Since the workflow presents certain subjective decisions that might affect the final clustering solution, we also provide some alternatives in relation to different dietary data used. For example, we used the dietary data of 855 women from Catania, Italy. Our approach—defined as clustering on principal components—could be useful to leverage the strengths of each method and to obtain a better cluster solution. In fact, it seemed to disentangle dietary data better than simple clustering algorithms. However, before choosing between the alternatives proposed, it is suggested to consider the nature of dietary data and the main questions raised by the research. MDPI 2022-12-30 /pmc/articles/PMC9824338/ /pubmed/36615850 http://dx.doi.org/10.3390/nu15010195 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Maugeri, Andrea Barchitta, Martina Favara, Giuliana La Mastra, Claudia La Rosa, Maria Clara Magnano San Lio, Roberta Agodi, Antonella The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title | The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title_full | The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title_fullStr | The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title_full_unstemmed | The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title_short | The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns |
title_sort | application of clustering on principal components for nutritional epidemiology: a workflow to derive dietary patterns |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824338/ https://www.ncbi.nlm.nih.gov/pubmed/36615850 http://dx.doi.org/10.3390/nu15010195 |
work_keys_str_mv | AT maugeriandrea theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT barchittamartina theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT favaragiuliana theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT lamastraclaudia theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT larosamariaclara theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT magnanosanlioroberta theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT agodiantonella theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT maugeriandrea applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT barchittamartina applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT favaragiuliana applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT lamastraclaudia applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT larosamariaclara applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT magnanosanlioroberta applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns AT agodiantonella applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns |