Cargando…

The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns

In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, eith...

Descripción completa

Detalles Bibliográficos
Autores principales: Maugeri, Andrea, Barchitta, Martina, Favara, Giuliana, La Mastra, Claudia, La Rosa, Maria Clara, Magnano San Lio, Roberta, Agodi, Antonella
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824338/
https://www.ncbi.nlm.nih.gov/pubmed/36615850
http://dx.doi.org/10.3390/nu15010195
_version_ 1784866385237639168
author Maugeri, Andrea
Barchitta, Martina
Favara, Giuliana
La Mastra, Claudia
La Rosa, Maria Clara
Magnano San Lio, Roberta
Agodi, Antonella
author_facet Maugeri, Andrea
Barchitta, Martina
Favara, Giuliana
La Mastra, Claudia
La Rosa, Maria Clara
Magnano San Lio, Roberta
Agodi, Antonella
author_sort Maugeri, Andrea
collection PubMed
description In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, either applied separately or in parallel. Here, we propose a workflow to combine PCA, hierarchical clustering, and a K-means algorithm in a novel approach for dietary pattern derivation. Since the workflow presents certain subjective decisions that might affect the final clustering solution, we also provide some alternatives in relation to different dietary data used. For example, we used the dietary data of 855 women from Catania, Italy. Our approach—defined as clustering on principal components—could be useful to leverage the strengths of each method and to obtain a better cluster solution. In fact, it seemed to disentangle dietary data better than simple clustering algorithms. However, before choosing between the alternatives proposed, it is suggested to consider the nature of dietary data and the main questions raised by the research.
format Online
Article
Text
id pubmed-9824338
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-98243382023-01-08 The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns Maugeri, Andrea Barchitta, Martina Favara, Giuliana La Mastra, Claudia La Rosa, Maria Clara Magnano San Lio, Roberta Agodi, Antonella Nutrients Article In the last decades, different multivariate techniques have been applied to multidimensional dietary datasets to identify meaningful patterns reflecting the dietary habits of populations. Among them, principal component analysis (PCA) and cluster analysis represent the two most used techniques, either applied separately or in parallel. Here, we propose a workflow to combine PCA, hierarchical clustering, and a K-means algorithm in a novel approach for dietary pattern derivation. Since the workflow presents certain subjective decisions that might affect the final clustering solution, we also provide some alternatives in relation to different dietary data used. For example, we used the dietary data of 855 women from Catania, Italy. Our approach—defined as clustering on principal components—could be useful to leverage the strengths of each method and to obtain a better cluster solution. In fact, it seemed to disentangle dietary data better than simple clustering algorithms. However, before choosing between the alternatives proposed, it is suggested to consider the nature of dietary data and the main questions raised by the research. MDPI 2022-12-30 /pmc/articles/PMC9824338/ /pubmed/36615850 http://dx.doi.org/10.3390/nu15010195 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Maugeri, Andrea
Barchitta, Martina
Favara, Giuliana
La Mastra, Claudia
La Rosa, Maria Clara
Magnano San Lio, Roberta
Agodi, Antonella
The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title_full The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title_fullStr The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title_full_unstemmed The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title_short The Application of Clustering on Principal Components for Nutritional Epidemiology: A Workflow to Derive Dietary Patterns
title_sort application of clustering on principal components for nutritional epidemiology: a workflow to derive dietary patterns
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9824338/
https://www.ncbi.nlm.nih.gov/pubmed/36615850
http://dx.doi.org/10.3390/nu15010195
work_keys_str_mv AT maugeriandrea theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT barchittamartina theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT favaragiuliana theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT lamastraclaudia theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT larosamariaclara theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT magnanosanlioroberta theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT agodiantonella theapplicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT maugeriandrea applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT barchittamartina applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT favaragiuliana applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT lamastraclaudia applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT larosamariaclara applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT magnanosanlioroberta applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns
AT agodiantonella applicationofclusteringonprincipalcomponentsfornutritionalepidemiologyaworkflowtoderivedietarypatterns