Cargando…

Diet-related chronic disease in the northeastern United States: a model-based clustering approach

BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variab...

Descripción completa

Detalles Bibliográficos
Autores principales: Flynt, Abby, Daepp, Madeleine I. G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559302/
https://www.ncbi.nlm.nih.gov/pubmed/26338084
http://dx.doi.org/10.1186/s12942-015-0017-5
_version_ 1782388753440964608
author Flynt, Abby
Daepp, Madeleine I. G.
author_facet Flynt, Abby
Daepp, Madeleine I. G.
author_sort Flynt, Abby
collection PubMed
description BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variable selection to predict rates of obesity and diabetes. We test this method through an application in the northeastern United States. METHODS: We use model-based clustering, an unsupervised learning approach, to find latent clusters of similar US counties based on a set of socioeconomic, demographic, and environmental variables chosen through the process of variable selection. We then use Analysis of Variance and Post-hoc Tukey comparisons to examine differences in rates of obesity and diabetes for the clusters from the resulting clustering solution. RESULTS: We find access to supermarkets, median household income, population density and socioeconomic status to be important in clustering the counties of two northeastern states. The results of the cluster analysis can be used to identify two sets of counties with significantly lower rates of diet-related chronic disease than those observed in the other identified clusters. These relatively healthy clusters are distinguished by the large central and large fringe metropolitan areas contained in their component counties. However, the relationship of socio-demographic factors and diet-related chronic disease is more complicated than previous research would suggest. Additionally, we find evidence of low food access in two clusters of counties adjacent to large central and fringe metropolitan areas. While food access has previously been seen as a problem of inner-city or remote rural areas, this study offers preliminary evidence of declining food access in suburban areas. CONCLUSIONS: Model-based clustering with variable selection offers a new approach to the analysis of socioeconomic, demographic, and environmental data for diet-related chronic disease prediction. In a test application to two northeastern states, this method allows us to identify two sets of metropolitan counties with significantly lower diet-related chronic disease rates than those observed in most rural and suburban areas. Our method could be applied to larger geographic areas or other countries with comparable data sets, offering a promising method for researchers interested in the global increase in diet-related chronic disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-015-0017-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4559302
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45593022015-09-04 Diet-related chronic disease in the northeastern United States: a model-based clustering approach Flynt, Abby Daepp, Madeleine I. G. Int J Health Geogr Methodology BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variable selection to predict rates of obesity and diabetes. We test this method through an application in the northeastern United States. METHODS: We use model-based clustering, an unsupervised learning approach, to find latent clusters of similar US counties based on a set of socioeconomic, demographic, and environmental variables chosen through the process of variable selection. We then use Analysis of Variance and Post-hoc Tukey comparisons to examine differences in rates of obesity and diabetes for the clusters from the resulting clustering solution. RESULTS: We find access to supermarkets, median household income, population density and socioeconomic status to be important in clustering the counties of two northeastern states. The results of the cluster analysis can be used to identify two sets of counties with significantly lower rates of diet-related chronic disease than those observed in the other identified clusters. These relatively healthy clusters are distinguished by the large central and large fringe metropolitan areas contained in their component counties. However, the relationship of socio-demographic factors and diet-related chronic disease is more complicated than previous research would suggest. Additionally, we find evidence of low food access in two clusters of counties adjacent to large central and fringe metropolitan areas. While food access has previously been seen as a problem of inner-city or remote rural areas, this study offers preliminary evidence of declining food access in suburban areas. CONCLUSIONS: Model-based clustering with variable selection offers a new approach to the analysis of socioeconomic, demographic, and environmental data for diet-related chronic disease prediction. In a test application to two northeastern states, this method allows us to identify two sets of metropolitan counties with significantly lower diet-related chronic disease rates than those observed in most rural and suburban areas. Our method could be applied to larger geographic areas or other countries with comparable data sets, offering a promising method for researchers interested in the global increase in diet-related chronic disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-015-0017-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-04 /pmc/articles/PMC4559302/ /pubmed/26338084 http://dx.doi.org/10.1186/s12942-015-0017-5 Text en © Flynt and Daepp. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Flynt, Abby
Daepp, Madeleine I. G.
Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title_full Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title_fullStr Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title_full_unstemmed Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title_short Diet-related chronic disease in the northeastern United States: a model-based clustering approach
title_sort diet-related chronic disease in the northeastern united states: a model-based clustering approach
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559302/
https://www.ncbi.nlm.nih.gov/pubmed/26338084
http://dx.doi.org/10.1186/s12942-015-0017-5
work_keys_str_mv AT flyntabby dietrelatedchronicdiseaseinthenortheasternunitedstatesamodelbasedclusteringapproach
AT daeppmadeleineig dietrelatedchronicdiseaseinthenortheasternunitedstatesamodelbasedclusteringapproach