Cargando…
Diet-related chronic disease in the northeastern United States: a model-based clustering approach
BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variab...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559302/ https://www.ncbi.nlm.nih.gov/pubmed/26338084 http://dx.doi.org/10.1186/s12942-015-0017-5 |
_version_ | 1782388753440964608 |
---|---|
author | Flynt, Abby Daepp, Madeleine I. G. |
author_facet | Flynt, Abby Daepp, Madeleine I. G. |
author_sort | Flynt, Abby |
collection | PubMed |
description | BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variable selection to predict rates of obesity and diabetes. We test this method through an application in the northeastern United States. METHODS: We use model-based clustering, an unsupervised learning approach, to find latent clusters of similar US counties based on a set of socioeconomic, demographic, and environmental variables chosen through the process of variable selection. We then use Analysis of Variance and Post-hoc Tukey comparisons to examine differences in rates of obesity and diabetes for the clusters from the resulting clustering solution. RESULTS: We find access to supermarkets, median household income, population density and socioeconomic status to be important in clustering the counties of two northeastern states. The results of the cluster analysis can be used to identify two sets of counties with significantly lower rates of diet-related chronic disease than those observed in the other identified clusters. These relatively healthy clusters are distinguished by the large central and large fringe metropolitan areas contained in their component counties. However, the relationship of socio-demographic factors and diet-related chronic disease is more complicated than previous research would suggest. Additionally, we find evidence of low food access in two clusters of counties adjacent to large central and fringe metropolitan areas. While food access has previously been seen as a problem of inner-city or remote rural areas, this study offers preliminary evidence of declining food access in suburban areas. CONCLUSIONS: Model-based clustering with variable selection offers a new approach to the analysis of socioeconomic, demographic, and environmental data for diet-related chronic disease prediction. In a test application to two northeastern states, this method allows us to identify two sets of metropolitan counties with significantly lower diet-related chronic disease rates than those observed in most rural and suburban areas. Our method could be applied to larger geographic areas or other countries with comparable data sets, offering a promising method for researchers interested in the global increase in diet-related chronic disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-015-0017-5) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4559302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45593022015-09-04 Diet-related chronic disease in the northeastern United States: a model-based clustering approach Flynt, Abby Daepp, Madeleine I. G. Int J Health Geogr Methodology BACKGROUND: Obesity and diabetes are global public health concerns. Studies indicate a relationship between socioeconomic, demographic and environmental variables and the spatial patterns of diet-related chronic disease. In this paper, we propose a methodology using model-based clustering and variable selection to predict rates of obesity and diabetes. We test this method through an application in the northeastern United States. METHODS: We use model-based clustering, an unsupervised learning approach, to find latent clusters of similar US counties based on a set of socioeconomic, demographic, and environmental variables chosen through the process of variable selection. We then use Analysis of Variance and Post-hoc Tukey comparisons to examine differences in rates of obesity and diabetes for the clusters from the resulting clustering solution. RESULTS: We find access to supermarkets, median household income, population density and socioeconomic status to be important in clustering the counties of two northeastern states. The results of the cluster analysis can be used to identify two sets of counties with significantly lower rates of diet-related chronic disease than those observed in the other identified clusters. These relatively healthy clusters are distinguished by the large central and large fringe metropolitan areas contained in their component counties. However, the relationship of socio-demographic factors and diet-related chronic disease is more complicated than previous research would suggest. Additionally, we find evidence of low food access in two clusters of counties adjacent to large central and fringe metropolitan areas. While food access has previously been seen as a problem of inner-city or remote rural areas, this study offers preliminary evidence of declining food access in suburban areas. CONCLUSIONS: Model-based clustering with variable selection offers a new approach to the analysis of socioeconomic, demographic, and environmental data for diet-related chronic disease prediction. In a test application to two northeastern states, this method allows us to identify two sets of metropolitan counties with significantly lower diet-related chronic disease rates than those observed in most rural and suburban areas. Our method could be applied to larger geographic areas or other countries with comparable data sets, offering a promising method for researchers interested in the global increase in diet-related chronic disease. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12942-015-0017-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-04 /pmc/articles/PMC4559302/ /pubmed/26338084 http://dx.doi.org/10.1186/s12942-015-0017-5 Text en © Flynt and Daepp. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Flynt, Abby Daepp, Madeleine I. G. Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title | Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title_full | Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title_fullStr | Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title_full_unstemmed | Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title_short | Diet-related chronic disease in the northeastern United States: a model-based clustering approach |
title_sort | diet-related chronic disease in the northeastern united states: a model-based clustering approach |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4559302/ https://www.ncbi.nlm.nih.gov/pubmed/26338084 http://dx.doi.org/10.1186/s12942-015-0017-5 |
work_keys_str_mv | AT flyntabby dietrelatedchronicdiseaseinthenortheasternunitedstatesamodelbasedclusteringapproach AT daeppmadeleineig dietrelatedchronicdiseaseinthenortheasternunitedstatesamodelbasedclusteringapproach |