Cargando…
Machine learning approaches to the social determinants of health in the health and retirement study
BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how m...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769116/ https://www.ncbi.nlm.nih.gov/pubmed/29349278 http://dx.doi.org/10.1016/j.ssmph.2017.11.008 |
_version_ | 1783292839116931072 |
---|---|
author | Seligman, Benjamin Tuljapurkar, Shripad Rehkopf, David |
author_facet | Seligman, Benjamin Tuljapurkar, Shripad Rehkopf, David |
author_sort | Seligman, Benjamin |
collection | PubMed |
description | BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how machine learning may add to our understanding of social determinants of health using data from the Health and Retirement Study. METHODS: A linear regression of age and gender, and a parsimonious theory-based regression additionally incorporating income, wealth, and education, were used to predict systolic blood pressure, body mass index, waist circumference, and telomere length. Prediction, fit, and interpretability were compared across four machine learning methods: linear regression, penalized regressions, random forests, and neural networks. RESULTS: All models had poor out-of-sample prediction. Most machine learning models performed similarly to the simpler models. However, neural networks greatly outperformed the three other methods. Neural networks also had good fit to the data (R(2) between 0.4–0.6, versus <0.3 for all others). Across machine learning models, nine variables were frequently selected or highly weighted as predictors: dental visits, current smoking, self-rated health, serial-seven subtractions, probability of receiving an inheritance, probability of leaving an inheritance of at least $10,000, number of children ever born, African-American race, and gender. DISCUSSION: Some of the machine learning methods do not improve prediction or fit beyond simpler models, however, neural networks performed well. The predictors identified across models suggest underlying social factors that are important predictors of biological indicators of chronic disease, and that the non-linear and interactive relationships between variables fundamental to the neural network approach may be important to consider. |
format | Online Article Text |
id | pubmed-5769116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-57691162018-01-18 Machine learning approaches to the social determinants of health in the health and retirement study Seligman, Benjamin Tuljapurkar, Shripad Rehkopf, David SSM Popul Health Article BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how machine learning may add to our understanding of social determinants of health using data from the Health and Retirement Study. METHODS: A linear regression of age and gender, and a parsimonious theory-based regression additionally incorporating income, wealth, and education, were used to predict systolic blood pressure, body mass index, waist circumference, and telomere length. Prediction, fit, and interpretability were compared across four machine learning methods: linear regression, penalized regressions, random forests, and neural networks. RESULTS: All models had poor out-of-sample prediction. Most machine learning models performed similarly to the simpler models. However, neural networks greatly outperformed the three other methods. Neural networks also had good fit to the data (R(2) between 0.4–0.6, versus <0.3 for all others). Across machine learning models, nine variables were frequently selected or highly weighted as predictors: dental visits, current smoking, self-rated health, serial-seven subtractions, probability of receiving an inheritance, probability of leaving an inheritance of at least $10,000, number of children ever born, African-American race, and gender. DISCUSSION: Some of the machine learning methods do not improve prediction or fit beyond simpler models, however, neural networks performed well. The predictors identified across models suggest underlying social factors that are important predictors of biological indicators of chronic disease, and that the non-linear and interactive relationships between variables fundamental to the neural network approach may be important to consider. Elsevier 2017-11-21 /pmc/articles/PMC5769116/ /pubmed/29349278 http://dx.doi.org/10.1016/j.ssmph.2017.11.008 Text en © 2017 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article Seligman, Benjamin Tuljapurkar, Shripad Rehkopf, David Machine learning approaches to the social determinants of health in the health and retirement study |
title | Machine learning approaches to the social determinants of health in the health and retirement study |
title_full | Machine learning approaches to the social determinants of health in the health and retirement study |
title_fullStr | Machine learning approaches to the social determinants of health in the health and retirement study |
title_full_unstemmed | Machine learning approaches to the social determinants of health in the health and retirement study |
title_short | Machine learning approaches to the social determinants of health in the health and retirement study |
title_sort | machine learning approaches to the social determinants of health in the health and retirement study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769116/ https://www.ncbi.nlm.nih.gov/pubmed/29349278 http://dx.doi.org/10.1016/j.ssmph.2017.11.008 |
work_keys_str_mv | AT seligmanbenjamin machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy AT tuljapurkarshripad machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy AT rehkopfdavid machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy |