Cargando…

Machine learning approaches to the social determinants of health in the health and retirement study

BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how m...

Descripción completa

Detalles Bibliográficos
Autores principales: Seligman, Benjamin, Tuljapurkar, Shripad, Rehkopf, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769116/
https://www.ncbi.nlm.nih.gov/pubmed/29349278
http://dx.doi.org/10.1016/j.ssmph.2017.11.008
_version_ 1783292839116931072
author Seligman, Benjamin
Tuljapurkar, Shripad
Rehkopf, David
author_facet Seligman, Benjamin
Tuljapurkar, Shripad
Rehkopf, David
author_sort Seligman, Benjamin
collection PubMed
description BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how machine learning may add to our understanding of social determinants of health using data from the Health and Retirement Study. METHODS: A linear regression of age and gender, and a parsimonious theory-based regression additionally incorporating income, wealth, and education, were used to predict systolic blood pressure, body mass index, waist circumference, and telomere length. Prediction, fit, and interpretability were compared across four machine learning methods: linear regression, penalized regressions, random forests, and neural networks. RESULTS: All models had poor out-of-sample prediction. Most machine learning models performed similarly to the simpler models. However, neural networks greatly outperformed the three other methods. Neural networks also had good fit to the data (R(2) between 0.4–0.6, versus <0.3 for all others). Across machine learning models, nine variables were frequently selected or highly weighted as predictors: dental visits, current smoking, self-rated health, serial-seven subtractions, probability of receiving an inheritance, probability of leaving an inheritance of at least $10,000, number of children ever born, African-American race, and gender. DISCUSSION: Some of the machine learning methods do not improve prediction or fit beyond simpler models, however, neural networks performed well. The predictors identified across models suggest underlying social factors that are important predictors of biological indicators of chronic disease, and that the non-linear and interactive relationships between variables fundamental to the neural network approach may be important to consider.
format Online
Article
Text
id pubmed-5769116
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-57691162018-01-18 Machine learning approaches to the social determinants of health in the health and retirement study Seligman, Benjamin Tuljapurkar, Shripad Rehkopf, David SSM Popul Health Article BACKGROUND: Social and economic factors are important predictors of health and of recognized importance for health systems. However, machine learning, used elsewhere in the biomedical literature, has not been extensively applied to study relationships between society and health. We investigate how machine learning may add to our understanding of social determinants of health using data from the Health and Retirement Study. METHODS: A linear regression of age and gender, and a parsimonious theory-based regression additionally incorporating income, wealth, and education, were used to predict systolic blood pressure, body mass index, waist circumference, and telomere length. Prediction, fit, and interpretability were compared across four machine learning methods: linear regression, penalized regressions, random forests, and neural networks. RESULTS: All models had poor out-of-sample prediction. Most machine learning models performed similarly to the simpler models. However, neural networks greatly outperformed the three other methods. Neural networks also had good fit to the data (R(2) between 0.4–0.6, versus <0.3 for all others). Across machine learning models, nine variables were frequently selected or highly weighted as predictors: dental visits, current smoking, self-rated health, serial-seven subtractions, probability of receiving an inheritance, probability of leaving an inheritance of at least $10,000, number of children ever born, African-American race, and gender. DISCUSSION: Some of the machine learning methods do not improve prediction or fit beyond simpler models, however, neural networks performed well. The predictors identified across models suggest underlying social factors that are important predictors of biological indicators of chronic disease, and that the non-linear and interactive relationships between variables fundamental to the neural network approach may be important to consider. Elsevier 2017-11-21 /pmc/articles/PMC5769116/ /pubmed/29349278 http://dx.doi.org/10.1016/j.ssmph.2017.11.008 Text en © 2017 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Seligman, Benjamin
Tuljapurkar, Shripad
Rehkopf, David
Machine learning approaches to the social determinants of health in the health and retirement study
title Machine learning approaches to the social determinants of health in the health and retirement study
title_full Machine learning approaches to the social determinants of health in the health and retirement study
title_fullStr Machine learning approaches to the social determinants of health in the health and retirement study
title_full_unstemmed Machine learning approaches to the social determinants of health in the health and retirement study
title_short Machine learning approaches to the social determinants of health in the health and retirement study
title_sort machine learning approaches to the social determinants of health in the health and retirement study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5769116/
https://www.ncbi.nlm.nih.gov/pubmed/29349278
http://dx.doi.org/10.1016/j.ssmph.2017.11.008
work_keys_str_mv AT seligmanbenjamin machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy
AT tuljapurkarshripad machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy
AT rehkopfdavid machinelearningapproachestothesocialdeterminantsofhealthinthehealthandretirementstudy