Cargando…

Predicting self-perceived general health status using machine learning: an external exposome study

BACKGROUND: Self-perceived general health (SPGH) is a general health indicator commonly used in epidemiological research and is associated with a wide range of exposures from different domains. However, most studies on SPGH only investigated a limited set of exposures and did not take the entire ext...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoekstra, Jurriaan, Lenssen, Esther S., Wong, Albert, Loef, Bette, Herber, Gerrie-Cor M., Boshuizen, Hendriek C., Strak, Maciek, Verschuren, W. M. Monique, Janssen, Nicole A. H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230687/
https://www.ncbi.nlm.nih.gov/pubmed/37259056
http://dx.doi.org/10.1186/s12889-023-15962-8
_version_ 1785051588336812032
author Hoekstra, Jurriaan
Lenssen, Esther S.
Wong, Albert
Loef, Bette
Herber, Gerrie-Cor M.
Boshuizen, Hendriek C.
Strak, Maciek
Verschuren, W. M. Monique
Janssen, Nicole A. H.
author_facet Hoekstra, Jurriaan
Lenssen, Esther S.
Wong, Albert
Loef, Bette
Herber, Gerrie-Cor M.
Boshuizen, Hendriek C.
Strak, Maciek
Verschuren, W. M. Monique
Janssen, Nicole A. H.
author_sort Hoekstra, Jurriaan
collection PubMed
description BACKGROUND: Self-perceived general health (SPGH) is a general health indicator commonly used in epidemiological research and is associated with a wide range of exposures from different domains. However, most studies on SPGH only investigated a limited set of exposures and did not take the entire external exposome into account. We aimed to develop predictive models for SPGH based on exposome datasets using machine learning techniques and identify the most important predictors of poor SPGH status. METHODS: Random forest (RF) was used on two datasets based on personal characteristics from the 2012 and 2016 editions of the Dutch national health survey, enriched with environmental and neighborhood characteristics. Model performance was determined using the area under the curve (AUC) score. The most important predictors were identified using a variable importance procedure and individual effects of exposures using partial dependence and accumulated local effect plots. The final 2012 dataset contained information on 199,840 individuals and 81 variables, whereas the final 2016 dataset had 244,557 individuals with 91 variables. RESULTS: Our RF models had overall good predictive performance (2012: AUC = 0.864 (CI: 0.852–0.876); 2016: AUC = 0.890 (CI: 0.883–0.896)) and the most important predictors were “Control of own life”, “Physical activity”, “Loneliness” and “Making ends meet”. Subjects who felt insufficiently in control of their own life, scored high on the De Jong-Gierveld loneliness scale or had difficulty in making ends meet were more likely to have poor SPGH status, whereas increased physical activity per week reduced the probability of poor SPGH. We observed associations between some neighborhood and environmental characteristics, but these variables did not contribute to the overall predictive strength of the models. CONCLUSIONS: This study identified that within an external exposome dataset, the most important predictors for SPGH status are related to mental wellbeing, physical exercise, loneliness, and financial status. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-023-15962-8.
format Online
Article
Text
id pubmed-10230687
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-102306872023-06-01 Predicting self-perceived general health status using machine learning: an external exposome study Hoekstra, Jurriaan Lenssen, Esther S. Wong, Albert Loef, Bette Herber, Gerrie-Cor M. Boshuizen, Hendriek C. Strak, Maciek Verschuren, W. M. Monique Janssen, Nicole A. H. BMC Public Health Research BACKGROUND: Self-perceived general health (SPGH) is a general health indicator commonly used in epidemiological research and is associated with a wide range of exposures from different domains. However, most studies on SPGH only investigated a limited set of exposures and did not take the entire external exposome into account. We aimed to develop predictive models for SPGH based on exposome datasets using machine learning techniques and identify the most important predictors of poor SPGH status. METHODS: Random forest (RF) was used on two datasets based on personal characteristics from the 2012 and 2016 editions of the Dutch national health survey, enriched with environmental and neighborhood characteristics. Model performance was determined using the area under the curve (AUC) score. The most important predictors were identified using a variable importance procedure and individual effects of exposures using partial dependence and accumulated local effect plots. The final 2012 dataset contained information on 199,840 individuals and 81 variables, whereas the final 2016 dataset had 244,557 individuals with 91 variables. RESULTS: Our RF models had overall good predictive performance (2012: AUC = 0.864 (CI: 0.852–0.876); 2016: AUC = 0.890 (CI: 0.883–0.896)) and the most important predictors were “Control of own life”, “Physical activity”, “Loneliness” and “Making ends meet”. Subjects who felt insufficiently in control of their own life, scored high on the De Jong-Gierveld loneliness scale or had difficulty in making ends meet were more likely to have poor SPGH status, whereas increased physical activity per week reduced the probability of poor SPGH. We observed associations between some neighborhood and environmental characteristics, but these variables did not contribute to the overall predictive strength of the models. CONCLUSIONS: This study identified that within an external exposome dataset, the most important predictors for SPGH status are related to mental wellbeing, physical exercise, loneliness, and financial status. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-023-15962-8. BioMed Central 2023-05-31 /pmc/articles/PMC10230687/ /pubmed/37259056 http://dx.doi.org/10.1186/s12889-023-15962-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hoekstra, Jurriaan
Lenssen, Esther S.
Wong, Albert
Loef, Bette
Herber, Gerrie-Cor M.
Boshuizen, Hendriek C.
Strak, Maciek
Verschuren, W. M. Monique
Janssen, Nicole A. H.
Predicting self-perceived general health status using machine learning: an external exposome study
title Predicting self-perceived general health status using machine learning: an external exposome study
title_full Predicting self-perceived general health status using machine learning: an external exposome study
title_fullStr Predicting self-perceived general health status using machine learning: an external exposome study
title_full_unstemmed Predicting self-perceived general health status using machine learning: an external exposome study
title_short Predicting self-perceived general health status using machine learning: an external exposome study
title_sort predicting self-perceived general health status using machine learning: an external exposome study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10230687/
https://www.ncbi.nlm.nih.gov/pubmed/37259056
http://dx.doi.org/10.1186/s12889-023-15962-8
work_keys_str_mv AT hoekstrajurriaan predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT lenssenesthers predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT wongalbert predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT loefbette predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT herbergerriecorm predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT boshuizenhendriekc predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT strakmaciek predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT verschurenwmmonique predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy
AT janssennicoleah predictingselfperceivedgeneralhealthstatususingmachinelearninganexternalexposomestudy