Cargando…

Predicting risk of obesity in overweight adults using interpretable machine learning algorithms

OBJECTIVE: To screen for predictive obesity factors in overweight populations using an optimal and interpretable machine learning algorithm. METHODS: This cross-sectional study was conducted between June 2011 and January 2012. The participants were randomly selected using a simple random sampling te...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lin, Wei, Shi, Songchang, Huang, Huibin, Wen, Junping, Chen, Gang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Endocrinology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693451/ https://www.ncbi.nlm.nih.gov/pubmed/38047114 http://dx.doi.org/10.3389/fendo.2023.1292167

_version_	1785153164238913536
author	Lin, Wei Shi, Songchang Huang, Huibin Wen, Junping Chen, Gang
author_facet	Lin, Wei Shi, Songchang Huang, Huibin Wen, Junping Chen, Gang
author_sort	Lin, Wei
collection	PubMed
description	OBJECTIVE: To screen for predictive obesity factors in overweight populations using an optimal and interpretable machine learning algorithm. METHODS: This cross-sectional study was conducted between June 2011 and January 2012. The participants were randomly selected using a simple random sampling technique. Seven commonly used machine learning methods were employed to construct obesity risk prediction models. A total of 5,236 Chinese participants from Ningde City, Fujian Province, Southeast China, participated in this study. The best model was selected through appropriate verification and validation and suitably explained. Subsequently, a minimal set of significant predictors was identified. The Shapley additive explanation force plot was used to illustrate the model at the individual level. RESULTS: Machine learning models for predicting obesity have demonstrated strong performance, with CatBoost emerging as the most effective in both model validity and net clinical benefit. Specifically, the CatBoost algorithm yielded the highest scores, registering 0.91 in the training set and an impressive 0.83 in the test set. This was further corroborated by the area under the curve (AUC) metrics, where CatBoost achieved 0.95 for the training set and 0.87 for the test set. In a rigorous five-fold cross-validation, the AUC for the CatBoost model ranged between 0.84 and 0.91, with an average AUC of ROC at 0.87 ± 0.022. Key predictors identified within these models included waist circumference, hip circumference, female gender, and systolic blood pressure. CONCLUSION: CatBoost may be the best machine learning method for prediction. Combining Shapley’s additive explanation and machine learning methods can be effective in identifying disease risk factors for prevention and control.
format	Online Article Text
id	pubmed-10693451
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-106934512023-12-03 Predicting risk of obesity in overweight adults using interpretable machine learning algorithms Lin, Wei Shi, Songchang Huang, Huibin Wen, Junping Chen, Gang Front Endocrinol (Lausanne) Endocrinology OBJECTIVE: To screen for predictive obesity factors in overweight populations using an optimal and interpretable machine learning algorithm. METHODS: This cross-sectional study was conducted between June 2011 and January 2012. The participants were randomly selected using a simple random sampling technique. Seven commonly used machine learning methods were employed to construct obesity risk prediction models. A total of 5,236 Chinese participants from Ningde City, Fujian Province, Southeast China, participated in this study. The best model was selected through appropriate verification and validation and suitably explained. Subsequently, a minimal set of significant predictors was identified. The Shapley additive explanation force plot was used to illustrate the model at the individual level. RESULTS: Machine learning models for predicting obesity have demonstrated strong performance, with CatBoost emerging as the most effective in both model validity and net clinical benefit. Specifically, the CatBoost algorithm yielded the highest scores, registering 0.91 in the training set and an impressive 0.83 in the test set. This was further corroborated by the area under the curve (AUC) metrics, where CatBoost achieved 0.95 for the training set and 0.87 for the test set. In a rigorous five-fold cross-validation, the AUC for the CatBoost model ranged between 0.84 and 0.91, with an average AUC of ROC at 0.87 ± 0.022. Key predictors identified within these models included waist circumference, hip circumference, female gender, and systolic blood pressure. CONCLUSION: CatBoost may be the best machine learning method for prediction. Combining Shapley’s additive explanation and machine learning methods can be effective in identifying disease risk factors for prevention and control. Frontiers Media S.A. 2023-11-17 /pmc/articles/PMC10693451/ /pubmed/38047114 http://dx.doi.org/10.3389/fendo.2023.1292167 Text en Copyright © 2023 Lin, Shi, Huang, Wen and Chen https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Endocrinology Lin, Wei Shi, Songchang Huang, Huibin Wen, Junping Chen, Gang Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title	Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title_full	Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title_fullStr	Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title_full_unstemmed	Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title_short	Predicting risk of obesity in overweight adults using interpretable machine learning algorithms
title_sort	predicting risk of obesity in overweight adults using interpretable machine learning algorithms
topic	Endocrinology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10693451/ https://www.ncbi.nlm.nih.gov/pubmed/38047114 http://dx.doi.org/10.3389/fendo.2023.1292167
work_keys_str_mv	AT linwei predictingriskofobesityinoverweightadultsusinginterpretablemachinelearningalgorithms AT shisongchang predictingriskofobesityinoverweightadultsusinginterpretablemachinelearningalgorithms AT huanghuibin predictingriskofobesityinoverweightadultsusinginterpretablemachinelearningalgorithms AT wenjunping predictingriskofobesityinoverweightadultsusinginterpretablemachinelearningalgorithms AT chengang predictingriskofobesityinoverweightadultsusinginterpretablemachinelearningalgorithms

Predicting risk of obesity in overweight adults using interpretable machine learning algorithms

Ejemplares similares