Cargando…

Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis

Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Xiaolu, Lin, Shuo-yu, Liu, Jin, Liu, Shiyong, Zhang, Jun, Nie, Peng, Fuemmeler, Bernard F., Wang, Youfa, Xue, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069304/
https://www.ncbi.nlm.nih.gov/pubmed/33918760
http://dx.doi.org/10.3390/ijerph18083966
_version_ 1783683205819269120
author Cheng, Xiaolu
Lin, Shuo-yu
Liu, Jin
Liu, Shiyong
Zhang, Jun
Nie, Peng
Fuemmeler, Bernard F.
Wang, Youfa
Xue, Hong
author_facet Cheng, Xiaolu
Lin, Shuo-yu
Liu, Jin
Liu, Shiyong
Zhang, Jun
Nie, Peng
Fuemmeler, Bernard F.
Wang, Youfa
Xue, Hong
author_sort Cheng, Xiaolu
collection PubMed
description Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status and (2) assess the performance and predictive power of a set of popular machine learning and traditional statistical methods. Methods: National Health and Nutrition Examination Survey (NHANES, 2003 to 2006) data were used. A total of 7162 participants met our inclusion criteria (3682 males and 3480 females), with average age ranging from 48.6 (normal weight) to 52.1 years old (overweight). Eleven classifying algorithms—including logistic regression, naïve Bayes, Radial Basis Function (RBF), local k-nearest neighbors (k-NN), classification via regression (CVR), random subspace, decision table, multiobjective evolutionary fuzzy classifier, random tree, J48, and multilayer perceptron—were implemented and evaluated, and they were compared with traditional logistic regression model estimates. Results: With physical activity and basic demographic status, of all methods analyzed, the random subspace classifier algorithm achieved the highest overall accuracy and area under the receiver operating characteristic (ROC) curve (AUC). The duration of vigorous-intensity activity in one week and the duration of moderate-intensity activity in one week were important attributes. In general, most algorithms showed similar performance. Logistic regression was middle-ranking in terms of overall accuracy, sensitivity, specificity, and AUC among all methods. Conclusions: Physical activity was an important factor in predicting weight status, with gender, age, and race/ethnicity being less but still essential factors associated with weight outcomes. Tailored intervention policies and programs should target the differences rooted in these demographic factors to curb the increase in the prevalence of obesity and reduce disparities among sub-demographic populations.
format Online
Article
Text
id pubmed-8069304
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80693042021-04-26 Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis Cheng, Xiaolu Lin, Shuo-yu Liu, Jin Liu, Shiyong Zhang, Jun Nie, Peng Fuemmeler, Bernard F. Wang, Youfa Xue, Hong Int J Environ Res Public Health Article Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status and (2) assess the performance and predictive power of a set of popular machine learning and traditional statistical methods. Methods: National Health and Nutrition Examination Survey (NHANES, 2003 to 2006) data were used. A total of 7162 participants met our inclusion criteria (3682 males and 3480 females), with average age ranging from 48.6 (normal weight) to 52.1 years old (overweight). Eleven classifying algorithms—including logistic regression, naïve Bayes, Radial Basis Function (RBF), local k-nearest neighbors (k-NN), classification via regression (CVR), random subspace, decision table, multiobjective evolutionary fuzzy classifier, random tree, J48, and multilayer perceptron—were implemented and evaluated, and they were compared with traditional logistic regression model estimates. Results: With physical activity and basic demographic status, of all methods analyzed, the random subspace classifier algorithm achieved the highest overall accuracy and area under the receiver operating characteristic (ROC) curve (AUC). The duration of vigorous-intensity activity in one week and the duration of moderate-intensity activity in one week were important attributes. In general, most algorithms showed similar performance. Logistic regression was middle-ranking in terms of overall accuracy, sensitivity, specificity, and AUC among all methods. Conclusions: Physical activity was an important factor in predicting weight status, with gender, age, and race/ethnicity being less but still essential factors associated with weight outcomes. Tailored intervention policies and programs should target the differences rooted in these demographic factors to curb the increase in the prevalence of obesity and reduce disparities among sub-demographic populations. MDPI 2021-04-09 /pmc/articles/PMC8069304/ /pubmed/33918760 http://dx.doi.org/10.3390/ijerph18083966 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Cheng, Xiaolu
Lin, Shuo-yu
Liu, Jin
Liu, Shiyong
Zhang, Jun
Nie, Peng
Fuemmeler, Bernard F.
Wang, Youfa
Xue, Hong
Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title_full Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title_fullStr Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title_full_unstemmed Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title_short Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
title_sort does physical activity predict obesity—a machine learning and statistical method-based analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069304/
https://www.ncbi.nlm.nih.gov/pubmed/33918760
http://dx.doi.org/10.3390/ijerph18083966
work_keys_str_mv AT chengxiaolu doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT linshuoyu doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT liujin doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT liushiyong doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT zhangjun doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT niepeng doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT fuemmelerbernardf doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT wangyoufa doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis
AT xuehong doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis