Cargando…
Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis
Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069304/ https://www.ncbi.nlm.nih.gov/pubmed/33918760 http://dx.doi.org/10.3390/ijerph18083966 |
_version_ | 1783683205819269120 |
---|---|
author | Cheng, Xiaolu Lin, Shuo-yu Liu, Jin Liu, Shiyong Zhang, Jun Nie, Peng Fuemmeler, Bernard F. Wang, Youfa Xue, Hong |
author_facet | Cheng, Xiaolu Lin, Shuo-yu Liu, Jin Liu, Shiyong Zhang, Jun Nie, Peng Fuemmeler, Bernard F. Wang, Youfa Xue, Hong |
author_sort | Cheng, Xiaolu |
collection | PubMed |
description | Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status and (2) assess the performance and predictive power of a set of popular machine learning and traditional statistical methods. Methods: National Health and Nutrition Examination Survey (NHANES, 2003 to 2006) data were used. A total of 7162 participants met our inclusion criteria (3682 males and 3480 females), with average age ranging from 48.6 (normal weight) to 52.1 years old (overweight). Eleven classifying algorithms—including logistic regression, naïve Bayes, Radial Basis Function (RBF), local k-nearest neighbors (k-NN), classification via regression (CVR), random subspace, decision table, multiobjective evolutionary fuzzy classifier, random tree, J48, and multilayer perceptron—were implemented and evaluated, and they were compared with traditional logistic regression model estimates. Results: With physical activity and basic demographic status, of all methods analyzed, the random subspace classifier algorithm achieved the highest overall accuracy and area under the receiver operating characteristic (ROC) curve (AUC). The duration of vigorous-intensity activity in one week and the duration of moderate-intensity activity in one week were important attributes. In general, most algorithms showed similar performance. Logistic regression was middle-ranking in terms of overall accuracy, sensitivity, specificity, and AUC among all methods. Conclusions: Physical activity was an important factor in predicting weight status, with gender, age, and race/ethnicity being less but still essential factors associated with weight outcomes. Tailored intervention policies and programs should target the differences rooted in these demographic factors to curb the increase in the prevalence of obesity and reduce disparities among sub-demographic populations. |
format | Online Article Text |
id | pubmed-8069304 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-80693042021-04-26 Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis Cheng, Xiaolu Lin, Shuo-yu Liu, Jin Liu, Shiyong Zhang, Jun Nie, Peng Fuemmeler, Bernard F. Wang, Youfa Xue, Hong Int J Environ Res Public Health Article Background: Obesity prevalence has become one of the most prominent issues in global public health. Physical activity has been recognized as a key player in the obesity epidemic. Objectives: The objectives of this study are to (1) examine the relationship between physical activity and weight status and (2) assess the performance and predictive power of a set of popular machine learning and traditional statistical methods. Methods: National Health and Nutrition Examination Survey (NHANES, 2003 to 2006) data were used. A total of 7162 participants met our inclusion criteria (3682 males and 3480 females), with average age ranging from 48.6 (normal weight) to 52.1 years old (overweight). Eleven classifying algorithms—including logistic regression, naïve Bayes, Radial Basis Function (RBF), local k-nearest neighbors (k-NN), classification via regression (CVR), random subspace, decision table, multiobjective evolutionary fuzzy classifier, random tree, J48, and multilayer perceptron—were implemented and evaluated, and they were compared with traditional logistic regression model estimates. Results: With physical activity and basic demographic status, of all methods analyzed, the random subspace classifier algorithm achieved the highest overall accuracy and area under the receiver operating characteristic (ROC) curve (AUC). The duration of vigorous-intensity activity in one week and the duration of moderate-intensity activity in one week were important attributes. In general, most algorithms showed similar performance. Logistic regression was middle-ranking in terms of overall accuracy, sensitivity, specificity, and AUC among all methods. Conclusions: Physical activity was an important factor in predicting weight status, with gender, age, and race/ethnicity being less but still essential factors associated with weight outcomes. Tailored intervention policies and programs should target the differences rooted in these demographic factors to curb the increase in the prevalence of obesity and reduce disparities among sub-demographic populations. MDPI 2021-04-09 /pmc/articles/PMC8069304/ /pubmed/33918760 http://dx.doi.org/10.3390/ijerph18083966 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Cheng, Xiaolu Lin, Shuo-yu Liu, Jin Liu, Shiyong Zhang, Jun Nie, Peng Fuemmeler, Bernard F. Wang, Youfa Xue, Hong Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title | Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title_full | Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title_fullStr | Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title_full_unstemmed | Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title_short | Does Physical Activity Predict Obesity—A Machine Learning and Statistical Method-Based Analysis |
title_sort | does physical activity predict obesity—a machine learning and statistical method-based analysis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069304/ https://www.ncbi.nlm.nih.gov/pubmed/33918760 http://dx.doi.org/10.3390/ijerph18083966 |
work_keys_str_mv | AT chengxiaolu doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT linshuoyu doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT liujin doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT liushiyong doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT zhangjun doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT niepeng doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT fuemmelerbernardf doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT wangyoufa doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis AT xuehong doesphysicalactivitypredictobesityamachinelearningandstatisticalmethodbasedanalysis |