Cargando…

Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018

Obesity is strongly associated with multiple risk factors. It is significantly contributing to an increased risk of chronic disease morbidity and mortality worldwide. There are various challenges to better understand the association between risk factors and the occurrence of obesity. The traditional...

Descripción completa

Detalles Bibliográficos
Autores principales: Thamrin, Sri Astuti, Arsyad, Dian Sidik, Kuswanto, Hedi, Lawi, Armin, Nasir, Sudirman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8255629/
https://www.ncbi.nlm.nih.gov/pubmed/34235168
http://dx.doi.org/10.3389/fnut.2021.669155
_version_ 1783717946808336384
author Thamrin, Sri Astuti
Arsyad, Dian Sidik
Kuswanto, Hedi
Lawi, Armin
Nasir, Sudirman
author_facet Thamrin, Sri Astuti
Arsyad, Dian Sidik
Kuswanto, Hedi
Lawi, Armin
Nasir, Sudirman
author_sort Thamrin, Sri Astuti
collection PubMed
description Obesity is strongly associated with multiple risk factors. It is significantly contributing to an increased risk of chronic disease morbidity and mortality worldwide. There are various challenges to better understand the association between risk factors and the occurrence of obesity. The traditional regression approach limits analysis to a small number of predictors and imposes assumptions of independence and linearity. Machine Learning (ML) methods are an alternative that provide information with a unique approach to the application stage of data analysis on obesity. This study aims to assess the ability of ML methods, namely Logistic Regression, Classification and Regression Trees (CART), and Naïve Bayes to identify the presence of obesity using publicly available health data, using a novel approach with sophisticated ML methods to predict obesity as an attempt to go beyond traditional prediction models, and to compare the performance of three different methods. Meanwhile, the main objective of this study is to establish a set of risk factors for obesity in adults among the available study variables. Furthermore, we address data imbalance using Synthetic Minority Oversampling Technique (SMOTE) to predict obesity status based on risk factors available in the dataset. This study indicates that the Logistic Regression method shows the highest performance. Nevertheless, kappa coefficients show only moderate concordance between predicted and measured obesity. Location, marital status, age groups, education, sweet drinks, fatty/oily foods, grilled foods, preserved foods, seasoning powders, soft/carbonated drinks, alcoholic drinks, mental emotional disorders, diagnosed hypertension, physical activity, smoking, and fruit and vegetables consumptions are significant in predicting obesity status in adults. Identifying these risk factors could inform health authorities in designing or modifying existing policies for better controlling chronic diseases especially in relation to risk factors associated with obesity. Moreover, applying ML methods on publicly available health data, such as Indonesian Basic Health Research (RISKESDAS) is a promising strategy to fill the gap for a more robust understanding of the associations of multiple risk factors in predicting health outcomes.
format Online
Article
Text
id pubmed-8255629
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82556292021-07-06 Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018 Thamrin, Sri Astuti Arsyad, Dian Sidik Kuswanto, Hedi Lawi, Armin Nasir, Sudirman Front Nutr Nutrition Obesity is strongly associated with multiple risk factors. It is significantly contributing to an increased risk of chronic disease morbidity and mortality worldwide. There are various challenges to better understand the association between risk factors and the occurrence of obesity. The traditional regression approach limits analysis to a small number of predictors and imposes assumptions of independence and linearity. Machine Learning (ML) methods are an alternative that provide information with a unique approach to the application stage of data analysis on obesity. This study aims to assess the ability of ML methods, namely Logistic Regression, Classification and Regression Trees (CART), and Naïve Bayes to identify the presence of obesity using publicly available health data, using a novel approach with sophisticated ML methods to predict obesity as an attempt to go beyond traditional prediction models, and to compare the performance of three different methods. Meanwhile, the main objective of this study is to establish a set of risk factors for obesity in adults among the available study variables. Furthermore, we address data imbalance using Synthetic Minority Oversampling Technique (SMOTE) to predict obesity status based on risk factors available in the dataset. This study indicates that the Logistic Regression method shows the highest performance. Nevertheless, kappa coefficients show only moderate concordance between predicted and measured obesity. Location, marital status, age groups, education, sweet drinks, fatty/oily foods, grilled foods, preserved foods, seasoning powders, soft/carbonated drinks, alcoholic drinks, mental emotional disorders, diagnosed hypertension, physical activity, smoking, and fruit and vegetables consumptions are significant in predicting obesity status in adults. Identifying these risk factors could inform health authorities in designing or modifying existing policies for better controlling chronic diseases especially in relation to risk factors associated with obesity. Moreover, applying ML methods on publicly available health data, such as Indonesian Basic Health Research (RISKESDAS) is a promising strategy to fill the gap for a more robust understanding of the associations of multiple risk factors in predicting health outcomes. Frontiers Media S.A. 2021-06-21 /pmc/articles/PMC8255629/ /pubmed/34235168 http://dx.doi.org/10.3389/fnut.2021.669155 Text en Copyright © 2021 Thamrin, Arsyad, Kuswanto, Lawi and Nasir. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Nutrition
Thamrin, Sri Astuti
Arsyad, Dian Sidik
Kuswanto, Hedi
Lawi, Armin
Nasir, Sudirman
Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title_full Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title_fullStr Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title_full_unstemmed Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title_short Predicting Obesity in Adults Using Machine Learning Techniques: An Analysis of Indonesian Basic Health Research 2018
title_sort predicting obesity in adults using machine learning techniques: an analysis of indonesian basic health research 2018
topic Nutrition
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8255629/
https://www.ncbi.nlm.nih.gov/pubmed/34235168
http://dx.doi.org/10.3389/fnut.2021.669155
work_keys_str_mv AT thamrinsriastuti predictingobesityinadultsusingmachinelearningtechniquesananalysisofindonesianbasichealthresearch2018
AT arsyaddiansidik predictingobesityinadultsusingmachinelearningtechniquesananalysisofindonesianbasichealthresearch2018
AT kuswantohedi predictingobesityinadultsusingmachinelearningtechniquesananalysisofindonesianbasichealthresearch2018
AT lawiarmin predictingobesityinadultsusingmachinelearningtechniquesananalysisofindonesianbasichealthresearch2018
AT nasirsudirman predictingobesityinadultsusingmachinelearningtechniquesananalysisofindonesianbasichealthresearch2018