Cargando…
Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models
BACKGROUND: We aimed to construct simple and practical metabolic syndrome (MetS) risk prediction models based on the data of inhabitants of Urumqi and to provide a methodological reference for the prevention and control of MetS. METHODS: This is a cross-sectional study conducted in the Xinjiang Uygu...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8822755/ https://www.ncbi.nlm.nih.gov/pubmed/35135534 http://dx.doi.org/10.1186/s12889-022-12617-y |
_version_ | 1784646664271691776 |
---|---|
author | Zhang, Yan Razbek, JAINA Li, Deyang Yang, Lei Bao, Liangliang Xia, Wenjun Mao, Hongkai Daken, Mayisha Zhang, Xiaoxu Cao, Mingqin |
author_facet | Zhang, Yan Razbek, JAINA Li, Deyang Yang, Lei Bao, Liangliang Xia, Wenjun Mao, Hongkai Daken, Mayisha Zhang, Xiaoxu Cao, Mingqin |
author_sort | Zhang, Yan |
collection | PubMed |
description | BACKGROUND: We aimed to construct simple and practical metabolic syndrome (MetS) risk prediction models based on the data of inhabitants of Urumqi and to provide a methodological reference for the prevention and control of MetS. METHODS: This is a cross-sectional study conducted in the Xinjiang Uygur Autonomous Region of China. We collected data from inhabitants of Urumqi from 2018 to 2019, including demographic characteristics, anthropometric indicators, living habits and family history. Resampling technology was used to preprocess the data imbalance problems, and then MetS risk prediction models were constructed based on logistic regression (LR) and decision tree (DT). In addition, nomograms and tree diagrams of DT were used to explain and visualize the model. RESULTS: Of the 25,542 participants included in the study, 3,267 (12.8%) were diagnosed with MetS, and 22,275 (87.2%) were diagnosed with non-MetS. Both the LR and DT models based on the random undersampling dataset had good AUROC values (0.846 and 0.913, respectively). The accuracy, sensitivity, specificity, and AUROC values of the DT model were higher than those of the LR model. Based on a random undersampling dataset, the LR model showed that exercises such as walking (OR=0.769) and running (OR= 0.736) were protective factors against MetS. Age 60 ~ 74 years (OR=1.388), previous diabetes (OR=8.902), previous hypertension (OR=2.830), fatty liver (OR=3.306), smoking (OR=1.541), high systolic blood pressure (OR=1.044), and high diastolic blood pressure (OR=1.072) were risk factors for MetS; the DT model had 7 depth layers and 18 leaves, with BMI as the root node of the DT being the most important factor affecting MetS, and the other variables in descending order of importance: SBP, previous diabetes, previous hypertension, DBP, fatty liver, smoking, and exercise. CONCLUSIONS: Both DT and LR MetS risk prediction models have good prediction performance and their respective characteristics. Combining these two methods to construct an interpretable risk prediction model of MetS can provide methodological references for the prevention and control of MetS. |
format | Online Article Text |
id | pubmed-8822755 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88227552022-02-08 Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models Zhang, Yan Razbek, JAINA Li, Deyang Yang, Lei Bao, Liangliang Xia, Wenjun Mao, Hongkai Daken, Mayisha Zhang, Xiaoxu Cao, Mingqin BMC Public Health Research BACKGROUND: We aimed to construct simple and practical metabolic syndrome (MetS) risk prediction models based on the data of inhabitants of Urumqi and to provide a methodological reference for the prevention and control of MetS. METHODS: This is a cross-sectional study conducted in the Xinjiang Uygur Autonomous Region of China. We collected data from inhabitants of Urumqi from 2018 to 2019, including demographic characteristics, anthropometric indicators, living habits and family history. Resampling technology was used to preprocess the data imbalance problems, and then MetS risk prediction models were constructed based on logistic regression (LR) and decision tree (DT). In addition, nomograms and tree diagrams of DT were used to explain and visualize the model. RESULTS: Of the 25,542 participants included in the study, 3,267 (12.8%) were diagnosed with MetS, and 22,275 (87.2%) were diagnosed with non-MetS. Both the LR and DT models based on the random undersampling dataset had good AUROC values (0.846 and 0.913, respectively). The accuracy, sensitivity, specificity, and AUROC values of the DT model were higher than those of the LR model. Based on a random undersampling dataset, the LR model showed that exercises such as walking (OR=0.769) and running (OR= 0.736) were protective factors against MetS. Age 60 ~ 74 years (OR=1.388), previous diabetes (OR=8.902), previous hypertension (OR=2.830), fatty liver (OR=3.306), smoking (OR=1.541), high systolic blood pressure (OR=1.044), and high diastolic blood pressure (OR=1.072) were risk factors for MetS; the DT model had 7 depth layers and 18 leaves, with BMI as the root node of the DT being the most important factor affecting MetS, and the other variables in descending order of importance: SBP, previous diabetes, previous hypertension, DBP, fatty liver, smoking, and exercise. CONCLUSIONS: Both DT and LR MetS risk prediction models have good prediction performance and their respective characteristics. Combining these two methods to construct an interpretable risk prediction model of MetS can provide methodological references for the prevention and control of MetS. BioMed Central 2022-02-08 /pmc/articles/PMC8822755/ /pubmed/35135534 http://dx.doi.org/10.1186/s12889-022-12617-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhang, Yan Razbek, JAINA Li, Deyang Yang, Lei Bao, Liangliang Xia, Wenjun Mao, Hongkai Daken, Mayisha Zhang, Xiaoxu Cao, Mingqin Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title | Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title_full | Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title_fullStr | Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title_full_unstemmed | Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title_short | Construction of Xinjiang metabolic syndrome risk prediction model based on interpretable models |
title_sort | construction of xinjiang metabolic syndrome risk prediction model based on interpretable models |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8822755/ https://www.ncbi.nlm.nih.gov/pubmed/35135534 http://dx.doi.org/10.1186/s12889-022-12617-y |
work_keys_str_mv | AT zhangyan constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT razbekjaina constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT lideyang constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT yanglei constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT baoliangliang constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT xiawenjun constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT maohongkai constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT dakenmayisha constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT zhangxiaoxu constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels AT caomingqin constructionofxinjiangmetabolicsyndromeriskpredictionmodelbasedoninterpretablemodels |