A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, the BMD is not regularly measured in a clinical setting. This study aimed to develop a good prediction model for the osteoporosis risk using a machine lea...
Main Authors: | Wu, Xuangao; Park, Sunmin |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | The Korean Academy of Medical Sciences 2023 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226854/ https://www.ncbi.nlm.nih.gov/pubmed/37270917 http://dx.doi.org/10.3346/jkms.2023.38.e162 |
_version_ | 1785050652850782208 |
---|---|
author | Wu, Xuangao; Park, Sunmin |
author_facet | Wu, Xuangao; Park, Sunmin |
author_sort | Wu, Xuangao |
collection | PubMed |
description | BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, BMD is not regularly measured in a clinical setting. This study aimed to develop a prediction model for osteoporosis risk using a machine learning (ML) approach in adults over 40 years in the Ansan/Anseong cohort and to examine the association of predicted osteoporosis risk with fracture in the Health Examinees (HEXA) cohort. METHODS: A total of 109 demographic, anthropometric, biochemical, genetic, nutrient, and lifestyle variables of 8,842 participants in the Ansan/Anseong cohort were manually selected and included in the ML algorithms. The polygenic risk score (PRS) of osteoporosis was generated with a genome-wide association study and added to capture the genetic impact on osteoporosis. Osteoporosis was defined as a T score < −2.5 at the tibia or radius compared to people in their 20s–30s. Participants were divided randomly into training (n = 7,074) and test (n = 1,768) sets. Pearson’s correlation was used to assess the association between the predicted osteoporosis risk and fracture in the HEXA cohort. RESULTS: XGBoost, deep neural network, and random forest generated prediction models with a high area under the curve (AUC, 0.86) of the receiver operating characteristic (ROC) with 10, 15, and 20 features; among the seven ML approaches, the XGBoost model with 15 features had the highest AUC of ROC, with high accuracy and k-fold values (> 0.85). The model included the genetic factor, gender, number of children and breastfed children, age, residence area, education, season of measurement, height, smoking status, hormone replacement therapy, serum albumin, hip circumference, vitamin B6 intake, and body weight. The prediction models for women alone were similar to those for both genders, but with lower accuracy. When the prediction model was applied to the HEXA study, the correlation between the fracture incidence and predicted osteoporosis risk was significant but weak (r = 0.173, P < 0.001). CONCLUSION: The prediction model for osteoporosis risk generated by XGBoost can be applied to estimate osteoporosis risk. The biomarkers can be considered for enhancing the prevention, detection, and early therapy of osteoporosis risk in Asians. |
format | Online Article Text |
id | pubmed-10226854 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Korean Academy of Medical Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-10226854 2023-05-31 A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort Wu, Xuangao Park, Sunmin J Korean Med Sci Original Article BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, BMD is not regularly measured in a clinical setting. This study aimed to develop a prediction model for osteoporosis risk using a machine learning (ML) approach in adults over 40 years in the Ansan/Anseong cohort and to examine the association of predicted osteoporosis risk with fracture in the Health Examinees (HEXA) cohort. METHODS: A total of 109 demographic, anthropometric, biochemical, genetic, nutrient, and lifestyle variables of 8,842 participants in the Ansan/Anseong cohort were manually selected and included in the ML algorithms. The polygenic risk score (PRS) of osteoporosis was generated with a genome-wide association study and added to capture the genetic impact on osteoporosis. Osteoporosis was defined as a T score < −2.5 at the tibia or radius compared to people in their 20s–30s. Participants were divided randomly into training (n = 7,074) and test (n = 1,768) sets. Pearson’s correlation was used to assess the association between the predicted osteoporosis risk and fracture in the HEXA cohort. RESULTS: XGBoost, deep neural network, and random forest generated prediction models with a high area under the curve (AUC, 0.86) of the receiver operating characteristic (ROC) with 10, 15, and 20 features; among the seven ML approaches, the XGBoost model with 15 features had the highest AUC of ROC, with high accuracy and k-fold values (> 0.85). The model included the genetic factor, gender, number of children and breastfed children, age, residence area, education, season of measurement, height, smoking status, hormone replacement therapy, serum albumin, hip circumference, vitamin B6 intake, and body weight. The prediction models for women alone were similar to those for both genders, but with lower accuracy. When the prediction model was applied to the HEXA study, the correlation between the fracture incidence and predicted osteoporosis risk was significant but weak (r = 0.173, P < 0.001). CONCLUSION: The prediction model for osteoporosis risk generated by XGBoost can be applied to estimate osteoporosis risk. The biomarkers can be considered for enhancing the prevention, detection, and early therapy of osteoporosis risk in Asians. The Korean Academy of Medical Sciences 2023-04-25 /pmc/articles/PMC10226854/ /pubmed/37270917 http://dx.doi.org/10.3346/jkms.2023.38.e162 Text en © 2023 The Korean Academy of Medical Sciences. https://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Wu, Xuangao Park, Sunmin A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title | A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title_full | A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title_fullStr | A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title_full_unstemmed | A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title_short | A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort |
title_sort | prediction model for osteoporosis risk using a machine-learning approach and its validation in a large cohort |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226854/ https://www.ncbi.nlm.nih.gov/pubmed/37270917 http://dx.doi.org/10.3346/jkms.2023.38.e162 |
work_keys_str_mv | AT wuxuangao apredictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort AT parksunmin apredictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort AT wuxuangao predictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort AT parksunmin predictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort |
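
The evaluation steps named in the description (a random training/test split, the AUC of the ROC curve, and Pearson's correlation) can be sketched as follows. This is a minimal illustration using only the Python standard library, not the authors' pipeline: it does not train XGBoost or any other model, and the function names `split_indices`, `roc_auc`, and `pearson_r`, as well as the random seed, are assumptions made for the example. Only the cohort sizes (8,842 participants, 7,074 for training) come from the record.

```python
import random
from math import sqrt


def split_indices(n_total, n_train, seed=0):
    """Shuffle the indices 0..n_total-1 and cut them into train and test lists."""
    idx = list(range(n_total))
    random.Random(seed).shuffle(idx)  # seed is an arbitrary, hypothetical choice
    return idx[:n_train], idx[n_train:]


def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    labels are 0/1; scores are predicted risks. Tied scores get average ranks,
    so a tie between a positive and a negative counts as half a win.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of scores equal to the score at position i.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def pearson_r(x, y):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Cohort sizes taken from the description; everything else is a toy value.
    train_idx, test_idx = split_indices(8842, 7074)
    print(len(train_idx), len(test_idx))
    print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
    print(pearson_r([1, 2, 3], [2, 4, 6]))
```

The training and test sizes reported in the record are consistent with a split of roughly 80% and 20% of the 8,842 participants, although the record itself does not state the ratio.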