Cargando…

A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort

BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, the BMD is not regularly measured in a clinical setting. This study aimed to develop a good prediction model for the osteoporosis risk using a machine lea...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Xuangao, Park, Sunmin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Academy of Medical Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226854/
https://www.ncbi.nlm.nih.gov/pubmed/37270917
http://dx.doi.org/10.3346/jkms.2023.38.e162
_version_ 1785050652850782208
author Wu, Xuangao
Park, Sunmin
author_facet Wu, Xuangao
Park, Sunmin
author_sort Wu, Xuangao
collection PubMed
description BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, the BMD is not regularly measured in a clinical setting. This study aimed to develop a good prediction model for the osteoporosis risk using a machine learning (ML) approach in adults over 40 years in the Ansan/Anseong cohort and the association of predicted osteoporosis risk with a fracture in the Health Examinees (HEXA) cohort. METHODS: The 109 demographic, anthropometric, biochemical, genetic, nutrient, and lifestyle variables of 8,842 participants were manually selected in an Ansan/Anseong cohort and included in the ML algorithm. The polygenic risk score (PRS) of osteoporosis was generated with a genome-wide association study and added for the genetic impact of osteoporosis. Osteoporosis was defined with < −2.5 T scores of the tibia or radius compared to people in their 20s–30s. They were divided randomly into the training (n = 7,074) and test (n = 1,768) sets—Pearson’s correlation between the predicted osteoporosis risk and fracture in the HEXA cohort. RESULTS: XGBoost, deep neural network, and random forest generated the prediction model with a high area under the curve (AUC, 0.86) of the receiver operating characteristic (ROC) with 10, 15, and 20 features; the prediction model by XGBoost had the highest AUC of ROC, high accuracy and k-fold values (> 0.85) in 15 features among seven ML approaches. The model included the genetic factor, genders, number of children and breastfed children, age, residence area, education, seasons to measure, height, smoking status, hormone replacement therapy, serum albumin, hip circumferences, vitamin B6 intake, and body weight. The prediction models for women alone were similar to those for both genders, with lower accuracy. When the prediction model was applied to the HEXA study, the correlation between the fracture incidence and predicted osteoporosis risk was significant but weak (r = 0.173, P < 0.001). CONCLUSION: The prediction model for osteoporosis risk generated by XGBoost can be applied to estimate osteoporosis risk. The biomarkers can be considered for enhancing the prevention, detection, and early therapy of osteoporosis risk in Asians.
format Online
Article
Text
id pubmed-10226854
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Korean Academy of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-102268542023-05-31 A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort Wu, Xuangao Park, Sunmin J Korean Med Sci Original Article BACKGROUND: Osteoporosis develops in the elderly due to decreased bone mineral density (BMD), potentially increasing bone fracture risk. However, the BMD is not regularly measured in a clinical setting. This study aimed to develop a good prediction model for the osteoporosis risk using a machine learning (ML) approach in adults over 40 years in the Ansan/Anseong cohort and the association of predicted osteoporosis risk with a fracture in the Health Examinees (HEXA) cohort. METHODS: The 109 demographic, anthropometric, biochemical, genetic, nutrient, and lifestyle variables of 8,842 participants were manually selected in an Ansan/Anseong cohort and included in the ML algorithm. The polygenic risk score (PRS) of osteoporosis was generated with a genome-wide association study and added for the genetic impact of osteoporosis. Osteoporosis was defined with < −2.5 T scores of the tibia or radius compared to people in their 20s–30s. They were divided randomly into the training (n = 7,074) and test (n = 1,768) sets—Pearson’s correlation between the predicted osteoporosis risk and fracture in the HEXA cohort. RESULTS: XGBoost, deep neural network, and random forest generated the prediction model with a high area under the curve (AUC, 0.86) of the receiver operating characteristic (ROC) with 10, 15, and 20 features; the prediction model by XGBoost had the highest AUC of ROC, high accuracy and k-fold values (> 0.85) in 15 features among seven ML approaches. The model included the genetic factor, genders, number of children and breastfed children, age, residence area, education, seasons to measure, height, smoking status, hormone replacement therapy, serum albumin, hip circumferences, vitamin B6 intake, and body weight. The prediction models for women alone were similar to those for both genders, with lower accuracy. When the prediction model was applied to the HEXA study, the correlation between the fracture incidence and predicted osteoporosis risk was significant but weak (r = 0.173, P < 0.001). CONCLUSION: The prediction model for osteoporosis risk generated by XGBoost can be applied to estimate osteoporosis risk. The biomarkers can be considered for enhancing the prevention, detection, and early therapy of osteoporosis risk in Asians. The Korean Academy of Medical Sciences 2023-04-25 /pmc/articles/PMC10226854/ /pubmed/37270917 http://dx.doi.org/10.3346/jkms.2023.38.e162 Text en © 2023 The Korean Academy of Medical Sciences. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Wu, Xuangao
Park, Sunmin
A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title_full A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title_fullStr A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title_full_unstemmed A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title_short A Prediction Model for Osteoporosis Risk Using a Machine-Learning Approach and Its Validation in a Large Cohort
title_sort prediction model for osteoporosis risk using a machine-learning approach and its validation in a large cohort
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10226854/
https://www.ncbi.nlm.nih.gov/pubmed/37270917
http://dx.doi.org/10.3346/jkms.2023.38.e162
work_keys_str_mv AT wuxuangao apredictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort
AT parksunmin apredictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort
AT wuxuangao predictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort
AT parksunmin predictionmodelforosteoporosisriskusingamachinelearningapproachanditsvalidationinalargecohort