Osteoporosis Feature Selection and Risk Prediction Model by Machine Learning Using a Cross-Sectional Database

Bibliographic Details
Main authors: Cha, Yonghan; Seo, Sung Hyo; Kim, Jung-Taek; Kim, Jin-Woo; Lee, Sang-Yeob; Yoo, Jun-Il
Format: Online Article Text
Language: English
Published: The Korean Society for Bone and Mineral Research 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10509024/
https://www.ncbi.nlm.nih.gov/pubmed/37718904
http://dx.doi.org/10.11005/jbm.2023.30.3.263
Description
Summary: BACKGROUND: The purpose of this study was to verify the accuracy and validity of using machine learning (ML) to select risk factors, to identify differences in ML feature selection between men and women, and to develop predictive models for patients with osteoporosis in a large database. METHODS: Data on 968 observed features were collected from a total of 3,484 participants in the Korea National Health and Nutrition Examination Survey. To find preliminary features that were closely related to osteoporosis, logistic regression, random forest, gradient boosting, adaptive boosting, and support vector machine were used. RESULTS: In osteoporosis feature selection by the 5 ML models in this study, the variables most often selected as risk factors in both men and women were body mass index, monthly alcohol consumption, and dietary surveys. However, the features on which ML model selection differed between men and women were age, smoking, and blood glucose level. The receiver operating characteristic (ROC) analysis revealed that the area under the ROC curve for each ML model was not significantly different for either gender. CONCLUSIONS: ML performed feature selection for osteoporosis while accounting for hidden differences between men and women. The present study considers the preprocessing of input data and the feature selection process, as well as the ML technique, to be important factors for the accuracy of the osteoporosis prediction model.
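
Illustrative note: the summary compares models by the area under the ROC curve (AUC). As a minimal sketch of what that metric measures, the Python function below computes AUC from the rank-sum (Mann-Whitney U) identity. The function name `roc_auc` and the toy labels and scores are assumptions made for illustration only; they are not taken from the study, its data, or its code.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney U) identity.

    labels: sequence of 0/1 values (1 = positive class, e.g. osteoporosis)
    scores: sequence of model scores, higher = more likely positive
    """
    pairs = sorted(zip(scores, labels))
    n = len(pairs)
    n_pos = sum(label for _, label in pairs)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")

    # Walk through groups of tied scores, giving each group its average
    # 1-based rank, and accumulate the ranks of the positive cases.
    rank_sum_pos = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pairs[j][0] == pairs[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        rank_sum_pos += avg_rank * sum(label for _, label in pairs[i:j])
        i = j

    # U statistic for the positives, normalised by the number of
    # positive/negative pairs, equals the AUC.
    u_stat = rank_sum_pos - n_pos * (n_pos + 1) / 2.0
    return u_stat / (n_pos * n_neg)


# Toy example (hypothetical values, not study data).
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(toy_labels, toy_scores))  # 0.75
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so models with similar AUC values, as reported in the summary, rank cases about equally well.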