Cargando…

Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods

AIM: This study used machine learning methods to develop a prediction model for knee pain in middle-aged and elderly individuals. METHODS: A total of 5386 individuals above 45 years old were obtained from the National Health and Nutrition Examination Survey. Participants were randomly divided into a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Lu, Zhu, Min-min, Cai, Lin-lin, Zhang, Xiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9529423/
https://www.ncbi.nlm.nih.gov/pubmed/36199766
http://dx.doi.org/10.1155/2022/5005195
_version_ 1784801490140921856
author Liu, Lu
Zhu, Min-min
Cai, Lin-lin
Zhang, Xiao
author_facet Liu, Lu
Zhu, Min-min
Cai, Lin-lin
Zhang, Xiao
author_sort Liu, Lu
collection PubMed
description AIM: This study used machine learning methods to develop a prediction model for knee pain in middle-aged and elderly individuals. METHODS: A total of 5386 individuals above 45 years old were obtained from the National Health and Nutrition Examination Survey. Participants were randomly divided into a training set and a test set at a 7 : 3 ratio. The training set was used to create a prediction model, whereas the test set was used to validate the proposed model. We constructed multiple predictive models based on three machine learning methods: logistic regression, random forest, and Extreme Gradient Boosting. The model performance was evaluated by areas under the receiver (AUC), sensitivity, specificity, positive predictive value, and negative predictive value. Additionally, we created a simplified nomogram based on logistic regression for better clinical application. RESULTS: About 31.4% (1690) individuals were with self-reported knee pain. The logistic regression showed that female gender (odds ratio [OR] = 1.28), pain elsewhere (OR = 4.64), and body mass index (OR = 1.05) were significantly associated with increased risk of knee pain. In the test set, the logistic regression (AUC = 0.71) showed similar but slightly higher accuracy than the random forest (AUC = 0.70), while the performance of the Extreme Gradient Boosting model was less reliable (AUC = 0.59). Based on mean decrease accuracy, the most important first five predictions were pain elsewhere, waist circumference, body mass index, age, and gender. Additionally, the most important first five predictions with the highest mean decrease Gini index were pain elsewhere, body mass index, waist circumference, triglycerides, and age. The nomogram model showed good discrimination ability with an AUC of 0.75 (0.73-0.77), a sensitivity of 0.72, specificity of 0.71, a positive predictive value of 0.45, and a negative predictive value of 0.88. CONCLUSION: This study proposed a convenient nomogram tool to evaluate the risk of knee pain for the middle-aged and elderly US population in primary care. All the input variables can be easily obtained in a clinical setting, and no additional radiologic assessments were required.
format Online
Article
Text
id pubmed-9529423
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95294232022-10-04 Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods Liu, Lu Zhu, Min-min Cai, Lin-lin Zhang, Xiao Comput Math Methods Med Research Article AIM: This study used machine learning methods to develop a prediction model for knee pain in middle-aged and elderly individuals. METHODS: A total of 5386 individuals above 45 years old were obtained from the National Health and Nutrition Examination Survey. Participants were randomly divided into a training set and a test set at a 7 : 3 ratio. The training set was used to create a prediction model, whereas the test set was used to validate the proposed model. We constructed multiple predictive models based on three machine learning methods: logistic regression, random forest, and Extreme Gradient Boosting. The model performance was evaluated by areas under the receiver (AUC), sensitivity, specificity, positive predictive value, and negative predictive value. Additionally, we created a simplified nomogram based on logistic regression for better clinical application. RESULTS: About 31.4% (1690) individuals were with self-reported knee pain. The logistic regression showed that female gender (odds ratio [OR] = 1.28), pain elsewhere (OR = 4.64), and body mass index (OR = 1.05) were significantly associated with increased risk of knee pain. In the test set, the logistic regression (AUC = 0.71) showed similar but slightly higher accuracy than the random forest (AUC = 0.70), while the performance of the Extreme Gradient Boosting model was less reliable (AUC = 0.59). Based on mean decrease accuracy, the most important first five predictions were pain elsewhere, waist circumference, body mass index, age, and gender. Additionally, the most important first five predictions with the highest mean decrease Gini index were pain elsewhere, body mass index, waist circumference, triglycerides, and age. The nomogram model showed good discrimination ability with an AUC of 0.75 (0.73-0.77), a sensitivity of 0.72, specificity of 0.71, a positive predictive value of 0.45, and a negative predictive value of 0.88. CONCLUSION: This study proposed a convenient nomogram tool to evaluate the risk of knee pain for the middle-aged and elderly US population in primary care. All the input variables can be easily obtained in a clinical setting, and no additional radiologic assessments were required. Hindawi 2022-09-26 /pmc/articles/PMC9529423/ /pubmed/36199766 http://dx.doi.org/10.1155/2022/5005195 Text en Copyright © 2022 Lu Liu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Lu
Zhu, Min-min
Cai, Lin-lin
Zhang, Xiao
Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title_full Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title_fullStr Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title_full_unstemmed Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title_short Predictive Models for Knee Pain in Middle-Aged and Elderly Individuals Based on Machine Learning Methods
title_sort predictive models for knee pain in middle-aged and elderly individuals based on machine learning methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9529423/
https://www.ncbi.nlm.nih.gov/pubmed/36199766
http://dx.doi.org/10.1155/2022/5005195
work_keys_str_mv AT liulu predictivemodelsforkneepaininmiddleagedandelderlyindividualsbasedonmachinelearningmethods
AT zhuminmin predictivemodelsforkneepaininmiddleagedandelderlyindividualsbasedonmachinelearningmethods
AT cailinlin predictivemodelsforkneepaininmiddleagedandelderlyindividualsbasedonmachinelearningmethods
AT zhangxiao predictivemodelsforkneepaininmiddleagedandelderlyindividualsbasedonmachinelearningmethods