Cargando…
Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9693802/ https://www.ncbi.nlm.nih.gov/pubmed/36125184 http://dx.doi.org/10.1210/clinem/dgac544 |
_version_ | 1784837636058251264 |
---|---|
author | Kato, Hajime Hoshino, Yoshitomo Hidaka, Naoko Ito, Nobuaki Makita, Noriko Nangaku, Masaomi Inoue, Kosuke |
author_facet | Kato, Hajime Hoshino, Yoshitomo Hidaka, Naoko Ito, Nobuaki Makita, Noriko Nangaku, Masaomi Inoue, Kosuke |
author_sort | Kato, Hajime |
collection | PubMed |
description | CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction model of elevated PTH levels based on demographic, lifestyle, and biochemical data among US adults. METHODS: This population-based study included adults aged 20 years or older with a measurement of serum intact PTH from the National Health and Nutrition Examination Survey (NHANES) 2003 to 2006. We used the NHANES 2003 to 2004 cohort (n = 4096) to train 6 machine-learning prediction models (logistic regression with and without splines, lasso regression, random forest, gradient-boosting machines [GBMs], and SuperLearner). Then, we used the NHANES 2005 to 2006 cohort (n = 4112) to evaluate the model performance including area under the receiver operating characteristic curve (AUC). RESULTS: Of 8208 US adults, 753 (9.2%) showed PTH greater than 74 pg/mL. Across 6 algorithms, the highest AUC was observed among random forest (AUC [95% CI] = 0.79 [0.76-0.81]), GBM (AUC [95% CI] = 0.78 [0.75-0.81]), and SuperLearner (AUC [95% CI] = 0.79 [0.76-0.81]). The AUC improved from 0.69 to 0.77 when we added cubic splines for the estimated glomerular filtration rate (eGFR) in the logistic regression models. Logistic regression models with splines showed the best calibration performance (calibration slope [95% CI] = 0.96 [0.86-1.06]), while other algorithms were less calibrated. Among all covariates included, eGFR was the most important predictor of the random forest model and GBM. CONCLUSION: In this nationally representative data in the United States, we developed a prediction model that potentially helps us to make accurate and early detection of elevated PTH in general clinical practice. Future studies are warranted to assess whether this prediction tool for elevated PTH would improve adverse health outcomes. |
format | Online Article Text |
id | pubmed-9693802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-96938022022-11-28 Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population Kato, Hajime Hoshino, Yoshitomo Hidaka, Naoko Ito, Nobuaki Makita, Noriko Nangaku, Masaomi Inoue, Kosuke J Clin Endocrinol Metab Clinical Research Article CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction model of elevated PTH levels based on demographic, lifestyle, and biochemical data among US adults. METHODS: This population-based study included adults aged 20 years or older with a measurement of serum intact PTH from the National Health and Nutrition Examination Survey (NHANES) 2003 to 2006. We used the NHANES 2003 to 2004 cohort (n = 4096) to train 6 machine-learning prediction models (logistic regression with and without splines, lasso regression, random forest, gradient-boosting machines [GBMs], and SuperLearner). Then, we used the NHANES 2005 to 2006 cohort (n = 4112) to evaluate the model performance including area under the receiver operating characteristic curve (AUC). RESULTS: Of 8208 US adults, 753 (9.2%) showed PTH greater than 74 pg/mL. Across 6 algorithms, the highest AUC was observed among random forest (AUC [95% CI] = 0.79 [0.76-0.81]), GBM (AUC [95% CI] = 0.78 [0.75-0.81]), and SuperLearner (AUC [95% CI] = 0.79 [0.76-0.81]). The AUC improved from 0.69 to 0.77 when we added cubic splines for the estimated glomerular filtration rate (eGFR) in the logistic regression models. Logistic regression models with splines showed the best calibration performance (calibration slope [95% CI] = 0.96 [0.86-1.06]), while other algorithms were less calibrated. Among all covariates included, eGFR was the most important predictor of the random forest model and GBM. CONCLUSION: In this nationally representative data in the United States, we developed a prediction model that potentially helps us to make accurate and early detection of elevated PTH in general clinical practice. Future studies are warranted to assess whether this prediction tool for elevated PTH would improve adverse health outcomes. Oxford University Press 2022-09-19 /pmc/articles/PMC9693802/ /pubmed/36125184 http://dx.doi.org/10.1210/clinem/dgac544 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Endocrine Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Clinical Research Article Kato, Hajime Hoshino, Yoshitomo Hidaka, Naoko Ito, Nobuaki Makita, Noriko Nangaku, Masaomi Inoue, Kosuke Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population |
title | Machine Learning–Based Prediction of Elevated PTH Levels Among the US General
Population |
title_full | Machine Learning–Based Prediction of Elevated PTH Levels Among the US General
Population |
title_fullStr | Machine Learning–Based Prediction of Elevated PTH Levels Among the US General
Population |
title_full_unstemmed | Machine Learning–Based Prediction of Elevated PTH Levels Among the US General
Population |
title_short | Machine Learning–Based Prediction of Elevated PTH Levels Among the US General
Population |
title_sort | machine learning–based prediction of elevated pth levels among the us general
population |
topic | Clinical Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9693802/ https://www.ncbi.nlm.nih.gov/pubmed/36125184 http://dx.doi.org/10.1210/clinem/dgac544 |
work_keys_str_mv | AT katohajime machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT hoshinoyoshitomo machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT hidakanaoko machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT itonobuaki machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT makitanoriko machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT nangakumasaomi machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation AT inouekosuke machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation |