Cargando…

Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population

CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction...

Descripción completa

Detalles Bibliográficos
Autores principales: Kato, Hajime, Hoshino, Yoshitomo, Hidaka, Naoko, Ito, Nobuaki, Makita, Noriko, Nangaku, Masaomi, Inoue, Kosuke
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9693802/
https://www.ncbi.nlm.nih.gov/pubmed/36125184
http://dx.doi.org/10.1210/clinem/dgac544
_version_ 1784837636058251264
author Kato, Hajime
Hoshino, Yoshitomo
Hidaka, Naoko
Ito, Nobuaki
Makita, Noriko
Nangaku, Masaomi
Inoue, Kosuke
author_facet Kato, Hajime
Hoshino, Yoshitomo
Hidaka, Naoko
Ito, Nobuaki
Makita, Noriko
Nangaku, Masaomi
Inoue, Kosuke
author_sort Kato, Hajime
collection PubMed
description CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction model of elevated PTH levels based on demographic, lifestyle, and biochemical data among US adults. METHODS: This population-based study included adults aged 20 years or older with a measurement of serum intact PTH from the National Health and Nutrition Examination Survey (NHANES) 2003 to 2006. We used the NHANES 2003 to 2004 cohort (n = 4096) to train 6 machine-learning prediction models (logistic regression with and without splines, lasso regression, random forest, gradient-boosting machines [GBMs], and SuperLearner). Then, we used the NHANES 2005 to 2006 cohort (n = 4112) to evaluate the model performance including area under the receiver operating characteristic curve (AUC). RESULTS: Of 8208 US adults, 753 (9.2%) showed PTH greater than 74 pg/mL. Across 6 algorithms, the highest AUC was observed among random forest (AUC [95% CI] = 0.79 [0.76-0.81]), GBM (AUC [95% CI] = 0.78 [0.75-0.81]), and SuperLearner (AUC [95% CI] = 0.79 [0.76-0.81]). The AUC improved from 0.69 to 0.77 when we added cubic splines for the estimated glomerular filtration rate (eGFR) in the logistic regression models. Logistic regression models with splines showed the best calibration performance (calibration slope [95% CI] = 0.96 [0.86-1.06]), while other algorithms were less calibrated. Among all covariates included, eGFR was the most important predictor of the random forest model and GBM. CONCLUSION: In this nationally representative data in the United States, we developed a prediction model that potentially helps us to make accurate and early detection of elevated PTH in general clinical practice. Future studies are warranted to assess whether this prediction tool for elevated PTH would improve adverse health outcomes.
format Online
Article
Text
id pubmed-9693802
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96938022022-11-28 Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population Kato, Hajime Hoshino, Yoshitomo Hidaka, Naoko Ito, Nobuaki Makita, Noriko Nangaku, Masaomi Inoue, Kosuke J Clin Endocrinol Metab Clinical Research Article CONTEXT: Although elevated parathyroid hormone (PTH) levels are associated with higher mortality risks, the evidence is limited as to when PTH is expected to be elevated and thus should be measured among the general population. OBJECTIVE: This work aimed to build a machine learning–based prediction model of elevated PTH levels based on demographic, lifestyle, and biochemical data among US adults. METHODS: This population-based study included adults aged 20 years or older with a measurement of serum intact PTH from the National Health and Nutrition Examination Survey (NHANES) 2003 to 2006. We used the NHANES 2003 to 2004 cohort (n = 4096) to train 6 machine-learning prediction models (logistic regression with and without splines, lasso regression, random forest, gradient-boosting machines [GBMs], and SuperLearner). Then, we used the NHANES 2005 to 2006 cohort (n = 4112) to evaluate the model performance including area under the receiver operating characteristic curve (AUC). RESULTS: Of 8208 US adults, 753 (9.2%) showed PTH greater than 74 pg/mL. Across 6 algorithms, the highest AUC was observed among random forest (AUC [95% CI] = 0.79 [0.76-0.81]), GBM (AUC [95% CI] = 0.78 [0.75-0.81]), and SuperLearner (AUC [95% CI] = 0.79 [0.76-0.81]). The AUC improved from 0.69 to 0.77 when we added cubic splines for the estimated glomerular filtration rate (eGFR) in the logistic regression models. Logistic regression models with splines showed the best calibration performance (calibration slope [95% CI] = 0.96 [0.86-1.06]), while other algorithms were less calibrated. Among all covariates included, eGFR was the most important predictor of the random forest model and GBM. CONCLUSION: In this nationally representative data in the United States, we developed a prediction model that potentially helps us to make accurate and early detection of elevated PTH in general clinical practice. Future studies are warranted to assess whether this prediction tool for elevated PTH would improve adverse health outcomes. Oxford University Press 2022-09-19 /pmc/articles/PMC9693802/ /pubmed/36125184 http://dx.doi.org/10.1210/clinem/dgac544 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the Endocrine Society. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Clinical Research Article
Kato, Hajime
Hoshino, Yoshitomo
Hidaka, Naoko
Ito, Nobuaki
Makita, Noriko
Nangaku, Masaomi
Inoue, Kosuke
Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title_full Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title_fullStr Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title_full_unstemmed Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title_short Machine Learning–Based Prediction of Elevated PTH Levels Among the US General Population
title_sort machine learning–based prediction of elevated pth levels among the us general population
topic Clinical Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9693802/
https://www.ncbi.nlm.nih.gov/pubmed/36125184
http://dx.doi.org/10.1210/clinem/dgac544
work_keys_str_mv AT katohajime machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT hoshinoyoshitomo machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT hidakanaoko machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT itonobuaki machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT makitanoriko machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT nangakumasaomi machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation
AT inouekosuke machinelearningbasedpredictionofelevatedpthlevelsamongtheusgeneralpopulation