Cargando…

Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018

BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included f...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Ziying, Cheng, Yuen Ting, Cheung, Bernard Man Yung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198242/
https://www.ncbi.nlm.nih.gov/pubmed/37162442
http://dx.doi.org/10.1080/07853890.2023.2209336
_version_ 1785044707008577536
author Lin, Ziying
Cheng, Yuen Ting
Cheung, Bernard Man Yung
author_facet Lin, Ziying
Cheng, Yuen Ting
Cheung, Bernard Man Yung
author_sort Lin, Ziying
collection PubMed
description BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included for analysis. To select the most suitable algorithm, we tested and evaluated five machine learning algorithms commonly employed in epidemiological studies: Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting. These algorithms were accessed using a set of 38 screened features. We then selected the key hypokalaemia-associated features in the hypertension group and their cardiovascular diseases (CVD) subgroup using the SHapley Additive exPlanations (SHAP) values. Using SHAP values, the key features and their impact pattern on hypokalaemia risk were determined. RESULTS: A total of 25,326 hypertensive participants were included for analysis, of whom 4,511 had known CVD. The Random Forest algorithm had the highest AUROC (hypertension dataset: 0.73 [95%CI, 0.71–0.76]; CVD subgroup: 0.72 [95%CI, 0.66–0.78]). Moreover, the nomogram based on the top twelve key features screened by random forest retained good performance: age, sex, race, poverty income ratio, body mass index, systolic and diastolic blood pressure, non-potassium-sparing diuretics use and duration, renin-angiotensin blockers use and duration, and CVD history in hypertension dataset; while in CVD subgroup, the additional key features were comorbid diabetes, education level, smoking status, and use of bronchodilators. CONCLUSION: Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms. Hypokalaemia-associated key features have been identified in hypertensive patients and the subgroup with CVD. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. KEY MESSAGES: 1. Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms, and hypokalemia-associated key features have been identified in hypertensive patients and the subgroup with cardiovascular disease. 2. The nomogram we developed including twelve key features might be useful and applied in primary clinical consultations to identify the hypertensive patients at risk of hypokalaemia. 3. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients.
format Online
Article
Text
id pubmed-10198242
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-101982422023-05-20 Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 Lin, Ziying Cheng, Yuen Ting Cheung, Bernard Man Yung Ann Med Research Article BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included for analysis. To select the most suitable algorithm, we tested and evaluated five machine learning algorithms commonly employed in epidemiological studies: Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting. These algorithms were accessed using a set of 38 screened features. We then selected the key hypokalaemia-associated features in the hypertension group and their cardiovascular diseases (CVD) subgroup using the SHapley Additive exPlanations (SHAP) values. Using SHAP values, the key features and their impact pattern on hypokalaemia risk were determined. RESULTS: A total of 25,326 hypertensive participants were included for analysis, of whom 4,511 had known CVD. The Random Forest algorithm had the highest AUROC (hypertension dataset: 0.73 [95%CI, 0.71–0.76]; CVD subgroup: 0.72 [95%CI, 0.66–0.78]). Moreover, the nomogram based on the top twelve key features screened by random forest retained good performance: age, sex, race, poverty income ratio, body mass index, systolic and diastolic blood pressure, non-potassium-sparing diuretics use and duration, renin-angiotensin blockers use and duration, and CVD history in hypertension dataset; while in CVD subgroup, the additional key features were comorbid diabetes, education level, smoking status, and use of bronchodilators. CONCLUSION: Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms. Hypokalaemia-associated key features have been identified in hypertensive patients and the subgroup with CVD. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. KEY MESSAGES: 1. Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms, and hypokalemia-associated key features have been identified in hypertensive patients and the subgroup with cardiovascular disease. 2. The nomogram we developed including twelve key features might be useful and applied in primary clinical consultations to identify the hypertensive patients at risk of hypokalaemia. 3. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. Taylor & Francis 2023-05-10 /pmc/articles/PMC10198242/ /pubmed/37162442 http://dx.doi.org/10.1080/07853890.2023.2209336 Text en © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent.
spellingShingle Research Article
Lin, Ziying
Cheng, Yuen Ting
Cheung, Bernard Man Yung
Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title_full Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title_fullStr Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title_full_unstemmed Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title_short Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
title_sort machine learning algorithms identify hypokalaemia risk in people with hypertension in the united states national health and nutrition examination survey 1999–2018
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198242/
https://www.ncbi.nlm.nih.gov/pubmed/37162442
http://dx.doi.org/10.1080/07853890.2023.2209336
work_keys_str_mv AT linziying machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018
AT chengyuenting machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018
AT cheungbernardmanyung machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018