Cargando…
Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018
BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included f...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198242/ https://www.ncbi.nlm.nih.gov/pubmed/37162442 http://dx.doi.org/10.1080/07853890.2023.2209336 |
_version_ | 1785044707008577536 |
---|---|
author | Lin, Ziying Cheng, Yuen Ting Cheung, Bernard Man Yung |
author_facet | Lin, Ziying Cheng, Yuen Ting Cheung, Bernard Man Yung |
author_sort | Lin, Ziying |
collection | PubMed |
description | BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included for analysis. To select the most suitable algorithm, we tested and evaluated five machine learning algorithms commonly employed in epidemiological studies: Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting. These algorithms were accessed using a set of 38 screened features. We then selected the key hypokalaemia-associated features in the hypertension group and their cardiovascular diseases (CVD) subgroup using the SHapley Additive exPlanations (SHAP) values. Using SHAP values, the key features and their impact pattern on hypokalaemia risk were determined. RESULTS: A total of 25,326 hypertensive participants were included for analysis, of whom 4,511 had known CVD. The Random Forest algorithm had the highest AUROC (hypertension dataset: 0.73 [95%CI, 0.71–0.76]; CVD subgroup: 0.72 [95%CI, 0.66–0.78]). Moreover, the nomogram based on the top twelve key features screened by random forest retained good performance: age, sex, race, poverty income ratio, body mass index, systolic and diastolic blood pressure, non-potassium-sparing diuretics use and duration, renin-angiotensin blockers use and duration, and CVD history in hypertension dataset; while in CVD subgroup, the additional key features were comorbid diabetes, education level, smoking status, and use of bronchodilators. CONCLUSION: Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms. Hypokalaemia-associated key features have been identified in hypertensive patients and the subgroup with CVD. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. KEY MESSAGES: 1. Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms, and hypokalemia-associated key features have been identified in hypertensive patients and the subgroup with cardiovascular disease. 2. The nomogram we developed including twelve key features might be useful and applied in primary clinical consultations to identify the hypertensive patients at risk of hypokalaemia. 3. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. |
format | Online Article Text |
id | pubmed-10198242 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-101982422023-05-20 Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 Lin, Ziying Cheng, Yuen Ting Cheung, Bernard Man Yung Ann Med Research Article BACKGROUND: Hypokalaemia is a side-effect of diuretics. We aimed to use machine learning to identify features predicting hypokalaemia risk in hypertensive patients. METHODS: Participants with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 were included for analysis. To select the most suitable algorithm, we tested and evaluated five machine learning algorithms commonly employed in epidemiological studies: Logistic Regression, k-Nearest Neighbor, Random Forest, Recursive Partitioning and Regression Trees, and eXtreme Gradient Boosting. These algorithms were accessed using a set of 38 screened features. We then selected the key hypokalaemia-associated features in the hypertension group and their cardiovascular diseases (CVD) subgroup using the SHapley Additive exPlanations (SHAP) values. Using SHAP values, the key features and their impact pattern on hypokalaemia risk were determined. RESULTS: A total of 25,326 hypertensive participants were included for analysis, of whom 4,511 had known CVD. The Random Forest algorithm had the highest AUROC (hypertension dataset: 0.73 [95%CI, 0.71–0.76]; CVD subgroup: 0.72 [95%CI, 0.66–0.78]). Moreover, the nomogram based on the top twelve key features screened by random forest retained good performance: age, sex, race, poverty income ratio, body mass index, systolic and diastolic blood pressure, non-potassium-sparing diuretics use and duration, renin-angiotensin blockers use and duration, and CVD history in hypertension dataset; while in CVD subgroup, the additional key features were comorbid diabetes, education level, smoking status, and use of bronchodilators. CONCLUSION: Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms. Hypokalaemia-associated key features have been identified in hypertensive patients and the subgroup with CVD. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. KEY MESSAGES: 1. Our predictive model based on the random forest algorithm performed best among the tested and evaluated five algorithms, and hypokalemia-associated key features have been identified in hypertensive patients and the subgroup with cardiovascular disease. 2. The nomogram we developed including twelve key features might be useful and applied in primary clinical consultations to identify the hypertensive patients at risk of hypokalaemia. 3. These findings from machine learning facilitate the development of artificial intelligence to highlight hypokalaemia risk in hypertension patients. Taylor & Francis 2023-05-10 /pmc/articles/PMC10198242/ /pubmed/37162442 http://dx.doi.org/10.1080/07853890.2023.2209336 Text en © 2023 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. The terms on which this article has been published allow the posting of the Accepted Manuscript in a repository by the author(s) or with their consent. |
spellingShingle | Research Article Lin, Ziying Cheng, Yuen Ting Cheung, Bernard Man Yung Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title | Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title_full | Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title_fullStr | Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title_full_unstemmed | Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title_short | Machine learning algorithms identify hypokalaemia risk in people with hypertension in the United States National Health and Nutrition Examination Survey 1999–2018 |
title_sort | machine learning algorithms identify hypokalaemia risk in people with hypertension in the united states national health and nutrition examination survey 1999–2018 |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10198242/ https://www.ncbi.nlm.nih.gov/pubmed/37162442 http://dx.doi.org/10.1080/07853890.2023.2209336 |
work_keys_str_mv | AT linziying machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018 AT chengyuenting machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018 AT cheungbernardmanyung machinelearningalgorithmsidentifyhypokalaemiariskinpeoplewithhypertensionintheunitedstatesnationalhealthandnutritionexaminationsurvey19992018 |