Cargando…

Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records

BACKGROUND: Predicting the risk of glycated hemoglobin (HbA(1c)) elevation can help identify patients with the potential for developing serious chronic health problems, such as diabetes. Early preventive interventions based upon advanced predictive models using electronic health records data for ide...

Descripción completa

Detalles Bibliográficos
Autores principales: Alhassan, Zakhriya, Watson, Matthew, Budgen, David, Alshammari, Riyad, Alessa, Ali, Al Moubayed, Noura
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8185616/
https://www.ncbi.nlm.nih.gov/pubmed/34028357
http://dx.doi.org/10.2196/25237
_version_ 1783704823373234176
author Alhassan, Zakhriya
Watson, Matthew
Budgen, David
Alshammari, Riyad
Alessa, Ali
Al Moubayed, Noura
author_facet Alhassan, Zakhriya
Watson, Matthew
Budgen, David
Alshammari, Riyad
Alessa, Ali
Al Moubayed, Noura
author_sort Alhassan, Zakhriya
collection PubMed
description BACKGROUND: Predicting the risk of glycated hemoglobin (HbA(1c)) elevation can help identify patients with the potential for developing serious chronic health problems, such as diabetes. Early preventive interventions based upon advanced predictive models using electronic health records data for identifying such patients can ultimately help provide better health outcomes. OBJECTIVE: Our study investigated the performance of predictive models to forecast HbA(1c) elevation levels by employing several machine learning models. We also examined the use of patient electronic health record longitudinal data in the performance of the predictive models. Explainable methods were employed to interpret the decisions made by the black box models. METHODS: This study employed multiple logistic regression, random forest, support vector machine, and logistic regression models, as well as a deep learning model (multilayer perceptron) to classify patients with normal (<5.7%) and elevated (≥5.7%) levels of HbA(1c). We also integrated current visit data with historical (longitudinal) data from previous visits. Explainable machine learning methods were used to interrogate the models and provide an understanding of the reasons behind the decisions made by the models. All models were trained and tested using a large data set from Saudi Arabia with 18,844 unique patient records. RESULTS: The machine learning models achieved promising results for predicting current HbA(1c) elevation risk. When coupled with longitudinal data, the machine learning models outperformed the multiple logistic regression model used in the comparative study. The multilayer perceptron model achieved an accuracy of 83.22% for the area under receiver operating characteristic curve when used with historical data. All models showed a close level of agreement on the contribution of random blood sugar and age variables with and without longitudinal data. CONCLUSIONS: This study shows that machine learning models can provide promising results for the task of predicting current HbA(1c) levels (≥5.7% or less). Using patients’ longitudinal data improved the performance and affected the relative importance for the predictors used. The models showed results that are consistent with comparable studies.
format Online
Article
Text
id pubmed-8185616
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81856162021-06-25 Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records Alhassan, Zakhriya Watson, Matthew Budgen, David Alshammari, Riyad Alessa, Ali Al Moubayed, Noura JMIR Med Inform Original Paper BACKGROUND: Predicting the risk of glycated hemoglobin (HbA(1c)) elevation can help identify patients with the potential for developing serious chronic health problems, such as diabetes. Early preventive interventions based upon advanced predictive models using electronic health records data for identifying such patients can ultimately help provide better health outcomes. OBJECTIVE: Our study investigated the performance of predictive models to forecast HbA(1c) elevation levels by employing several machine learning models. We also examined the use of patient electronic health record longitudinal data in the performance of the predictive models. Explainable methods were employed to interpret the decisions made by the black box models. METHODS: This study employed multiple logistic regression, random forest, support vector machine, and logistic regression models, as well as a deep learning model (multilayer perceptron) to classify patients with normal (<5.7%) and elevated (≥5.7%) levels of HbA(1c). We also integrated current visit data with historical (longitudinal) data from previous visits. Explainable machine learning methods were used to interrogate the models and provide an understanding of the reasons behind the decisions made by the models. All models were trained and tested using a large data set from Saudi Arabia with 18,844 unique patient records. RESULTS: The machine learning models achieved promising results for predicting current HbA(1c) elevation risk. When coupled with longitudinal data, the machine learning models outperformed the multiple logistic regression model used in the comparative study. The multilayer perceptron model achieved an accuracy of 83.22% for the area under receiver operating characteristic curve when used with historical data. All models showed a close level of agreement on the contribution of random blood sugar and age variables with and without longitudinal data. CONCLUSIONS: This study shows that machine learning models can provide promising results for the task of predicting current HbA(1c) levels (≥5.7% or less). Using patients’ longitudinal data improved the performance and affected the relative importance for the predictors used. The models showed results that are consistent with comparable studies. JMIR Publications 2021-05-24 /pmc/articles/PMC8185616/ /pubmed/34028357 http://dx.doi.org/10.2196/25237 Text en ©Zakhriya Alhassan, Matthew Watson, David Budgen, Riyad Alshammari, Ali Alessa, Noura Al Moubayed. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 24.05.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Alhassan, Zakhriya
Watson, Matthew
Budgen, David
Alshammari, Riyad
Alessa, Ali
Al Moubayed, Noura
Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title_full Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title_fullStr Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title_full_unstemmed Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title_short Improving Current Glycated Hemoglobin Prediction in Adults: Use of Machine Learning Algorithms With Electronic Health Records
title_sort improving current glycated hemoglobin prediction in adults: use of machine learning algorithms with electronic health records
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8185616/
https://www.ncbi.nlm.nih.gov/pubmed/34028357
http://dx.doi.org/10.2196/25237
work_keys_str_mv AT alhassanzakhriya improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords
AT watsonmatthew improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords
AT budgendavid improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords
AT alshammaririyad improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords
AT alessaali improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords
AT almoubayednoura improvingcurrentglycatedhemoglobinpredictioninadultsuseofmachinelearningalgorithmswithelectronichealthrecords