Cargando…

Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study

BACKGROUND: Cardiovascular disease (CVD) is the greatest health problem in Australia, which kills more people than any other disease and incurs enormous costs for the health care system. In this study, we present a benchmark comparison of various artificial intelligence (AI) architectures for predic...

Descripción completa

Detalles Bibliográficos
Autores principales: Tran, Linh, Chi, Lianhua, Bonti, Alessio, Abdelrazek, Mohamed, Chen, Yi-Ping Phoebe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050753/
https://www.ncbi.nlm.nih.gov/pubmed/33792549
http://dx.doi.org/10.2196/25000
_version_ 1783679632943349760
author Tran, Linh
Chi, Lianhua
Bonti, Alessio
Abdelrazek, Mohamed
Chen, Yi-Ping Phoebe
author_facet Tran, Linh
Chi, Lianhua
Bonti, Alessio
Abdelrazek, Mohamed
Chen, Yi-Ping Phoebe
author_sort Tran, Linh
collection PubMed
description BACKGROUND: Cardiovascular disease (CVD) is the greatest health problem in Australia, which kills more people than any other disease and incurs enormous costs for the health care system. In this study, we present a benchmark comparison of various artificial intelligence (AI) architectures for predicting the mortality rate of patients with CVD using structured medical claims data. Compared with other research in the clinical literature, our models are more efficient because we use a smaller number of features, and this study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit. OBJECTIVE: This study aims to support health clinicians in accurately predicting mortality among patients with CVD using only claims data before a clinic visit. METHODS: The data set was obtained from the Medicare Benefits Scheme and Pharmaceutical Benefits Scheme service information in the period between 2004 and 2014, released by the Department of Health Australia in 2016. It included 346,201 records, corresponding to 346,201 patients. A total of five AI algorithms, including four classical machine learning algorithms (logistic regression [LR], random forest [RF], extra trees [ET], and gradient boosting trees [GBT]) and a deep learning algorithm, which is a densely connected neural network (DNN), were developed and compared in this study. In addition, because of the minority of deceased patients in the data set, a separate experiment using the Synthetic Minority Oversampling Technique (SMOTE) was conducted to enrich the data. RESULTS: Regarding model performance, in terms of discrimination, GBT and RF were the models with the highest area under the receiver operating characteristic curve (97.8% and 97.7%, respectively), followed by ET (96.8%) and LR (96.4%), whereas DNN was the least discriminative (95.3%). In terms of reliability, LR predictions were the least calibrated compared with the other four algorithms. In this study, despite increasing the training time, SMOTE was proven to further improve the model performance of LR, whereas other algorithms, especially GBT and DNN, worked well with class imbalanced data. CONCLUSIONS: Compared with other research in the clinical literature involving AI models using claims data to predict patient health outcomes, our models are more efficient because we use a smaller number of features but still achieve high performance. This study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit.
format Online
Article
Text
id pubmed-8050753
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-80507532021-04-22 Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study Tran, Linh Chi, Lianhua Bonti, Alessio Abdelrazek, Mohamed Chen, Yi-Ping Phoebe JMIR Med Inform Original Paper BACKGROUND: Cardiovascular disease (CVD) is the greatest health problem in Australia, which kills more people than any other disease and incurs enormous costs for the health care system. In this study, we present a benchmark comparison of various artificial intelligence (AI) architectures for predicting the mortality rate of patients with CVD using structured medical claims data. Compared with other research in the clinical literature, our models are more efficient because we use a smaller number of features, and this study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit. OBJECTIVE: This study aims to support health clinicians in accurately predicting mortality among patients with CVD using only claims data before a clinic visit. METHODS: The data set was obtained from the Medicare Benefits Scheme and Pharmaceutical Benefits Scheme service information in the period between 2004 and 2014, released by the Department of Health Australia in 2016. It included 346,201 records, corresponding to 346,201 patients. A total of five AI algorithms, including four classical machine learning algorithms (logistic regression [LR], random forest [RF], extra trees [ET], and gradient boosting trees [GBT]) and a deep learning algorithm, which is a densely connected neural network (DNN), were developed and compared in this study. In addition, because of the minority of deceased patients in the data set, a separate experiment using the Synthetic Minority Oversampling Technique (SMOTE) was conducted to enrich the data. RESULTS: Regarding model performance, in terms of discrimination, GBT and RF were the models with the highest area under the receiver operating characteristic curve (97.8% and 97.7%, respectively), followed by ET (96.8%) and LR (96.4%), whereas DNN was the least discriminative (95.3%). In terms of reliability, LR predictions were the least calibrated compared with the other four algorithms. In this study, despite increasing the training time, SMOTE was proven to further improve the model performance of LR, whereas other algorithms, especially GBT and DNN, worked well with class imbalanced data. CONCLUSIONS: Compared with other research in the clinical literature involving AI models using claims data to predict patient health outcomes, our models are more efficient because we use a smaller number of features but still achieve high performance. This study could help health professionals accurately choose AI models to predict mortality among patients with CVD using only claims data before a clinic visit. JMIR Publications 2021-04-01 /pmc/articles/PMC8050753/ /pubmed/33792549 http://dx.doi.org/10.2196/25000 Text en ©Linh Tran, Lianhua Chi, Alessio Bonti, Mohamed Abdelrazek, Yi-Ping Phoebe Chen. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 01.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Tran, Linh
Chi, Lianhua
Bonti, Alessio
Abdelrazek, Mohamed
Chen, Yi-Ping Phoebe
Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title_full Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title_fullStr Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title_full_unstemmed Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title_short Mortality Prediction of Patients With Cardiovascular Disease Using Medical Claims Data Under Artificial Intelligence Architectures: Validation Study
title_sort mortality prediction of patients with cardiovascular disease using medical claims data under artificial intelligence architectures: validation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8050753/
https://www.ncbi.nlm.nih.gov/pubmed/33792549
http://dx.doi.org/10.2196/25000
work_keys_str_mv AT tranlinh mortalitypredictionofpatientswithcardiovasculardiseaseusingmedicalclaimsdataunderartificialintelligencearchitecturesvalidationstudy
AT chilianhua mortalitypredictionofpatientswithcardiovasculardiseaseusingmedicalclaimsdataunderartificialintelligencearchitecturesvalidationstudy
AT bontialessio mortalitypredictionofpatientswithcardiovasculardiseaseusingmedicalclaimsdataunderartificialintelligencearchitecturesvalidationstudy
AT abdelrazekmohamed mortalitypredictionofpatientswithcardiovasculardiseaseusingmedicalclaimsdataunderartificialintelligencearchitecturesvalidationstudy
AT chenyipingphoebe mortalitypredictionofpatientswithcardiovasculardiseaseusingmedicalclaimsdataunderartificialintelligencearchitecturesvalidationstudy