Cargando…

Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention

IMPORTANCE: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. OBJECTIVE: To determine whether...

Descripción completa

Detalles Bibliográficos
Autores principales: Mortazavi, Bobak J., Bucholz, Emily M., Desai, Nihar R., Huang, Chenxi, Curtis, Jeptha P., Masoudi, Frederick A., Shaw, Richard E., Negahban, Sahand N., Krumholz, Harlan M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624806/
https://www.ncbi.nlm.nih.gov/pubmed/31290991
http://dx.doi.org/10.1001/jamanetworkopen.2019.6835
_version_ 1783434293426520064
author Mortazavi, Bobak J.
Bucholz, Emily M.
Desai, Nihar R.
Huang, Chenxi
Curtis, Jeptha P.
Masoudi, Frederick A.
Shaw, Richard E.
Negahban, Sahand N.
Krumholz, Harlan M.
author_facet Mortazavi, Bobak J.
Bucholz, Emily M.
Desai, Nihar R.
Huang, Chenxi
Curtis, Jeptha P.
Masoudi, Frederick A.
Shaw, Richard E.
Negahban, Sahand N.
Krumholz, Harlan M.
author_sort Mortazavi, Bobak J.
collection PubMed
description IMPORTANCE: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. OBJECTIVE: To determine whether machine learning techniques better predict post-PCI major bleeding compared with the existing National Cardiovascular Data Registry (NCDR) models. DESIGN, SETTING, AND PARTICIPANTS: This comparative effectiveness study used the NCDR CathPCI Registry data version 4.4 (July 1, 2009, to April 1, 2015), machine learning techniques were used (logistic regression with lasso regularization and gradient descent boosting [XGBoost, version 0.71.2]), and output was then compared with the existing simplified risk score and full NCDR models. The existing models were recreated, and then performance was evaluated through additional techniques and variables in a 5-fold cross-validation in analysis conducted from October 1, 2015, to October 27, 2017. The setting was retrospective modeling of a nationwide clinical registry of PCI. Participants were all patients undergoing PCI. Percutaneous coronary intervention procedures were excluded if they were not the index PCI of admission, if the hospital site had missing outcomes measures, or if the patient underwent subsequent coronary artery bypass grafting. EXPOSURES: Clinical variables available at admission and diagnostic coronary angiography data were used to determine the severity and complexity of presentation. MAIN OUTCOMES AND MEASURES: The main outcome was in-hospital major bleeding within 72 hours after PCI. Results were evaluated by comparing C statistics, calibration, and decision threshold–based metrics, including the F score (harmonic mean of positive predictive value and sensitivity) and the false discovery rate. RESULTS: The post-PCI major bleeding rate among 3 316 465 procedures (patients’ median age, 65 years; interquartile range, 56-73 years; 68.1% male) was 4.5%. The existing full model achieved a mean C statistic of 0.78 (95% CI, 0.78-0.78). The use of XGBoost and full range of selected variables achieved a C statistic of 0.82 (95% CI, 0.82-0.82), with an F score of 0.31 (95% CI, 0.30-0.31). XGBoost correctly identified an additional 3.7% of cases identified as high risk who experienced a bleeding event and an overall improvement of 1.0% of cases identified as low risk who did not experience a bleeding event. The data-driven decision threshold helped improve the false discovery rate of the existing techniques. The existing simplified risk score model improved the false discovery rate from more than 90% to 78.7%. Modifying the model and the data decision threshold improved this rate from 78.7% to 73.4%. CONCLUSIONS AND RELEVANCE: Machine learning techniques improved the prediction of major bleeding after PCI. These techniques may help to better identify patients who would benefit most from strategies to reduce bleeding risk.
format Online
Article
Text
id pubmed-6624806
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-66248062019-07-28 Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention Mortazavi, Bobak J. Bucholz, Emily M. Desai, Nihar R. Huang, Chenxi Curtis, Jeptha P. Masoudi, Frederick A. Shaw, Richard E. Negahban, Sahand N. Krumholz, Harlan M. JAMA Netw Open Original Investigation IMPORTANCE: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. OBJECTIVE: To determine whether machine learning techniques better predict post-PCI major bleeding compared with the existing National Cardiovascular Data Registry (NCDR) models. DESIGN, SETTING, AND PARTICIPANTS: This comparative effectiveness study used the NCDR CathPCI Registry data version 4.4 (July 1, 2009, to April 1, 2015), machine learning techniques were used (logistic regression with lasso regularization and gradient descent boosting [XGBoost, version 0.71.2]), and output was then compared with the existing simplified risk score and full NCDR models. The existing models were recreated, and then performance was evaluated through additional techniques and variables in a 5-fold cross-validation in analysis conducted from October 1, 2015, to October 27, 2017. The setting was retrospective modeling of a nationwide clinical registry of PCI. Participants were all patients undergoing PCI. Percutaneous coronary intervention procedures were excluded if they were not the index PCI of admission, if the hospital site had missing outcomes measures, or if the patient underwent subsequent coronary artery bypass grafting. EXPOSURES: Clinical variables available at admission and diagnostic coronary angiography data were used to determine the severity and complexity of presentation. MAIN OUTCOMES AND MEASURES: The main outcome was in-hospital major bleeding within 72 hours after PCI. Results were evaluated by comparing C statistics, calibration, and decision threshold–based metrics, including the F score (harmonic mean of positive predictive value and sensitivity) and the false discovery rate. RESULTS: The post-PCI major bleeding rate among 3 316 465 procedures (patients’ median age, 65 years; interquartile range, 56-73 years; 68.1% male) was 4.5%. The existing full model achieved a mean C statistic of 0.78 (95% CI, 0.78-0.78). The use of XGBoost and full range of selected variables achieved a C statistic of 0.82 (95% CI, 0.82-0.82), with an F score of 0.31 (95% CI, 0.30-0.31). XGBoost correctly identified an additional 3.7% of cases identified as high risk who experienced a bleeding event and an overall improvement of 1.0% of cases identified as low risk who did not experience a bleeding event. The data-driven decision threshold helped improve the false discovery rate of the existing techniques. The existing simplified risk score model improved the false discovery rate from more than 90% to 78.7%. Modifying the model and the data decision threshold improved this rate from 78.7% to 73.4%. CONCLUSIONS AND RELEVANCE: Machine learning techniques improved the prediction of major bleeding after PCI. These techniques may help to better identify patients who would benefit most from strategies to reduce bleeding risk. American Medical Association 2019-07-10 /pmc/articles/PMC6624806/ /pubmed/31290991 http://dx.doi.org/10.1001/jamanetworkopen.2019.6835 Text en Copyright 2019 Mortazavi BJ et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Mortazavi, Bobak J.
Bucholz, Emily M.
Desai, Nihar R.
Huang, Chenxi
Curtis, Jeptha P.
Masoudi, Frederick A.
Shaw, Richard E.
Negahban, Sahand N.
Krumholz, Harlan M.
Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title_full Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title_fullStr Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title_full_unstemmed Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title_short Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
title_sort comparison of machine learning methods with national cardiovascular data registry models for prediction of risk of bleeding after percutaneous coronary intervention
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624806/
https://www.ncbi.nlm.nih.gov/pubmed/31290991
http://dx.doi.org/10.1001/jamanetworkopen.2019.6835
work_keys_str_mv AT mortazavibobakj comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT bucholzemilym comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT desainiharr comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT huangchenxi comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT curtisjepthap comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT masoudifredericka comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT shawricharde comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT negahbansahandn comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention
AT krumholzharlanm comparisonofmachinelearningmethodswithnationalcardiovasculardataregistrymodelsforpredictionofriskofbleedingafterpercutaneouscoronaryintervention