Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention
IMPORTANCE: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. OBJECTIVE: To determine whether...
Main authors: | Mortazavi, Bobak J.; Bucholz, Emily M.; Desai, Nihar R.; Huang, Chenxi; Curtis, Jeptha P.; Masoudi, Frederick A.; Shaw, Richard E.; Negahban, Sahand N.; Krumholz, Harlan M. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | American Medical Association 2019 |
Subjects: | Original Investigation |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624806/ https://www.ncbi.nlm.nih.gov/pubmed/31290991 http://dx.doi.org/10.1001/jamanetworkopen.2019.6835 |
_version_ | 1783434293426520064 |
---|---|
author | Mortazavi, Bobak J. Bucholz, Emily M. Desai, Nihar R. Huang, Chenxi Curtis, Jeptha P. Masoudi, Frederick A. Shaw, Richard E. Negahban, Sahand N. Krumholz, Harlan M. |
author_sort | Mortazavi, Bobak J. |
collection | PubMed |
description | IMPORTANCE: Better prediction of major bleeding after percutaneous coronary intervention (PCI) may improve clinical decisions aimed to reduce bleeding risk. Machine learning techniques, bolstered by better selection of variables, hold promise for enhancing prediction. OBJECTIVE: To determine whether machine learning techniques better predict post-PCI major bleeding compared with the existing National Cardiovascular Data Registry (NCDR) models. DESIGN, SETTING, AND PARTICIPANTS: This comparative effectiveness study used the NCDR CathPCI Registry data version 4.4 (July 1, 2009, to April 1, 2015). Machine learning techniques were used (logistic regression with lasso regularization and gradient descent boosting [XGBoost, version 0.71.2]), and their output was then compared with the existing simplified risk score and full NCDR models. The existing models were recreated, and then performance was evaluated through additional techniques and variables in a 5-fold cross-validation in analysis conducted from October 1, 2015, to October 27, 2017. The setting was retrospective modeling of a nationwide clinical registry of PCI. Participants were all patients undergoing PCI. Percutaneous coronary intervention procedures were excluded if they were not the index PCI of admission, if the hospital site had missing outcomes measures, or if the patient underwent subsequent coronary artery bypass grafting. EXPOSURES: Clinical variables available at admission and diagnostic coronary angiography data were used to determine the severity and complexity of presentation. MAIN OUTCOMES AND MEASURES: The main outcome was in-hospital major bleeding within 72 hours after PCI. Results were evaluated by comparing C statistics, calibration, and decision threshold–based metrics, including the F score (harmonic mean of positive predictive value and sensitivity) and the false discovery rate. RESULTS: The post-PCI major bleeding rate among 3 316 465 procedures (patients’ median age, 65 years; interquartile range, 56-73 years; 68.1% male) was 4.5%. The existing full model achieved a mean C statistic of 0.78 (95% CI, 0.78-0.78). The use of XGBoost and full range of selected variables achieved a C statistic of 0.82 (95% CI, 0.82-0.82), with an F score of 0.31 (95% CI, 0.30-0.31). XGBoost correctly identified an additional 3.7% of cases identified as high risk who experienced a bleeding event and an overall improvement of 1.0% of cases identified as low risk who did not experience a bleeding event. The data-driven decision threshold helped improve the false discovery rate of the existing techniques. For the existing simplified risk score model, the false discovery rate improved from more than 90% to 78.7%. Modifying the model and the data decision threshold improved this rate from 78.7% to 73.4%. CONCLUSIONS AND RELEVANCE: Machine learning techniques improved the prediction of major bleeding after PCI. These techniques may help to better identify patients who would benefit most from strategies to reduce bleeding risk. |
format | Online Article Text |
id | pubmed-6624806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | American Medical Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-6624806 2019-07-28 JAMA Netw Open Original Investigation American Medical Association 2019-07-10 /pmc/articles/PMC6624806/ /pubmed/31290991 http://dx.doi.org/10.1001/jamanetworkopen.2019.6835 Text en Copyright 2019 Mortazavi BJ et al. JAMA Network Open. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the CC-BY License. |
title | Comparison of Machine Learning Methods With National Cardiovascular Data Registry Models for Prediction of Risk of Bleeding After Percutaneous Coronary Intervention |
topic | Original Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6624806/ https://www.ncbi.nlm.nih.gov/pubmed/31290991 http://dx.doi.org/10.1001/jamanetworkopen.2019.6835 |
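
The abstract in this record evaluates models by the C statistic and by decision threshold–based metrics: the F score (harmonic mean of positive predictive value and sensitivity) and the false discovery rate. As a rough illustration of how those quantities relate, the following minimal Python sketch computes them from labels and predicted risks. It is not the study's code: the function names, the toy labels and scores, and the 0.5 threshold are all hypothetical, and the C statistic is computed by the plain pairwise-concordance definition rather than by whatever tooling the authors used.

```python
from typing import Dict, Sequence


def threshold_metrics(
    y_true: Sequence[int], y_score: Sequence[float], threshold: float
) -> Dict[str, float]:
    """PPV, sensitivity, F score, and false discovery rate at one threshold."""
    tp = fp = fn = 0
    for label, score in zip(y_true, y_score):
        flagged = score >= threshold  # flagged as high risk
        if flagged and label == 1:
            tp += 1
        elif flagged and label == 0:
            fp += 1
        elif not flagged and label == 1:
            fn += 1
    ppv = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # F score: harmonic mean of PPV and sensitivity.
    f_score = (
        2 * ppv * sensitivity / (ppv + sensitivity) if (ppv + sensitivity) else 0.0
    )
    # False discovery rate: share of flagged cases without an event (1 - PPV).
    fdr = fp / (tp + fp) if (tp + fp) else 0.0
    return {"ppv": ppv, "sensitivity": sensitivity, "f_score": f_score, "fdr": fdr}


def c_statistic(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """Probability that a random event case outranks a random non-event case.

    Ties count as half. This is O(n_pos * n_neg), fine for a toy example only.
    """
    pos = [s for label, s in zip(y_true, y_score) if label == 1]
    neg = [s for label, s in zip(y_true, y_score) if label == 0]
    if not pos or not neg:
        raise ValueError("need at least one event and one non-event")
    concordant = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                concordant += 1.0
            elif p == n:
                concordant += 0.5
    return concordant / (len(pos) * len(neg))


# Hypothetical toy data: 1 = bleeding event, 0 = no event.
toy_labels = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
toy_scores = [0.9, 0.6, 0.2, 0.7, 0.4, 0.3, 0.3, 0.1, 0.1, 0.05]

metrics = threshold_metrics(toy_labels, toy_scores, threshold=0.5)
auc = c_statistic(toy_labels, toy_scores)
print(metrics)
print(round(auc, 4))
```

Because the false discovery rate is one minus the positive predictive value, a false discovery rate of 73.4% as reported in the abstract corresponds to a positive predictive value of 26.6%. With a rare outcome (4.5% here), moving the decision threshold trades sensitivity against false discovery rate, which is why the abstract reports threshold-based metrics alongside the C statistic.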