Cargando…

Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations

OBJECTIVE: This study aimed to develop and validate a claims-based, machine learning algorithm to predict clinical outcomes across both medical and surgical patient populations. METHODS: This retrospective, observational cohort study, used a random 5% sample of 770,777 fee-for-service Medicare benef...

Descripción completa

Detalles Bibliográficos
Autores principales:	MacKay, Emily J., Stubna, Michael D., Chivers, Corey, Draugelis, Michael E., Hanson, William J., Desai, Nimesh D., Groeneveld, Peter W.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8174683/ https://www.ncbi.nlm.nih.gov/pubmed/34081720 http://dx.doi.org/10.1371/journal.pone.0252585

_version_	1783702951702822912
author	MacKay, Emily J. Stubna, Michael D. Chivers, Corey Draugelis, Michael E. Hanson, William J. Desai, Nimesh D. Groeneveld, Peter W.
author_facet	MacKay, Emily J. Stubna, Michael D. Chivers, Corey Draugelis, Michael E. Hanson, William J. Desai, Nimesh D. Groeneveld, Peter W.
author_sort	MacKay, Emily J.
collection	PubMed
description	OBJECTIVE: This study aimed to develop and validate a claims-based, machine learning algorithm to predict clinical outcomes across both medical and surgical patient populations. METHODS: This retrospective, observational cohort study, used a random 5% sample of 770,777 fee-for-service Medicare beneficiaries with an inpatient hospitalization between 2009–2011. The machine learning algorithms tested included: support vector machine, random forest, multilayer perceptron, extreme gradient boosted tree, and logistic regression. The extreme gradient boosted tree algorithm outperformed the alternatives and was the machine learning method used for the final risk model. Primary outcome was 30-day mortality. Secondary outcomes were: rehospitalization, and any of 23 adverse clinical events occurring within 30 days of the index admission date. RESULTS: The machine learning algorithm performance was evaluated by both the area under the receiver operating curve (AUROC) and Brier Score. The risk model demonstrated high performance for prediction of: 30-day mortality (AUROC = 0.88; Brier Score = 0.06), and 17 of the 23 adverse events (AUROC range: 0.80–0.86; Brier Score range: 0.01–0.05). The risk model demonstrated moderate performance for prediction of: rehospitalization within 30 days (AUROC = 0.73; Brier Score: = 0.07) and six of the 23 adverse events (AUROC range: 0.74–0.79; Brier Score range: 0.01–0.02). The machine learning risk model performed comparably on a second, independent validation dataset, confirming that the risk model was not overfit. CONCLUSIONS AND RELEVANCE: We have developed and validated a robust, claims-based, machine learning risk model that is applicable to both medical and surgical patient populations and demonstrates comparable predictive accuracy to existing risk models.
format	Online Article Text
id	pubmed-8174683
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-81746832021-06-14 Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations MacKay, Emily J. Stubna, Michael D. Chivers, Corey Draugelis, Michael E. Hanson, William J. Desai, Nimesh D. Groeneveld, Peter W. PLoS One Research Article OBJECTIVE: This study aimed to develop and validate a claims-based, machine learning algorithm to predict clinical outcomes across both medical and surgical patient populations. METHODS: This retrospective, observational cohort study, used a random 5% sample of 770,777 fee-for-service Medicare beneficiaries with an inpatient hospitalization between 2009–2011. The machine learning algorithms tested included: support vector machine, random forest, multilayer perceptron, extreme gradient boosted tree, and logistic regression. The extreme gradient boosted tree algorithm outperformed the alternatives and was the machine learning method used for the final risk model. Primary outcome was 30-day mortality. Secondary outcomes were: rehospitalization, and any of 23 adverse clinical events occurring within 30 days of the index admission date. RESULTS: The machine learning algorithm performance was evaluated by both the area under the receiver operating curve (AUROC) and Brier Score. The risk model demonstrated high performance for prediction of: 30-day mortality (AUROC = 0.88; Brier Score = 0.06), and 17 of the 23 adverse events (AUROC range: 0.80–0.86; Brier Score range: 0.01–0.05). The risk model demonstrated moderate performance for prediction of: rehospitalization within 30 days (AUROC = 0.73; Brier Score: = 0.07) and six of the 23 adverse events (AUROC range: 0.74–0.79; Brier Score range: 0.01–0.02). The machine learning risk model performed comparably on a second, independent validation dataset, confirming that the risk model was not overfit. CONCLUSIONS AND RELEVANCE: We have developed and validated a robust, claims-based, machine learning risk model that is applicable to both medical and surgical patient populations and demonstrates comparable predictive accuracy to existing risk models. Public Library of Science 2021-06-03 /pmc/articles/PMC8174683/ /pubmed/34081720 http://dx.doi.org/10.1371/journal.pone.0252585 Text en © 2021 MacKay et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article MacKay, Emily J. Stubna, Michael D. Chivers, Corey Draugelis, Michael E. Hanson, William J. Desai, Nimesh D. Groeneveld, Peter W. Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title	Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title_full	Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title_fullStr	Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title_full_unstemmed	Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title_short	Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
title_sort	application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8174683/ https://www.ncbi.nlm.nih.gov/pubmed/34081720 http://dx.doi.org/10.1371/journal.pone.0252585
work_keys_str_mv	AT mackayemilyj applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT stubnamichaeld applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT chiverscorey applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT draugelismichaele applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT hansonwilliamj applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT desainimeshd applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations AT groeneveldpeterw applicationofmachinelearningapproachestoadministrativeclaimsdatatopredictclinicaloutcomesinmedicalandsurgicalpatientpopulations

Application of machine learning approaches to administrative claims data to predict clinical outcomes in medical and surgical patient populations

Ejemplares similares