Cargando…

Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing

Background and Purpose: This study aims to determine whether machine learning (ML) and natural language processing (NLP) from electronic health records (EHR) improve the prediction of 30-day readmission after stroke. Methods: Among index stroke admissions between 2011 and 2016 at an academic medical...

Descripción completa

Detalles Bibliográficos
Autores principales: Lineback, Christina M., Garg, Ravi, Oh, Elissa, Naidech, Andrew M., Holl, Jane L., Prabhakaran, Shyam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315788/
https://www.ncbi.nlm.nih.gov/pubmed/34326805
http://dx.doi.org/10.3389/fneur.2021.649521
_version_ 1783729773491519488
author Lineback, Christina M.
Garg, Ravi
Oh, Elissa
Naidech, Andrew M.
Holl, Jane L.
Prabhakaran, Shyam
author_facet Lineback, Christina M.
Garg, Ravi
Oh, Elissa
Naidech, Andrew M.
Holl, Jane L.
Prabhakaran, Shyam
author_sort Lineback, Christina M.
collection PubMed
description Background and Purpose: This study aims to determine whether machine learning (ML) and natural language processing (NLP) from electronic health records (EHR) improve the prediction of 30-day readmission after stroke. Methods: Among index stroke admissions between 2011 and 2016 at an academic medical center, we abstracted discrete data from the EHR on demographics, risk factors, medications, hospital complications, and discharge destination and unstructured textual data from clinician notes. Readmission was defined as any unplanned hospital admission within 30 days of discharge. We developed models to predict two separate outcomes, as follows: (1) 30-day all-cause readmission and (2) 30-day stroke readmission. We compared the performance of logistic regression with advanced ML algorithms. We used several NLP methods to generate additional features from unstructured textual reports. We evaluated the performance of prediction models using a five-fold validation and tested the best model in a held-out test dataset. Areas under the curve (AUCs) were used to compare discrimination of each model. Results: In a held-out test dataset, advanced ML methods along with NLP features out performed logistic regression for all-cause readmission (AUC, 0.64 vs. 0.58; p < 0.001) and stroke readmission prediction (AUC, 0.62 vs. 0.52; p < 0.001). Conclusion: NLP-enhanced machine learning models potentially advance our ability to predict readmission after stroke. However, further improvement is necessary before being implemented in clinical practice given the weak discrimination.
format Online
Article
Text
id pubmed-8315788
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83157882021-07-28 Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing Lineback, Christina M. Garg, Ravi Oh, Elissa Naidech, Andrew M. Holl, Jane L. Prabhakaran, Shyam Front Neurol Neurology Background and Purpose: This study aims to determine whether machine learning (ML) and natural language processing (NLP) from electronic health records (EHR) improve the prediction of 30-day readmission after stroke. Methods: Among index stroke admissions between 2011 and 2016 at an academic medical center, we abstracted discrete data from the EHR on demographics, risk factors, medications, hospital complications, and discharge destination and unstructured textual data from clinician notes. Readmission was defined as any unplanned hospital admission within 30 days of discharge. We developed models to predict two separate outcomes, as follows: (1) 30-day all-cause readmission and (2) 30-day stroke readmission. We compared the performance of logistic regression with advanced ML algorithms. We used several NLP methods to generate additional features from unstructured textual reports. We evaluated the performance of prediction models using a five-fold validation and tested the best model in a held-out test dataset. Areas under the curve (AUCs) were used to compare discrimination of each model. Results: In a held-out test dataset, advanced ML methods along with NLP features out performed logistic regression for all-cause readmission (AUC, 0.64 vs. 0.58; p < 0.001) and stroke readmission prediction (AUC, 0.62 vs. 0.52; p < 0.001). Conclusion: NLP-enhanced machine learning models potentially advance our ability to predict readmission after stroke. However, further improvement is necessary before being implemented in clinical practice given the weak discrimination. Frontiers Media S.A. 2021-07-13 /pmc/articles/PMC8315788/ /pubmed/34326805 http://dx.doi.org/10.3389/fneur.2021.649521 Text en Copyright © 2021 Lineback, Garg, Oh, Naidech, Holl and Prabhakaran. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Lineback, Christina M.
Garg, Ravi
Oh, Elissa
Naidech, Andrew M.
Holl, Jane L.
Prabhakaran, Shyam
Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title_full Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title_fullStr Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title_full_unstemmed Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title_short Prediction of 30-Day Readmission After Stroke Using Machine Learning and Natural Language Processing
title_sort prediction of 30-day readmission after stroke using machine learning and natural language processing
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8315788/
https://www.ncbi.nlm.nih.gov/pubmed/34326805
http://dx.doi.org/10.3389/fneur.2021.649521
work_keys_str_mv AT linebackchristinam predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing
AT gargravi predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing
AT ohelissa predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing
AT naidechandrewm predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing
AT holljanel predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing
AT prabhakaranshyam predictionof30dayreadmissionafterstrokeusingmachinelearningandnaturallanguageprocessing