Cargando…

Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records

Aim: To use available electronic administrative records to identify data reliability, predict discharge destination, and identify risk factors associated with specific outcomes following hospital admission with stroke, compared to stroke specific clinical factors, using machine learning techniques....

Descripción completa

Detalles Bibliográficos
Autores principales: Rana, Santu, Luo, Wei, Tran, Truyen, Venkatesh, Svetha, Talman, Paul, Phan, Thanh, Phung, Dinh, Clissold, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8503552/
https://www.ncbi.nlm.nih.gov/pubmed/34646226
http://dx.doi.org/10.3389/fneur.2021.670379
_version_ 1784581149224337408
author Rana, Santu
Luo, Wei
Tran, Truyen
Venkatesh, Svetha
Talman, Paul
Phan, Thanh
Phung, Dinh
Clissold, Benjamin
author_facet Rana, Santu
Luo, Wei
Tran, Truyen
Venkatesh, Svetha
Talman, Paul
Phan, Thanh
Phung, Dinh
Clissold, Benjamin
author_sort Rana, Santu
collection PubMed
description Aim: To use available electronic administrative records to identify data reliability, predict discharge destination, and identify risk factors associated with specific outcomes following hospital admission with stroke, compared to stroke specific clinical factors, using machine learning techniques. Method: The study included 2,531 patients having at least one admission with a confirmed diagnosis of stroke, collected from a regional hospital in Australia within 2009–2013. Using machine learning (penalized regression with Lasso) techniques, patients having their index admission between June 2009 and July 2012 were used to derive predictive models, and patients having their index admission between July 2012 and June 2013 were used for validation. Three different stroke types [intracerebral hemorrhage (ICH), ischemic stroke, transient ischemic attack (TIA)] were considered and five different comparison outcome settings were considered. Our electronic administrative record based predictive model was compared with a predictive model composed of “baseline” clinical features, more specific for stroke, such as age, gender, smoking habits, co-morbidities (high cholesterol, hypertension, atrial fibrillation, and ischemic heart disease), types of imaging done (CT scan, MRI, etc.), and occurrence of in-hospital pneumonia. Risk factors associated with likelihood of negative outcomes were identified. Results: The data was highly reliable at predicting discharge to rehabilitation and all other outcomes vs. death for ICH (AUC 0.85 and 0.825, respectively), all discharge outcomes except home vs. rehabilitation for ischemic stroke, and discharge home vs. others and home vs. rehabilitation for TIA (AUC 0.948 and 0.873, respectively). Electronic health record data appeared to provide improved prediction of outcomes over stroke specific clinical factors from the machine learning models. Common risk factors associated with a negative impact on expected outcomes appeared clinically intuitive, and included older age groups, prior ventilatory support, urinary incontinence, need for imaging, and need for allied health input. Conclusion: Electronic administrative records from this cohort produced reliable outcome prediction and identified clinically appropriate factors negatively impacting most outcome variables following hospital admission with stroke. This presents a means of future identification of modifiable factors associated with patient discharge destination. This may potentially aid in patient selection for certain interventions and aid in better patient and clinician education regarding expected discharge outcomes.
format Online
Article
Text
id pubmed-8503552
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85035522021-10-12 Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records Rana, Santu Luo, Wei Tran, Truyen Venkatesh, Svetha Talman, Paul Phan, Thanh Phung, Dinh Clissold, Benjamin Front Neurol Neurology Aim: To use available electronic administrative records to identify data reliability, predict discharge destination, and identify risk factors associated with specific outcomes following hospital admission with stroke, compared to stroke specific clinical factors, using machine learning techniques. Method: The study included 2,531 patients having at least one admission with a confirmed diagnosis of stroke, collected from a regional hospital in Australia within 2009–2013. Using machine learning (penalized regression with Lasso) techniques, patients having their index admission between June 2009 and July 2012 were used to derive predictive models, and patients having their index admission between July 2012 and June 2013 were used for validation. Three different stroke types [intracerebral hemorrhage (ICH), ischemic stroke, transient ischemic attack (TIA)] were considered and five different comparison outcome settings were considered. Our electronic administrative record based predictive model was compared with a predictive model composed of “baseline” clinical features, more specific for stroke, such as age, gender, smoking habits, co-morbidities (high cholesterol, hypertension, atrial fibrillation, and ischemic heart disease), types of imaging done (CT scan, MRI, etc.), and occurrence of in-hospital pneumonia. Risk factors associated with likelihood of negative outcomes were identified. Results: The data was highly reliable at predicting discharge to rehabilitation and all other outcomes vs. death for ICH (AUC 0.85 and 0.825, respectively), all discharge outcomes except home vs. rehabilitation for ischemic stroke, and discharge home vs. others and home vs. rehabilitation for TIA (AUC 0.948 and 0.873, respectively). Electronic health record data appeared to provide improved prediction of outcomes over stroke specific clinical factors from the machine learning models. Common risk factors associated with a negative impact on expected outcomes appeared clinically intuitive, and included older age groups, prior ventilatory support, urinary incontinence, need for imaging, and need for allied health input. Conclusion: Electronic administrative records from this cohort produced reliable outcome prediction and identified clinically appropriate factors negatively impacting most outcome variables following hospital admission with stroke. This presents a means of future identification of modifiable factors associated with patient discharge destination. This may potentially aid in patient selection for certain interventions and aid in better patient and clinician education regarding expected discharge outcomes. Frontiers Media S.A. 2021-09-27 /pmc/articles/PMC8503552/ /pubmed/34646226 http://dx.doi.org/10.3389/fneur.2021.670379 Text en Copyright © 2021 Rana, Luo, Tran, Venkatesh, Talman, Phan, Phung and Clissold. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neurology
Rana, Santu
Luo, Wei
Tran, Truyen
Venkatesh, Svetha
Talman, Paul
Phan, Thanh
Phung, Dinh
Clissold, Benjamin
Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title_full Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title_fullStr Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title_full_unstemmed Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title_short Application of Machine Learning Techniques to Identify Data Reliability and Factors Affecting Outcome After Stroke Using Electronic Administrative Records
title_sort application of machine learning techniques to identify data reliability and factors affecting outcome after stroke using electronic administrative records
topic Neurology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8503552/
https://www.ncbi.nlm.nih.gov/pubmed/34646226
http://dx.doi.org/10.3389/fneur.2021.670379
work_keys_str_mv AT ranasantu applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT luowei applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT trantruyen applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT venkateshsvetha applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT talmanpaul applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT phanthanh applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT phungdinh applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords
AT clissoldbenjamin applicationofmachinelearningtechniquestoidentifydatareliabilityandfactorsaffectingoutcomeafterstrokeusingelectronicadministrativerecords