Cargando…

Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study

BACKGROUND: Delirium is an acute neurocognitive disorder that affects up to half of older hospitalized medical patients and can lead to dementia, longer hospital stays, increased health costs, and death. Although delirium can be prevented and treated, it is difficult to identify and predict. OBJECTI...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Lu, Zhang, Yilun, Chignell, Mark, Shan, Baizun, Sheehan, Kathleen A, Razak, Fahad, Verma, Amol
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812273/
https://www.ncbi.nlm.nih.gov/pubmed/36538363
http://dx.doi.org/10.2196/38161
_version_ 1784863693398343680
author Wang, Lu
Zhang, Yilun
Chignell, Mark
Shan, Baizun
Sheehan, Kathleen A
Razak, Fahad
Verma, Amol
author_facet Wang, Lu
Zhang, Yilun
Chignell, Mark
Shan, Baizun
Sheehan, Kathleen A
Razak, Fahad
Verma, Amol
author_sort Wang, Lu
collection PubMed
description BACKGROUND: Delirium is an acute neurocognitive disorder that affects up to half of older hospitalized medical patients and can lead to dementia, longer hospital stays, increased health costs, and death. Although delirium can be prevented and treated, it is difficult to identify and predict. OBJECTIVE: This study aimed to improve machine learning models that retrospectively identify the presence of delirium during hospital stays (eg, to measure the effectiveness of delirium prevention interventions) by using the natural language processing (NLP) technique of sentiment analysis (in this case a feature that identifies sentiment toward, or away from, a delirium diagnosis). METHODS: Using data from the General Medicine Inpatient Initiative, a Canadian hospital data and analytics network, a detailed manual review of medical records was conducted from nearly 4000 admissions at 6 Toronto area hospitals. Furthermore, 25.74% (994/3862) of the eligible hospital admissions were labeled as having delirium. Using the data set collected from this study, we developed machine learning models with, and without, the benefit of NLP methods applied to diagnostic imaging reports, and we asked the question “can NLP improve machine learning identification of delirium?” RESULTS: Among the eligible 3862 hospital admissions, 994 (25.74%) admissions were labeled as having delirium. Identification and calibration of the models were satisfactory. The accuracy and area under the receiver operating characteristic curve of the main model with NLP in the independent testing data set were 0.807 and 0.930, respectively. The accuracy and area under the receiver operating characteristic curve of the main model without NLP in the independent testing data set were 0.811 and 0.869, respectively. Model performance was also found to be stable over the 5-year period used in the experiment, with identification for a likely future holdout test set being no worse than identification for retrospective holdout test sets. CONCLUSIONS: Our machine learning model that included NLP (ie, sentiment analysis in medical image description text mining) produced valid identification of delirium with the sentiment analysis, providing significant additional benefit over the model without NLP.
format Online
Article
Text
id pubmed-9812273
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-98122732023-01-05 Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study Wang, Lu Zhang, Yilun Chignell, Mark Shan, Baizun Sheehan, Kathleen A Razak, Fahad Verma, Amol JMIR Med Inform Original Paper BACKGROUND: Delirium is an acute neurocognitive disorder that affects up to half of older hospitalized medical patients and can lead to dementia, longer hospital stays, increased health costs, and death. Although delirium can be prevented and treated, it is difficult to identify and predict. OBJECTIVE: This study aimed to improve machine learning models that retrospectively identify the presence of delirium during hospital stays (eg, to measure the effectiveness of delirium prevention interventions) by using the natural language processing (NLP) technique of sentiment analysis (in this case a feature that identifies sentiment toward, or away from, a delirium diagnosis). METHODS: Using data from the General Medicine Inpatient Initiative, a Canadian hospital data and analytics network, a detailed manual review of medical records was conducted from nearly 4000 admissions at 6 Toronto area hospitals. Furthermore, 25.74% (994/3862) of the eligible hospital admissions were labeled as having delirium. Using the data set collected from this study, we developed machine learning models with, and without, the benefit of NLP methods applied to diagnostic imaging reports, and we asked the question “can NLP improve machine learning identification of delirium?” RESULTS: Among the eligible 3862 hospital admissions, 994 (25.74%) admissions were labeled as having delirium. Identification and calibration of the models were satisfactory. The accuracy and area under the receiver operating characteristic curve of the main model with NLP in the independent testing data set were 0.807 and 0.930, respectively. The accuracy and area under the receiver operating characteristic curve of the main model without NLP in the independent testing data set were 0.811 and 0.869, respectively. Model performance was also found to be stable over the 5-year period used in the experiment, with identification for a likely future holdout test set being no worse than identification for retrospective holdout test sets. CONCLUSIONS: Our machine learning model that included NLP (ie, sentiment analysis in medical image description text mining) produced valid identification of delirium with the sentiment analysis, providing significant additional benefit over the model without NLP. JMIR Publications 2022-12-20 /pmc/articles/PMC9812273/ /pubmed/36538363 http://dx.doi.org/10.2196/38161 Text en ©Lu Wang, Yilun Zhang, Mark Chignell, Baizun Shan, Kathleen A Sheehan, Fahad Razak, Amol Verma. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 20.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Wang, Lu
Zhang, Yilun
Chignell, Mark
Shan, Baizun
Sheehan, Kathleen A
Razak, Fahad
Verma, Amol
Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title_full Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title_fullStr Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title_full_unstemmed Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title_short Boosting Delirium Identification Accuracy With Sentiment-Based Natural Language Processing: Mixed Methods Study
title_sort boosting delirium identification accuracy with sentiment-based natural language processing: mixed methods study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9812273/
https://www.ncbi.nlm.nih.gov/pubmed/36538363
http://dx.doi.org/10.2196/38161
work_keys_str_mv AT wanglu boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT zhangyilun boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT chignellmark boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT shanbaizun boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT sheehankathleena boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT razakfahad boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy
AT vermaamol boostingdeliriumidentificationaccuracywithsentimentbasednaturallanguageprocessingmixedmethodsstudy