Cargando…

Categorising patient concerns using natural language processing techniques

OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Fairie, Paul, Zhang, Zilong, D'Souza, Adam G, Walsh, Tara, Quan, Hude, Santana, Maria J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246286/
https://www.ncbi.nlm.nih.gov/pubmed/34193519
http://dx.doi.org/10.1136/bmjhci-2020-100274
_version_ 1783716282903822336
author Fairie, Paul
Zhang, Zilong
D'Souza, Adam G
Walsh, Tara
Quan, Hude
Santana, Maria J
author_facet Fairie, Paul
Zhang, Zilong
D'Souza, Adam G
Walsh, Tara
Quan, Hude
Santana, Maria J
author_sort Fairie, Paul
collection PubMed
description OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback. METHODS: Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation. RESULTS: The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings). DISCUSSION: LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action. CONCLUSION: Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns.
format Online
Article
Text
id pubmed-8246286
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-82462862021-07-13 Categorising patient concerns using natural language processing techniques Fairie, Paul Zhang, Zilong D'Souza, Adam G Walsh, Tara Quan, Hude Santana, Maria J BMJ Health Care Inform Original Research OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback. METHODS: Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation. RESULTS: The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings). DISCUSSION: LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action. CONCLUSION: Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns. BMJ Publishing Group 2021-06-30 /pmc/articles/PMC8246286/ /pubmed/34193519 http://dx.doi.org/10.1136/bmjhci-2020-100274 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research
Fairie, Paul
Zhang, Zilong
D'Souza, Adam G
Walsh, Tara
Quan, Hude
Santana, Maria J
Categorising patient concerns using natural language processing techniques
title Categorising patient concerns using natural language processing techniques
title_full Categorising patient concerns using natural language processing techniques
title_fullStr Categorising patient concerns using natural language processing techniques
title_full_unstemmed Categorising patient concerns using natural language processing techniques
title_short Categorising patient concerns using natural language processing techniques
title_sort categorising patient concerns using natural language processing techniques
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246286/
https://www.ncbi.nlm.nih.gov/pubmed/34193519
http://dx.doi.org/10.1136/bmjhci-2020-100274
work_keys_str_mv AT fairiepaul categorisingpatientconcernsusingnaturallanguageprocessingtechniques
AT zhangzilong categorisingpatientconcernsusingnaturallanguageprocessingtechniques
AT dsouzaadamg categorisingpatientconcernsusingnaturallanguageprocessingtechniques
AT walshtara categorisingpatientconcernsusingnaturallanguageprocessingtechniques
AT quanhude categorisingpatientconcernsusingnaturallanguageprocessingtechniques
AT santanamariaj categorisingpatientconcernsusingnaturallanguageprocessingtechniques