Cargando…
Categorising patient concerns using natural language processing techniques
OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-mo...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246286/ https://www.ncbi.nlm.nih.gov/pubmed/34193519 http://dx.doi.org/10.1136/bmjhci-2020-100274 |
_version_ | 1783716282903822336 |
---|---|
author | Fairie, Paul Zhang, Zilong D'Souza, Adam G Walsh, Tara Quan, Hude Santana, Maria J |
author_facet | Fairie, Paul Zhang, Zilong D'Souza, Adam G Walsh, Tara Quan, Hude Santana, Maria J |
author_sort | Fairie, Paul |
collection | PubMed |
description | OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback. METHODS: Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation. RESULTS: The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings). DISCUSSION: LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action. CONCLUSION: Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns. |
format | Online Article Text |
id | pubmed-8246286 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-82462862021-07-13 Categorising patient concerns using natural language processing techniques Fairie, Paul Zhang, Zilong D'Souza, Adam G Walsh, Tara Quan, Hude Santana, Maria J BMJ Health Care Inform Original Research OBJECTIVES: Patient feedback is critical to identify and resolve patient safety and experience issues in healthcare systems. However, large volumes of unstructured text data can pose problems for manual (human) analysis. This study reports the results of using a semiautomated, computational topic-modelling approach to analyse a corpus of patient feedback. METHODS: Patient concerns were received by Alberta Health Services between 2011 and 2018 (n=76 163), regarding 806 care facilities in 163 municipalities, including hospitals, clinics, community care centres and retirement homes, in a province of 4.4 million. Their existing framework requires manual labelling of pre-defined categories. We applied an automated latent Dirichlet allocation (LDA)-based topic modelling algorithm to identify the topics present in these concerns, and thereby produce a framework-free categorisation. RESULTS: The LDA model produced 40 topics which, following manual interpretation by researchers, were reduced to 28 coherent topics. The most frequent topics identified were communication issues causing delays (frequency: 10.58%), community care for elderly patients (8.82%), interactions with nurses (8.80%) and emergency department care (7.52%). Many patient concerns were categorised into multiple topics. Some were more specific versions of categories from the existing framework (eg, communication issues causing delays), while others were novel (eg, smoking in inappropriate settings). DISCUSSION: LDA-generated topics were more nuanced than the manually labelled categories. For example, LDA found that concerns with community care were related to concerns about nursing for seniors, providing opportunities for insight and action. CONCLUSION: Our findings outline the range of concerns patients share in a large health system and demonstrate the usefulness of using LDA to identify categories of patient concerns. BMJ Publishing Group 2021-06-30 /pmc/articles/PMC8246286/ /pubmed/34193519 http://dx.doi.org/10.1136/bmjhci-2020-100274 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) . |
spellingShingle | Original Research Fairie, Paul Zhang, Zilong D'Souza, Adam G Walsh, Tara Quan, Hude Santana, Maria J Categorising patient concerns using natural language processing techniques |
title | Categorising patient concerns using natural language processing techniques |
title_full | Categorising patient concerns using natural language processing techniques |
title_fullStr | Categorising patient concerns using natural language processing techniques |
title_full_unstemmed | Categorising patient concerns using natural language processing techniques |
title_short | Categorising patient concerns using natural language processing techniques |
title_sort | categorising patient concerns using natural language processing techniques |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8246286/ https://www.ncbi.nlm.nih.gov/pubmed/34193519 http://dx.doi.org/10.1136/bmjhci-2020-100274 |
work_keys_str_mv | AT fairiepaul categorisingpatientconcernsusingnaturallanguageprocessingtechniques AT zhangzilong categorisingpatientconcernsusingnaturallanguageprocessingtechniques AT dsouzaadamg categorisingpatientconcernsusingnaturallanguageprocessingtechniques AT walshtara categorisingpatientconcernsusingnaturallanguageprocessingtechniques AT quanhude categorisingpatientconcernsusingnaturallanguageprocessingtechniques AT santanamariaj categorisingpatientconcernsusingnaturallanguageprocessingtechniques |