Cargando…
Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study
BACKGROUND: Obtaining patient feedback is an essential mechanism for health care service providers to assess their quality and effectiveness. Unlike assessments of clinical outcomes, feedback from patients offers insights into their lived experiences. The Department of Health and Social Care in Engl...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9039814/ https://www.ncbi.nlm.nih.gov/pubmed/35404254 http://dx.doi.org/10.2196/29385 |
_version_ | 1784694211826679808 |
---|---|
author | Alexander, George Bahja, Mohammed Butt, Gibran Farook |
author_facet | Alexander, George Bahja, Mohammed Butt, Gibran Farook |
author_sort | Alexander, George |
collection | PubMed |
description | BACKGROUND: Obtaining patient feedback is an essential mechanism for health care service providers to assess their quality and effectiveness. Unlike assessments of clinical outcomes, feedback from patients offers insights into their lived experiences. The Department of Health and Social Care in England via National Health Service Digital operates a patient feedback web service through which patients can leave feedback of their experiences in structured and free-text report forms. Free-text feedback, compared with structured questionnaires, may be less biased by the feedback collector and, thus, more representative; however, it is harder to analyze in large quantities and challenging to derive meaningful, quantitative outcomes. OBJECTIVE: The aim of this study is to build a novel data analysis and interactive visualization pipeline accessible through an interactive web application to facilitate the interrogation of and provide unique insights into National Health Service patient feedback. METHODS: This study details the development of a text analysis tool that uses contemporary natural language processing and machine learning models to analyze free-text clinical service reviews to develop a robust classification model and interactive visualization web application. The methodology is based on the design science research paradigm and was conducted in three iterations: a sentiment analysis of the patient feedback corpus in the first iteration, topic modeling (unigram and bigram)–based analysis for topic identification in the second iteration, and nested topic modeling in the third iteration that combines sentiment analysis and topic modeling methods. An interactive data visualization web application for use by the general public was then created, presenting the data on a geographic representation of the country, making it easily accessible. RESULTS: Of the 11,103 possible clinical services that could be reviewed across England, 2030 (18.28%) different services received a combined total of 51,845 reviews between October 1, 2017, and September 30, 2019. Dominant topics were identified for the entire corpus followed by negative- and positive-sentiment topics in turn. Reviews containing high- and low-sentiment topics occurred more frequently than reviews containing less polarized topics. Time-series analysis identified trends in topic and sentiment occurrence frequency across the study period. CONCLUSIONS: Using contemporary natural language processing techniques, unstructured text data were effectively characterized for further analysis and visualization. An efficient pipeline was successfully combined with a web application, making automated analysis and dissemination of large volumes of information accessible. This study represents a significant step in efforts to generate and visualize useful, actionable, and unique information from free-text patient reviews. |
format | Online Article Text |
id | pubmed-9039814 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-90398142022-04-27 Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study Alexander, George Bahja, Mohammed Butt, Gibran Farook JMIR Med Inform Original Paper BACKGROUND: Obtaining patient feedback is an essential mechanism for health care service providers to assess their quality and effectiveness. Unlike assessments of clinical outcomes, feedback from patients offers insights into their lived experiences. The Department of Health and Social Care in England via National Health Service Digital operates a patient feedback web service through which patients can leave feedback of their experiences in structured and free-text report forms. Free-text feedback, compared with structured questionnaires, may be less biased by the feedback collector and, thus, more representative; however, it is harder to analyze in large quantities and challenging to derive meaningful, quantitative outcomes. OBJECTIVE: The aim of this study is to build a novel data analysis and interactive visualization pipeline accessible through an interactive web application to facilitate the interrogation of and provide unique insights into National Health Service patient feedback. METHODS: This study details the development of a text analysis tool that uses contemporary natural language processing and machine learning models to analyze free-text clinical service reviews to develop a robust classification model and interactive visualization web application. The methodology is based on the design science research paradigm and was conducted in three iterations: a sentiment analysis of the patient feedback corpus in the first iteration, topic modeling (unigram and bigram)–based analysis for topic identification in the second iteration, and nested topic modeling in the third iteration that combines sentiment analysis and topic modeling methods. An interactive data visualization web application for use by the general public was then created, presenting the data on a geographic representation of the country, making it easily accessible. RESULTS: Of the 11,103 possible clinical services that could be reviewed across England, 2030 (18.28%) different services received a combined total of 51,845 reviews between October 1, 2017, and September 30, 2019. Dominant topics were identified for the entire corpus followed by negative- and positive-sentiment topics in turn. Reviews containing high- and low-sentiment topics occurred more frequently than reviews containing less polarized topics. Time-series analysis identified trends in topic and sentiment occurrence frequency across the study period. CONCLUSIONS: Using contemporary natural language processing techniques, unstructured text data were effectively characterized for further analysis and visualization. An efficient pipeline was successfully combined with a web application, making automated analysis and dissemination of large volumes of information accessible. This study represents a significant step in efforts to generate and visualize useful, actionable, and unique information from free-text patient reviews. JMIR Publications 2022-04-11 /pmc/articles/PMC9039814/ /pubmed/35404254 http://dx.doi.org/10.2196/29385 Text en ©George Alexander, Mohammed Bahja, Gibran Farook Butt. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 11.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Alexander, George Bahja, Mohammed Butt, Gibran Farook Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title | Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title_full | Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title_fullStr | Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title_full_unstemmed | Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title_short | Automating Large-scale Health Care Service Feedback Analysis: Sentiment Analysis and Topic Modeling Study |
title_sort | automating large-scale health care service feedback analysis: sentiment analysis and topic modeling study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9039814/ https://www.ncbi.nlm.nih.gov/pubmed/35404254 http://dx.doi.org/10.2196/29385 |
work_keys_str_mv | AT alexandergeorge automatinglargescalehealthcareservicefeedbackanalysissentimentanalysisandtopicmodelingstudy AT bahjamohammed automatinglargescalehealthcareservicefeedbackanalysissentimentanalysisandtopicmodelingstudy AT buttgibranfarook automatinglargescalehealthcareservicefeedbackanalysissentimentanalysisandtopicmodelingstudy |