Applying machine-learning to rapidly analyze large qualitative text datasets to inform the COVID-19 pandemic response: comparing human and machine-assisted topic analysis techniques

INTRODUCTION: Machine-assisted topic analysis (MATA) uses artificial intelligence methods to help qualitative researchers analyze large datasets. This is useful for researchers to rapidly update healthcare interventions during changing healthcare contexts, such as a pandemic. We examined the potenti...

Bibliographic Details
Main Authors:	Towler, Lauren; Bondaronek, Paulina; Papakonstantinou, Trisevgeni; Amlôt, Richard; Chadborn, Tim; Ainsworth, Ben; Yardley, Lucy
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2023
Subjects:	Public Health
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644111/ https://www.ncbi.nlm.nih.gov/pubmed/38026376 http://dx.doi.org/10.3389/fpubh.2023.1268223

_version_	1785134482310823936
author	Towler, Lauren; Bondaronek, Paulina; Papakonstantinou, Trisevgeni; Amlôt, Richard; Chadborn, Tim; Ainsworth, Ben; Yardley, Lucy
author_sort	Towler, Lauren
collection	PubMed
description	INTRODUCTION: Machine-assisted topic analysis (MATA) uses artificial intelligence methods to help qualitative researchers analyze large datasets. This is useful for researchers to rapidly update healthcare interventions during changing healthcare contexts, such as a pandemic. We examined the potential to support healthcare interventions by comparing MATA with “human-only” thematic analysis techniques on the same dataset (1,472 user responses from a COVID-19 behavioral intervention). METHODS: In MATA, an unsupervised topic-modeling approach identified latent topics in the text, from which researchers identified broad themes. In human-only codebook analysis, researchers developed an initial codebook based on previous research that was applied to the dataset by the team, who met regularly to discuss and refine the codes. Formal triangulation using a “convergence coding matrix” compared findings between methods, categorizing them as “agreement”, “complementary”, “dissonant”, or “silent”. RESULTS: Human analysis took much longer than MATA (147.5 vs. 40 h). Both methods identified key themes about what users found helpful and unhelpful. Formal triangulation showed both sets of findings were highly similar. All MATA codes were classified as in agreement or complementary to the human themes. When findings differed slightly, this was due to human researcher interpretations or nuance from human-only analysis. DISCUSSION: Results produced by MATA were similar to human-only thematic analysis, with substantial time savings. For simple analyses that do not require an in-depth or subtle understanding of the data, MATA is a useful tool that can support qualitative researchers to interpret and analyze large datasets quickly. This approach can support intervention development and implementation, such as enabling rapid optimization during public health emergencies.
format	Online Article Text
id	pubmed-10644111
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-10644111 2023-10-31 Front Public Health Frontiers Media S.A. /pmc/articles/PMC10644111/ /pubmed/38026376 http://dx.doi.org/10.3389/fpubh.2023.1268223 Text en Copyright © 2023 Towler, Bondaronek, Papakonstantinou, Amlôt, Chadborn, Ainsworth and Yardley. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title	Applying machine-learning to rapidly analyze large qualitative text datasets to inform the COVID-19 pandemic response: comparing human and machine-assisted topic analysis techniques
topic	Public Health
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10644111/ https://www.ncbi.nlm.nih.gov/pubmed/38026376 http://dx.doi.org/10.3389/fpubh.2023.1268223

Similar Items