Cargando…

Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis

Qualitative research remains underused, in part due to the time and cost of annotating qualitative data (coding). Artificial intelligence (AI) has been suggested as a means to reduce those burdens, and has been used in exploratory studies to reduce the burden of coding. However, methods to date use...

Descripción completa

Detalles Bibliográficos
Autores principales: Lennon, Robert P, Fraleigh, Robbie, Van Scoy, Lauren J, Keshaviah, Aparna, Hu, Xindi C, Snyder, Bethany L, Miller, Erin L, Calo, William A, Zgierska, Aleksandra E, Griffin, Christopher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8627418/
https://www.ncbi.nlm.nih.gov/pubmed/34824135
http://dx.doi.org/10.1136/fmch-2021-001287
_version_ 1784606851850043392
author Lennon, Robert P
Fraleigh, Robbie
Van Scoy, Lauren J
Keshaviah, Aparna
Hu, Xindi C
Snyder, Bethany L
Miller, Erin L
Calo, William A
Zgierska, Aleksandra E
Griffin, Christopher
author_facet Lennon, Robert P
Fraleigh, Robbie
Van Scoy, Lauren J
Keshaviah, Aparna
Hu, Xindi C
Snyder, Bethany L
Miller, Erin L
Calo, William A
Zgierska, Aleksandra E
Griffin, Christopher
author_sort Lennon, Robert P
collection PubMed
description Qualitative research remains underused, in part due to the time and cost of annotating qualitative data (coding). Artificial intelligence (AI) has been suggested as a means to reduce those burdens, and has been used in exploratory studies to reduce the burden of coding. However, methods to date use AI analytical techniques that lack transparency, potentially limiting acceptance of results. We developed an automated qualitative assistant (AQUA) using a semiclassical approach, replacing Latent Semantic Indexing/Latent Dirichlet Allocation with a more transparent graph-theoretic topic extraction and clustering method. Applied to a large dataset of free-text survey responses, AQUA generated unsupervised topic categories and circle hierarchical representations of free-text responses, enabling rapid interpretation of data. When tasked with coding a subset of free-text data into user-defined qualitative categories, AQUA demonstrated intercoder reliability in several multicategory combinations with a Cohen’s kappa comparable to human coders (0.62–0.72), enabling researchers to automate coding on those categories for the entire dataset. The aim of this manuscript is to describe pertinent components of best practices of AI/machine learning (ML)-assisted qualitative methods, illustrating how primary care researchers may use AQUA to rapidly and accurately code large text datasets. The contribution of this article is providing guidance that should increase AI/ML transparency and reproducibility.
format Online
Article
Text
id pubmed-8627418
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-86274182021-12-10 Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis Lennon, Robert P Fraleigh, Robbie Van Scoy, Lauren J Keshaviah, Aparna Hu, Xindi C Snyder, Bethany L Miller, Erin L Calo, William A Zgierska, Aleksandra E Griffin, Christopher Fam Med Community Health Methodology and Research Methods Qualitative research remains underused, in part due to the time and cost of annotating qualitative data (coding). Artificial intelligence (AI) has been suggested as a means to reduce those burdens, and has been used in exploratory studies to reduce the burden of coding. However, methods to date use AI analytical techniques that lack transparency, potentially limiting acceptance of results. We developed an automated qualitative assistant (AQUA) using a semiclassical approach, replacing Latent Semantic Indexing/Latent Dirichlet Allocation with a more transparent graph-theoretic topic extraction and clustering method. Applied to a large dataset of free-text survey responses, AQUA generated unsupervised topic categories and circle hierarchical representations of free-text responses, enabling rapid interpretation of data. When tasked with coding a subset of free-text data into user-defined qualitative categories, AQUA demonstrated intercoder reliability in several multicategory combinations with a Cohen’s kappa comparable to human coders (0.62–0.72), enabling researchers to automate coding on those categories for the entire dataset. The aim of this manuscript is to describe pertinent components of best practices of AI/machine learning (ML)-assisted qualitative methods, illustrating how primary care researchers may use AQUA to rapidly and accurately code large text datasets. The contribution of this article is providing guidance that should increase AI/ML transparency and reproducibility. BMJ Publishing Group 2021-11-25 /pmc/articles/PMC8627418/ /pubmed/34824135 http://dx.doi.org/10.1136/fmch-2021-001287 Text en © Author(s) (or their employer(s)) 2021. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Methodology and Research Methods
Lennon, Robert P
Fraleigh, Robbie
Van Scoy, Lauren J
Keshaviah, Aparna
Hu, Xindi C
Snyder, Bethany L
Miller, Erin L
Calo, William A
Zgierska, Aleksandra E
Griffin, Christopher
Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title_full Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title_fullStr Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title_full_unstemmed Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title_short Developing and testing an automated qualitative assistant (AQUA) to support qualitative analysis
title_sort developing and testing an automated qualitative assistant (aqua) to support qualitative analysis
topic Methodology and Research Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8627418/
https://www.ncbi.nlm.nih.gov/pubmed/34824135
http://dx.doi.org/10.1136/fmch-2021-001287
work_keys_str_mv AT lennonrobertp developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT fraleighrobbie developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT vanscoylaurenj developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT keshaviahaparna developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT huxindic developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT snyderbethanyl developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT millererinl developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT calowilliama developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT zgierskaaleksandrae developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis
AT griffinchristopher developingandtestinganautomatedqualitativeassistantaquatosupportqualitativeanalysis