Cargando…

Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review

BACKGROUND: Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within “big data,” but outputs can be sensitive to decisions...

Descripción completa

Detalles Bibliográficos
Autores principales: Hagg, Lauryn J, Merkouris, Stephanie S, O’Dea, Gypsy A, Francis, Lauren M, Greenwood, Christopher J, Fuller-Tyszkiewicz, Matthew, Westrupp, Elizabeth M, Macdonald, Jacqui A, Youssef, George J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682457/
https://www.ncbi.nlm.nih.gov/pubmed/36346659
http://dx.doi.org/10.2196/33166
_version_ 1784834856237137920
author Hagg, Lauryn J
Merkouris, Stephanie S
O’Dea, Gypsy A
Francis, Lauren M
Greenwood, Christopher J
Fuller-Tyszkiewicz, Matthew
Westrupp, Elizabeth M
Macdonald, Jacqui A
Youssef, George J
author_facet Hagg, Lauryn J
Merkouris, Stephanie S
O’Dea, Gypsy A
Francis, Lauren M
Greenwood, Christopher J
Fuller-Tyszkiewicz, Matthew
Westrupp, Elizabeth M
Macdonald, Jacqui A
Youssef, George J
author_sort Hagg, Lauryn J
collection PubMed
description BACKGROUND: Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within “big data,” but outputs can be sensitive to decisions made during the analytic pipeline and may not be suitable for certain scenarios such as short texts, and we highlight resources for alternative approaches. This review focuses on the complex analytical practices specific to LDA, which existing practical guides for training LDA models have not addressed. OBJECTIVE: This scoping review used key analytical steps (data selection, data preprocessing, and data analysis) as a framework to understand the methodological approaches being used in psychology research using LDA. METHODS: A total of 4 psychology and health databases were searched. Studies were included if they used LDA to analyze written words and focused on a psychological construct or issue. The data charting processes were constructed and employed based on common data selection, preprocessing, and data analysis steps. RESULTS: A total of 68 studies were included. These studies explored a range of research areas and mostly sourced their data from social media platforms. Although some studies reported on preprocessing and data analysis steps taken, most studies did not provide sufficient detail for reproducibility. Furthermore, the debate surrounding the necessity of certain preprocessing and data analysis steps is revealed. CONCLUSIONS: Our findings highlight the growing use of LDA in psychological science. However, there is a need to improve analytical reporting standards and identify comprehensive and evidence-based best practice recommendations. To work toward this, we developed an LDA Preferred Reporting Checklist that will allow for consistent documentation of LDA analytic decisions and reproducible research outcomes.
format Online
Article
Text
id pubmed-9682457
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-96824572022-11-24 Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review Hagg, Lauryn J Merkouris, Stephanie S O’Dea, Gypsy A Francis, Lauren M Greenwood, Christopher J Fuller-Tyszkiewicz, Matthew Westrupp, Elizabeth M Macdonald, Jacqui A Youssef, George J J Med Internet Res Review BACKGROUND: Topic modeling approaches allow researchers to analyze and represent written texts. One of the commonly used approaches in psychology is latent Dirichlet allocation (LDA), which is used for rapidly synthesizing patterns of text within “big data,” but outputs can be sensitive to decisions made during the analytic pipeline and may not be suitable for certain scenarios such as short texts, and we highlight resources for alternative approaches. This review focuses on the complex analytical practices specific to LDA, which existing practical guides for training LDA models have not addressed. OBJECTIVE: This scoping review used key analytical steps (data selection, data preprocessing, and data analysis) as a framework to understand the methodological approaches being used in psychology research using LDA. METHODS: A total of 4 psychology and health databases were searched. Studies were included if they used LDA to analyze written words and focused on a psychological construct or issue. The data charting processes were constructed and employed based on common data selection, preprocessing, and data analysis steps. RESULTS: A total of 68 studies were included. These studies explored a range of research areas and mostly sourced their data from social media platforms. Although some studies reported on preprocessing and data analysis steps taken, most studies did not provide sufficient detail for reproducibility. Furthermore, the debate surrounding the necessity of certain preprocessing and data analysis steps is revealed. CONCLUSIONS: Our findings highlight the growing use of LDA in psychological science. However, there is a need to improve analytical reporting standards and identify comprehensive and evidence-based best practice recommendations. To work toward this, we developed an LDA Preferred Reporting Checklist that will allow for consistent documentation of LDA analytic decisions and reproducible research outcomes. JMIR Publications 2022-11-08 /pmc/articles/PMC9682457/ /pubmed/36346659 http://dx.doi.org/10.2196/33166 Text en ©Lauryn J Hagg, Stephanie S Merkouris, Gypsy A O’Dea, Lauren M Francis, Christopher J Greenwood, Matthew Fuller-Tyszkiewicz, Elizabeth M Westrupp, Jacqui A Macdonald, George J Youssef. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 08.11.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Review
Hagg, Lauryn J
Merkouris, Stephanie S
O’Dea, Gypsy A
Francis, Lauren M
Greenwood, Christopher J
Fuller-Tyszkiewicz, Matthew
Westrupp, Elizabeth M
Macdonald, Jacqui A
Youssef, George J
Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title_full Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title_fullStr Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title_full_unstemmed Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title_short Examining Analytic Practices in Latent Dirichlet Allocation Within Psychological Science: Scoping Review
title_sort examining analytic practices in latent dirichlet allocation within psychological science: scoping review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9682457/
https://www.ncbi.nlm.nih.gov/pubmed/36346659
http://dx.doi.org/10.2196/33166
work_keys_str_mv AT hagglaurynj examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT merkourisstephanies examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT odeagypsya examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT francislaurenm examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT greenwoodchristopherj examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT fullertyszkiewiczmatthew examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT westruppelizabethm examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT macdonaldjacquia examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview
AT youssefgeorgej examininganalyticpracticesinlatentdirichletallocationwithinpsychologicalsciencescopingreview