Cargando…

Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys

Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implement...

Descripción completa

Detalles Bibliográficos
Autores principales: Roman, Zachary Joseph, Brandt, Holger, Miller, Jason Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093679/
https://www.ncbi.nlm.nih.gov/pubmed/35572225
http://dx.doi.org/10.3389/fpsyg.2022.789223
_version_ 1784705385348726784
author Roman, Zachary Joseph
Brandt, Holger
Miller, Jason Michael
author_facet Roman, Zachary Joseph
Brandt, Holger
Miller, Jason Michael
author_sort Roman, Zachary Joseph
collection PubMed
description Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implementation. A major drawback is the existence of bots that are used to complete online surveys for financial gain. These bots contaminate data and need to be identified in order to draw valid conclusions from data obtained with these platforms. In this article, we will provide a Bayesian latent class joint modeling approach that can be routinely applied to identify bots and simultaneously estimate a model of interest. This method can be used to separate the bots' response patterns from real human responses that were provided in line with the item content. The model has the advantage that it is very flexible and is based on plausible assumptions that are met in most empirical settings. We will provide a simulation study that investigates the performance of the model under several relevant scenarios including sample size, proportion of bots, and model complexity. We will show that ignoring bots will lead to severe parameter bias whereas the Bayesian latent class model results in unbiased estimates and thus controls this source of bias. We will illustrate the model and its capabilities with data from an empirical political ideation survey with known bots. We will discuss the implications of the findings with regard to future data collection via online platforms.
format Online
Article
Text
id pubmed-9093679
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90936792022-05-12 Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys Roman, Zachary Joseph Brandt, Holger Miller, Jason Michael Front Psychol Psychology Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implementation. A major drawback is the existence of bots that are used to complete online surveys for financial gain. These bots contaminate data and need to be identified in order to draw valid conclusions from data obtained with these platforms. In this article, we will provide a Bayesian latent class joint modeling approach that can be routinely applied to identify bots and simultaneously estimate a model of interest. This method can be used to separate the bots' response patterns from real human responses that were provided in line with the item content. The model has the advantage that it is very flexible and is based on plausible assumptions that are met in most empirical settings. We will provide a simulation study that investigates the performance of the model under several relevant scenarios including sample size, proportion of bots, and model complexity. We will show that ignoring bots will lead to severe parameter bias whereas the Bayesian latent class model results in unbiased estimates and thus controls this source of bias. We will illustrate the model and its capabilities with data from an empirical political ideation survey with known bots. We will discuss the implications of the findings with regard to future data collection via online platforms. Frontiers Media S.A. 2022-04-27 /pmc/articles/PMC9093679/ /pubmed/35572225 http://dx.doi.org/10.3389/fpsyg.2022.789223 Text en Copyright © 2022 Roman, Brandt and Miller. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Roman, Zachary Joseph
Brandt, Holger
Miller, Jason Michael
Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title_full Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title_fullStr Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title_full_unstemmed Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title_short Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
title_sort automated bot detection using bayesian latent class models in online surveys
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093679/
https://www.ncbi.nlm.nih.gov/pubmed/35572225
http://dx.doi.org/10.3389/fpsyg.2022.789223
work_keys_str_mv AT romanzacharyjoseph automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys
AT brandtholger automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys
AT millerjasonmichael automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys