Cargando…
Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys
Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implement...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093679/ https://www.ncbi.nlm.nih.gov/pubmed/35572225 http://dx.doi.org/10.3389/fpsyg.2022.789223 |
_version_ | 1784705385348726784 |
---|---|
author | Roman, Zachary Joseph Brandt, Holger Miller, Jason Michael |
author_facet | Roman, Zachary Joseph Brandt, Holger Miller, Jason Michael |
author_sort | Roman, Zachary Joseph |
collection | PubMed |
description | Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implementation. A major drawback is the existence of bots that are used to complete online surveys for financial gain. These bots contaminate data and need to be identified in order to draw valid conclusions from data obtained with these platforms. In this article, we will provide a Bayesian latent class joint modeling approach that can be routinely applied to identify bots and simultaneously estimate a model of interest. This method can be used to separate the bots' response patterns from real human responses that were provided in line with the item content. The model has the advantage that it is very flexible and is based on plausible assumptions that are met in most empirical settings. We will provide a simulation study that investigates the performance of the model under several relevant scenarios including sample size, proportion of bots, and model complexity. We will show that ignoring bots will lead to severe parameter bias whereas the Bayesian latent class model results in unbiased estimates and thus controls this source of bias. We will illustrate the model and its capabilities with data from an empirical political ideation survey with known bots. We will discuss the implications of the findings with regard to future data collection via online platforms. |
format | Online Article Text |
id | pubmed-9093679 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90936792022-05-12 Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys Roman, Zachary Joseph Brandt, Holger Miller, Jason Michael Front Psychol Psychology Behavioral scientists have become increasingly reliant on online survey platforms such as Amazon's Mechanical Turk (Mturk). These platforms have many advantages, for example it provides ease of access to difficult to sample populations, a large pool of participants, and an easy to use implementation. A major drawback is the existence of bots that are used to complete online surveys for financial gain. These bots contaminate data and need to be identified in order to draw valid conclusions from data obtained with these platforms. In this article, we will provide a Bayesian latent class joint modeling approach that can be routinely applied to identify bots and simultaneously estimate a model of interest. This method can be used to separate the bots' response patterns from real human responses that were provided in line with the item content. The model has the advantage that it is very flexible and is based on plausible assumptions that are met in most empirical settings. We will provide a simulation study that investigates the performance of the model under several relevant scenarios including sample size, proportion of bots, and model complexity. We will show that ignoring bots will lead to severe parameter bias whereas the Bayesian latent class model results in unbiased estimates and thus controls this source of bias. We will illustrate the model and its capabilities with data from an empirical political ideation survey with known bots. We will discuss the implications of the findings with regard to future data collection via online platforms. Frontiers Media S.A. 2022-04-27 /pmc/articles/PMC9093679/ /pubmed/35572225 http://dx.doi.org/10.3389/fpsyg.2022.789223 Text en Copyright © 2022 Roman, Brandt and Miller. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Roman, Zachary Joseph Brandt, Holger Miller, Jason Michael Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title | Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title_full | Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title_fullStr | Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title_full_unstemmed | Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title_short | Automated Bot Detection Using Bayesian Latent Class Models in Online Surveys |
title_sort | automated bot detection using bayesian latent class models in online surveys |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9093679/ https://www.ncbi.nlm.nih.gov/pubmed/35572225 http://dx.doi.org/10.3389/fpsyg.2022.789223 |
work_keys_str_mv | AT romanzacharyjoseph automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys AT brandtholger automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys AT millerjasonmichael automatedbotdetectionusingbayesianlatentclassmodelsinonlinesurveys |