Cargando…

Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic

Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surv...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, June, Calderon, Gabriela, Hager, Erin R., Edwards, Lorece V., Berry, Andrea A., Liu, Yisi, Dinh, Janny, Summers, August C., Connor, Katherine A., Collins, Megan E., Prichett, Laura, Marshall, Beth R., Johnson, Sara B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446196/
https://www.ncbi.nlm.nih.gov/pubmed/37610999
http://dx.doi.org/10.1371/journal.pgph.0001452
_version_ 1785094351136751616
author Wang, June
Calderon, Gabriela
Hager, Erin R.
Edwards, Lorece V.
Berry, Andrea A.
Liu, Yisi
Dinh, Janny
Summers, August C.
Connor, Katherine A.
Collins, Megan E.
Prichett, Laura
Marshall, Beth R.
Johnson, Sara B.
author_facet Wang, June
Calderon, Gabriela
Hager, Erin R.
Edwards, Lorece V.
Berry, Andrea A.
Liu, Yisi
Dinh, Janny
Summers, August C.
Connor, Katherine A.
Collins, Megan E.
Prichett, Laura
Marshall, Beth R.
Johnson, Sara B.
author_sort Wang, June
collection PubMed
description Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surveys to receive incentives, a major risk to data integrity. As part of a study of COVID-19 and the return to in-person school, we implemented a web-based survey of parents in Maryland between December 2021 and July 2022. Recruitment relied, in part, on social media advertisements. Despite implementing many existing best practices, we found the survey challenged by sophisticated fraudsters. In response, we iteratively improved survey security. In this paper, we describe efforts to identify and prevent fraudulent online survey responses. Informed by this experience, we provide specific, actionable recommendations for identifying and preventing online survey fraud in future research. Some strategies can be deployed within the data collection platform such as careful crafting of survey links, Internet Protocol address logging to identify duplicate responses, and comparison of client-side and server-side time stamps to identify responses that may have been completed by respondents outside of the survey’s target geography. Other strategies can be implemented during the survey design phase. These approaches include the use of a 2-stage design in which respondents must be eligible on a preliminary screener before receiving a personalized link. Other design-based strategies include within-survey and cross-survey validation questions, the addition of “speed bump” questions to thwart careless or computerized responders, and the use of optional open-ended survey questions to identify fraudsters. We describe best practices for ongoing monitoring and post-completion survey data review and verification, including algorithms to expedite some aspects of data review and quality assurance. Such strategies are increasingly critical to safeguarding survey-based public health research.
format Online
Article
Text
id pubmed-10446196
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-104461962023-08-24 Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic Wang, June Calderon, Gabriela Hager, Erin R. Edwards, Lorece V. Berry, Andrea A. Liu, Yisi Dinh, Janny Summers, August C. Connor, Katherine A. Collins, Megan E. Prichett, Laura Marshall, Beth R. Johnson, Sara B. PLOS Glob Public Health Research Article Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surveys to receive incentives, a major risk to data integrity. As part of a study of COVID-19 and the return to in-person school, we implemented a web-based survey of parents in Maryland between December 2021 and July 2022. Recruitment relied, in part, on social media advertisements. Despite implementing many existing best practices, we found the survey challenged by sophisticated fraudsters. In response, we iteratively improved survey security. In this paper, we describe efforts to identify and prevent fraudulent online survey responses. Informed by this experience, we provide specific, actionable recommendations for identifying and preventing online survey fraud in future research. Some strategies can be deployed within the data collection platform such as careful crafting of survey links, Internet Protocol address logging to identify duplicate responses, and comparison of client-side and server-side time stamps to identify responses that may have been completed by respondents outside of the survey’s target geography. Other strategies can be implemented during the survey design phase. These approaches include the use of a 2-stage design in which respondents must be eligible on a preliminary screener before receiving a personalized link. Other design-based strategies include within-survey and cross-survey validation questions, the addition of “speed bump” questions to thwart careless or computerized responders, and the use of optional open-ended survey questions to identify fraudsters. We describe best practices for ongoing monitoring and post-completion survey data review and verification, including algorithms to expedite some aspects of data review and quality assurance. Such strategies are increasingly critical to safeguarding survey-based public health research. Public Library of Science 2023-08-23 /pmc/articles/PMC10446196/ /pubmed/37610999 http://dx.doi.org/10.1371/journal.pgph.0001452 Text en © 2023 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, June
Calderon, Gabriela
Hager, Erin R.
Edwards, Lorece V.
Berry, Andrea A.
Liu, Yisi
Dinh, Janny
Summers, August C.
Connor, Katherine A.
Collins, Megan E.
Prichett, Laura
Marshall, Beth R.
Johnson, Sara B.
Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title_full Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title_fullStr Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title_full_unstemmed Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title_short Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
title_sort identifying and preventing fraudulent responses in online public health surveys: lessons learned during the covid-19 pandemic
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446196/
https://www.ncbi.nlm.nih.gov/pubmed/37610999
http://dx.doi.org/10.1371/journal.pgph.0001452
work_keys_str_mv AT wangjune identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT calderongabriela identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT hagererinr identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT edwardslorecev identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT berryandreaa identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT liuyisi identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT dinhjanny identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT summersaugustc identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT connorkatherinea identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT collinsmegane identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT prichettlaura identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT marshallbethr identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic
AT johnsonsarab identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic