Cargando…
Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic
Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surv...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446196/ https://www.ncbi.nlm.nih.gov/pubmed/37610999 http://dx.doi.org/10.1371/journal.pgph.0001452 |
_version_ | 1785094351136751616 |
---|---|
author | Wang, June Calderon, Gabriela Hager, Erin R. Edwards, Lorece V. Berry, Andrea A. Liu, Yisi Dinh, Janny Summers, August C. Connor, Katherine A. Collins, Megan E. Prichett, Laura Marshall, Beth R. Johnson, Sara B. |
author_facet | Wang, June Calderon, Gabriela Hager, Erin R. Edwards, Lorece V. Berry, Andrea A. Liu, Yisi Dinh, Janny Summers, August C. Connor, Katherine A. Collins, Megan E. Prichett, Laura Marshall, Beth R. Johnson, Sara B. |
author_sort | Wang, June |
collection | PubMed |
description | Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surveys to receive incentives, a major risk to data integrity. As part of a study of COVID-19 and the return to in-person school, we implemented a web-based survey of parents in Maryland between December 2021 and July 2022. Recruitment relied, in part, on social media advertisements. Despite implementing many existing best practices, we found the survey challenged by sophisticated fraudsters. In response, we iteratively improved survey security. In this paper, we describe efforts to identify and prevent fraudulent online survey responses. Informed by this experience, we provide specific, actionable recommendations for identifying and preventing online survey fraud in future research. Some strategies can be deployed within the data collection platform such as careful crafting of survey links, Internet Protocol address logging to identify duplicate responses, and comparison of client-side and server-side time stamps to identify responses that may have been completed by respondents outside of the survey’s target geography. Other strategies can be implemented during the survey design phase. These approaches include the use of a 2-stage design in which respondents must be eligible on a preliminary screener before receiving a personalized link. Other design-based strategies include within-survey and cross-survey validation questions, the addition of “speed bump” questions to thwart careless or computerized responders, and the use of optional open-ended survey questions to identify fraudsters. We describe best practices for ongoing monitoring and post-completion survey data review and verification, including algorithms to expedite some aspects of data review and quality assurance. Such strategies are increasingly critical to safeguarding survey-based public health research. |
format | Online Article Text |
id | pubmed-10446196 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-104461962023-08-24 Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic Wang, June Calderon, Gabriela Hager, Erin R. Edwards, Lorece V. Berry, Andrea A. Liu, Yisi Dinh, Janny Summers, August C. Connor, Katherine A. Collins, Megan E. Prichett, Laura Marshall, Beth R. Johnson, Sara B. PLOS Glob Public Health Research Article Web-based survey data collection has become increasingly popular, and limitations on in-person data collection during the COVID-19 pandemic have fueled this growth. However, the anonymity of the online environment increases the risk of fraudulent responses provided by bots or those who complete surveys to receive incentives, a major risk to data integrity. As part of a study of COVID-19 and the return to in-person school, we implemented a web-based survey of parents in Maryland between December 2021 and July 2022. Recruitment relied, in part, on social media advertisements. Despite implementing many existing best practices, we found the survey challenged by sophisticated fraudsters. In response, we iteratively improved survey security. In this paper, we describe efforts to identify and prevent fraudulent online survey responses. Informed by this experience, we provide specific, actionable recommendations for identifying and preventing online survey fraud in future research. Some strategies can be deployed within the data collection platform such as careful crafting of survey links, Internet Protocol address logging to identify duplicate responses, and comparison of client-side and server-side time stamps to identify responses that may have been completed by respondents outside of the survey’s target geography. Other strategies can be implemented during the survey design phase. These approaches include the use of a 2-stage design in which respondents must be eligible on a preliminary screener before receiving a personalized link. Other design-based strategies include within-survey and cross-survey validation questions, the addition of “speed bump” questions to thwart careless or computerized responders, and the use of optional open-ended survey questions to identify fraudsters. We describe best practices for ongoing monitoring and post-completion survey data review and verification, including algorithms to expedite some aspects of data review and quality assurance. Such strategies are increasingly critical to safeguarding survey-based public health research. Public Library of Science 2023-08-23 /pmc/articles/PMC10446196/ /pubmed/37610999 http://dx.doi.org/10.1371/journal.pgph.0001452 Text en © 2023 Wang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wang, June Calderon, Gabriela Hager, Erin R. Edwards, Lorece V. Berry, Andrea A. Liu, Yisi Dinh, Janny Summers, August C. Connor, Katherine A. Collins, Megan E. Prichett, Laura Marshall, Beth R. Johnson, Sara B. Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title | Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title_full | Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title_fullStr | Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title_full_unstemmed | Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title_short | Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic |
title_sort | identifying and preventing fraudulent responses in online public health surveys: lessons learned during the covid-19 pandemic |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10446196/ https://www.ncbi.nlm.nih.gov/pubmed/37610999 http://dx.doi.org/10.1371/journal.pgph.0001452 |
work_keys_str_mv | AT wangjune identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT calderongabriela identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT hagererinr identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT edwardslorecev identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT berryandreaa identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT liuyisi identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT dinhjanny identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT summersaugustc identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT connorkatherinea identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT collinsmegane identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT prichettlaura identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT marshallbethr identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic AT johnsonsarab identifyingandpreventingfraudulentresponsesinonlinepublichealthsurveyslessonslearnedduringthecovid19pandemic |