Cargando…

Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk

BACKGROUND: Researchers have implemented multiple approaches to increase data quality from existing web-based panels such as Amazon’s Mechanical Turk (MTurk). OBJECTIVE: This study extends prior work by examining improvements in data quality and effects on mean estimates of health status by excludin...

Descripción completa

Detalles Bibliográficos
Autores principales: Hays, Ron D, Qureshi, Nabeel, Herman, Patricia M, Rodriguez, Anthony, Kapteyn, Arie, Edelen, Maria Orlando
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439462/
https://www.ncbi.nlm.nih.gov/pubmed/37540543
http://dx.doi.org/10.2196/46421
_version_ 1785092949882699776
author Hays, Ron D
Qureshi, Nabeel
Herman, Patricia M
Rodriguez, Anthony
Kapteyn, Arie
Edelen, Maria Orlando
author_facet Hays, Ron D
Qureshi, Nabeel
Herman, Patricia M
Rodriguez, Anthony
Kapteyn, Arie
Edelen, Maria Orlando
author_sort Hays, Ron D
collection PubMed
description BACKGROUND: Researchers have implemented multiple approaches to increase data quality from existing web-based panels such as Amazon’s Mechanical Turk (MTurk). OBJECTIVE: This study extends prior work by examining improvements in data quality and effects on mean estimates of health status by excluding respondents who endorse 1 or both of 2 fake health conditions (“Syndomitis” and “Chekalism”). METHODS: Survey data were collected in 2021 at baseline and 3 months later from MTurk study participants, aged 18 years or older, with an internet protocol address in the United States, and who had completed a minimum of 500 previous MTurk “human intelligence tasks.” We included questions about demographic characteristics, health conditions (including the 2 fake conditions), and the Patient Reported Outcomes Measurement Information System (PROMIS)-29+2 (version 2.1) preference–based score survey. The 3-month follow-up survey was only administered to those who reported having back pain and did not endorse a fake condition at baseline. RESULTS: In total, 15% (996/6832) of the sample endorsed at least 1 of the 2 fake conditions at baseline. Those who endorsed a fake condition at baseline were more likely to identify as male, non-White, younger, report more health conditions, and take longer to complete the survey than those who did not endorse a fake condition. They also had substantially lower internal consistency reliability on the PROMIS-29+2 scales than those who did not endorse a fake condition: physical function (0.69 vs 0.89), pain interference (0.80 vs 0.94), fatigue (0.80 vs 0.92), depression (0.78 vs 0.92), anxiety (0.78 vs 0.90), sleep disturbance (−0.27 vs 0.84), ability to participate in social roles and activities (0.77 vs 0.92), and cognitive function (0.65 vs 0.77). The lack of reliability of the sleep disturbance scale for those endorsing a fake condition was because it includes both positively and negatively worded items. Those who reported a fake condition reported significantly worse self-reported health scores (except for sleep disturbance) than those who did not endorse a fake condition. Excluding those who endorsed a fake condition improved the overall mean PROMIS-29+2 (version 2.1) T-scores by 1-2 points and the PROMIS preference–based score by 0.04. Although they did not endorse a fake condition at baseline, 6% (n=59) of them endorsed at least 1 of them on the 3-month survey and they had lower PROMIS-29+2 score internal consistency reliability and worse mean scores on the 3-month survey than those who did not report having a fake condition. Based on these results, we estimate that 25% (1708/6832) of the MTurk respondents provided careless or dishonest responses. CONCLUSIONS: This study provides evidence that asking about fake health conditions can help to screen out respondents who may be dishonest or careless. We recommend this approach be used routinely in samples of members of MTurk.
format Online
Article
Text
id pubmed-10439462
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-104394622023-08-20 Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk Hays, Ron D Qureshi, Nabeel Herman, Patricia M Rodriguez, Anthony Kapteyn, Arie Edelen, Maria Orlando J Med Internet Res Original Paper BACKGROUND: Researchers have implemented multiple approaches to increase data quality from existing web-based panels such as Amazon’s Mechanical Turk (MTurk). OBJECTIVE: This study extends prior work by examining improvements in data quality and effects on mean estimates of health status by excluding respondents who endorse 1 or both of 2 fake health conditions (“Syndomitis” and “Chekalism”). METHODS: Survey data were collected in 2021 at baseline and 3 months later from MTurk study participants, aged 18 years or older, with an internet protocol address in the United States, and who had completed a minimum of 500 previous MTurk “human intelligence tasks.” We included questions about demographic characteristics, health conditions (including the 2 fake conditions), and the Patient Reported Outcomes Measurement Information System (PROMIS)-29+2 (version 2.1) preference–based score survey. The 3-month follow-up survey was only administered to those who reported having back pain and did not endorse a fake condition at baseline. RESULTS: In total, 15% (996/6832) of the sample endorsed at least 1 of the 2 fake conditions at baseline. Those who endorsed a fake condition at baseline were more likely to identify as male, non-White, younger, report more health conditions, and take longer to complete the survey than those who did not endorse a fake condition. They also had substantially lower internal consistency reliability on the PROMIS-29+2 scales than those who did not endorse a fake condition: physical function (0.69 vs 0.89), pain interference (0.80 vs 0.94), fatigue (0.80 vs 0.92), depression (0.78 vs 0.92), anxiety (0.78 vs 0.90), sleep disturbance (−0.27 vs 0.84), ability to participate in social roles and activities (0.77 vs 0.92), and cognitive function (0.65 vs 0.77). The lack of reliability of the sleep disturbance scale for those endorsing a fake condition was because it includes both positively and negatively worded items. Those who reported a fake condition reported significantly worse self-reported health scores (except for sleep disturbance) than those who did not endorse a fake condition. Excluding those who endorsed a fake condition improved the overall mean PROMIS-29+2 (version 2.1) T-scores by 1-2 points and the PROMIS preference–based score by 0.04. Although they did not endorse a fake condition at baseline, 6% (n=59) of them endorsed at least 1 of them on the 3-month survey and they had lower PROMIS-29+2 score internal consistency reliability and worse mean scores on the 3-month survey than those who did not report having a fake condition. Based on these results, we estimate that 25% (1708/6832) of the MTurk respondents provided careless or dishonest responses. CONCLUSIONS: This study provides evidence that asking about fake health conditions can help to screen out respondents who may be dishonest or careless. We recommend this approach be used routinely in samples of members of MTurk. JMIR Publications 2023-08-04 /pmc/articles/PMC10439462/ /pubmed/37540543 http://dx.doi.org/10.2196/46421 Text en ©Ron D Hays, Nabeel Qureshi, Patricia M Herman, Anthony Rodriguez, Arie Kapteyn, Maria Orlando Edelen. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 04.08.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Hays, Ron D
Qureshi, Nabeel
Herman, Patricia M
Rodriguez, Anthony
Kapteyn, Arie
Edelen, Maria Orlando
Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title_full Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title_fullStr Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title_full_unstemmed Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title_short Effects of Excluding Those Who Report Having “Syndomitis” or “Chekalism” on Data Quality: Longitudinal Health Survey of a Sample From Amazon’s Mechanical Turk
title_sort effects of excluding those who report having “syndomitis” or “chekalism” on data quality: longitudinal health survey of a sample from amazon’s mechanical turk
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439462/
https://www.ncbi.nlm.nih.gov/pubmed/37540543
http://dx.doi.org/10.2196/46421
work_keys_str_mv AT haysrond effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk
AT qureshinabeel effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk
AT hermanpatriciam effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk
AT rodriguezanthony effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk
AT kapteynarie effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk
AT edelenmariaorlando effectsofexcludingthosewhoreporthavingsyndomitisorchekalismondataqualitylongitudinalhealthsurveyofasamplefromamazonsmechanicalturk