Cargando…
Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room
BACKGROUND: Internet-based data collection relies on well-designed and validated questionnaires. The theory behind designing and validating questionnaires is well described, but few practical examples of how to approach validation are available in the literature. OBJECTIVE: We aimed to validate data...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416133/ https://www.ncbi.nlm.nih.gov/pubmed/25887101 http://dx.doi.org/10.2196/jmir.3530 |
_version_ | 1782369183511609344 |
---|---|
author | Pugh, Carys A Summers, Kim M Bronsvoort, B Mark C Handel, Ian G Clements, Dylan N |
author_facet | Pugh, Carys A Summers, Kim M Bronsvoort, B Mark C Handel, Ian G Clements, Dylan N |
author_sort | Pugh, Carys A |
collection | PubMed |
description | BACKGROUND: Internet-based data collection relies on well-designed and validated questionnaires. The theory behind designing and validating questionnaires is well described, but few practical examples of how to approach validation are available in the literature. OBJECTIVE: We aimed to validate data collected in an ongoing Internet-based longitudinal health study through direct visits to participants and recall of their health records. We demonstrate that despite extensive pre-planning, social desirability can still affect data in unexpected ways and that anticipation of poor quality data may be confounded by positive validation. METHODS: Dogslife is a large-scale, Web-based longitudinal study of canine health, in which owners of Labrador Retrievers were recruited and questioned at regular intervals about the lifestyle and health of their dogs using an Internet-based questionnaire. The Dogslife questionnaire predominantly consists of closed-answer questions. In our work, two separate validation methodologies were used: (1) direct interviews with 43 participants during visits to their households and (2) comparison of owner-entered health reports with 139 historical health records. RESULTS: Our results indicate that user-derived measures should not be regarded as a single category; instead, each measurement should be considered separately as each presents its own challenge to participants. We recommend trying to ascertain the extent of recall decay within a study and, if necessary, using this to guide data collection timepoints and analyses. Finally, we recommend that multiple methods of communication facilitate validation studies and aid cohort engagement. CONCLUSIONS: Our study highlighted how the theory underpinning online questionnaire design and validation translates into practical data issues when applied to Internet-based studies. Validation should be regarded as an extension of questionnaire design, and that validation work should commence as soon as sufficient data are available. We believe that validation is a crucial step and hope our suggested guidelines will help facilitate validation of other Internet-based cohort studies. |
format | Online Article Text |
id | pubmed-4416133 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | JMIR Publications Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-44161332015-05-08 Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room Pugh, Carys A Summers, Kim M Bronsvoort, B Mark C Handel, Ian G Clements, Dylan N J Med Internet Res Viewpoint BACKGROUND: Internet-based data collection relies on well-designed and validated questionnaires. The theory behind designing and validating questionnaires is well described, but few practical examples of how to approach validation are available in the literature. OBJECTIVE: We aimed to validate data collected in an ongoing Internet-based longitudinal health study through direct visits to participants and recall of their health records. We demonstrate that despite extensive pre-planning, social desirability can still affect data in unexpected ways and that anticipation of poor quality data may be confounded by positive validation. METHODS: Dogslife is a large-scale, Web-based longitudinal study of canine health, in which owners of Labrador Retrievers were recruited and questioned at regular intervals about the lifestyle and health of their dogs using an Internet-based questionnaire. The Dogslife questionnaire predominantly consists of closed-answer questions. In our work, two separate validation methodologies were used: (1) direct interviews with 43 participants during visits to their households and (2) comparison of owner-entered health reports with 139 historical health records. RESULTS: Our results indicate that user-derived measures should not be regarded as a single category; instead, each measurement should be considered separately as each presents its own challenge to participants. We recommend trying to ascertain the extent of recall decay within a study and, if necessary, using this to guide data collection timepoints and analyses. Finally, we recommend that multiple methods of communication facilitate validation studies and aid cohort engagement. CONCLUSIONS: Our study highlighted how the theory underpinning online questionnaire design and validation translates into practical data issues when applied to Internet-based studies. Validation should be regarded as an extension of questionnaire design, and that validation work should commence as soon as sufficient data are available. We believe that validation is a crucial step and hope our suggested guidelines will help facilitate validation of other Internet-based cohort studies. JMIR Publications Inc. 2015-04-16 /pmc/articles/PMC4416133/ /pubmed/25887101 http://dx.doi.org/10.2196/jmir.3530 Text en ©Carys A Pugh, Kim M Summers, B Mark C Bronsvoort, Ian G Handel, Dylan N Clements. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.04.2015. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Viewpoint Pugh, Carys A Summers, Kim M Bronsvoort, B Mark C Handel, Ian G Clements, Dylan N Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title | Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title_full | Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title_fullStr | Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title_full_unstemmed | Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title_short | Validity of Internet-Based Longitudinal Study Data: The Elephant in the Virtual Room |
title_sort | validity of internet-based longitudinal study data: the elephant in the virtual room |
topic | Viewpoint |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4416133/ https://www.ncbi.nlm.nih.gov/pubmed/25887101 http://dx.doi.org/10.2196/jmir.3530 |
work_keys_str_mv | AT pughcarysa validityofinternetbasedlongitudinalstudydatatheelephantinthevirtualroom AT summerskimm validityofinternetbasedlongitudinalstudydatatheelephantinthevirtualroom AT bronsvoortbmarkc validityofinternetbasedlongitudinalstudydatatheelephantinthevirtualroom AT handeliang validityofinternetbasedlongitudinalstudydatatheelephantinthevirtualroom AT clementsdylann validityofinternetbasedlongitudinalstudydatatheelephantinthevirtualroom |