Cargando…

Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools

OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical resea...

Descripción completa

Detalles Bibliográficos
Autores principales: Jing, Xia, Cimino, James J., Patel, Vimla L., Zhou, Yuchun, Shubrook, Jay H., De Lacalle, Sonsoles, Draghi, Brooke N., Ernst, Mytchell A., Weaver, Aneesa, Sekar, Shriram, Liu, Chang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274969/
https://www.ncbi.nlm.nih.gov/pubmed/37333271
http://dx.doi.org/10.1101/2023.05.30.23290719
_version_ 1785059818216620032
author Jing, Xia
Cimino, James J.
Patel, Vimla L.
Zhou, Yuchun
Shubrook, Jay H.
De Lacalle, Sonsoles
Draghi, Brooke N.
Ernst, Mytchell A.
Weaver, Aneesa
Sekar, Shriram
Liu, Chang
author_facet Jing, Xia
Cimino, James J.
Patel, Vimla L.
Zhou, Yuchun
Shubrook, Jay H.
De Lacalle, Sonsoles
Draghi, Brooke N.
Ernst, Mytchell A.
Weaver, Aneesa
Sekar, Shriram
Liu, Chang
author_sort Jing, Xia
collection PubMed
description OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical researchers and separated them into “experienced” and “inexperienced” groups. Participants were randomly assigned to a VIADS or control group within the groups. Each participant conducted a remote 2-hour study session for hypothesis generation with the same study facilitator on the same datasets by following a think-aloud protocol. Screen activities and audio were recorded, transcribed, coded, and analyzed. Hypotheses were evaluated by seven experts on their validity, significance, and feasibility. We conducted multilevel random effect modeling for statistical tests. RESULTS: Eighteen participants generated 227 hypotheses, of which 147 (65%) were valid. The VIADS and control groups generated a similar number of hypotheses. The VIADS group took a significantly shorter time to generate one hypothesis (e.g., among inexperienced clinical researchers, 258 seconds versus 379 seconds, p = 0.046, power = 0.437, ICC = 0.15). The VIADS group received significantly lower ratings than the control group on feasibility and the combination rating of validity, significance, and feasibility. CONCLUSION: The role of VIADS in hypothesis generation seems inconclusive. The VIADS group took a significantly shorter time to generate each hypothesis. However, the combined validity, significance, and feasibility ratings of their hypotheses were significantly lower. Further characterization of hypotheses, including specifics on how they might be improved, could guide future tool development.
format Online
Article
Text
id pubmed-10274969
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-102749692023-06-17 Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools Jing, Xia Cimino, James J. Patel, Vimla L. Zhou, Yuchun Shubrook, Jay H. De Lacalle, Sonsoles Draghi, Brooke N. Ernst, Mytchell A. Weaver, Aneesa Sekar, Shriram Liu, Chang medRxiv Article OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical researchers and separated them into “experienced” and “inexperienced” groups. Participants were randomly assigned to a VIADS or control group within the groups. Each participant conducted a remote 2-hour study session for hypothesis generation with the same study facilitator on the same datasets by following a think-aloud protocol. Screen activities and audio were recorded, transcribed, coded, and analyzed. Hypotheses were evaluated by seven experts on their validity, significance, and feasibility. We conducted multilevel random effect modeling for statistical tests. RESULTS: Eighteen participants generated 227 hypotheses, of which 147 (65%) were valid. The VIADS and control groups generated a similar number of hypotheses. The VIADS group took a significantly shorter time to generate one hypothesis (e.g., among inexperienced clinical researchers, 258 seconds versus 379 seconds, p = 0.046, power = 0.437, ICC = 0.15). The VIADS group received significantly lower ratings than the control group on feasibility and the combination rating of validity, significance, and feasibility. CONCLUSION: The role of VIADS in hypothesis generation seems inconclusive. The VIADS group took a significantly shorter time to generate each hypothesis. However, the combined validity, significance, and feasibility ratings of their hypotheses were significantly lower. Further characterization of hypotheses, including specifics on how they might be improved, could guide future tool development. Cold Spring Harbor Laboratory 2023-10-31 /pmc/articles/PMC10274969/ /pubmed/37333271 http://dx.doi.org/10.1101/2023.05.30.23290719 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Jing, Xia
Cimino, James J.
Patel, Vimla L.
Zhou, Yuchun
Shubrook, Jay H.
De Lacalle, Sonsoles
Draghi, Brooke N.
Ernst, Mytchell A.
Weaver, Aneesa
Sekar, Shriram
Liu, Chang
Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title_full Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title_fullStr Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title_full_unstemmed Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title_short Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
title_sort data-driven hypothesis generation among inexperienced clinical researchers: a comparison of secondary data analyses with visualization (viads) and other tools
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274969/
https://www.ncbi.nlm.nih.gov/pubmed/37333271
http://dx.doi.org/10.1101/2023.05.30.23290719
work_keys_str_mv AT jingxia datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT ciminojamesj datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT patelvimlal datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT zhouyuchun datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT shubrookjayh datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT delacallesonsoles datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT draghibrooken datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT ernstmytchella datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT weaveraneesa datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT sekarshriram datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools
AT liuchang datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools