Cargando…
Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools
OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical resea...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274969/ https://www.ncbi.nlm.nih.gov/pubmed/37333271 http://dx.doi.org/10.1101/2023.05.30.23290719 |
_version_ | 1785059818216620032 |
---|---|
author | Jing, Xia Cimino, James J. Patel, Vimla L. Zhou, Yuchun Shubrook, Jay H. De Lacalle, Sonsoles Draghi, Brooke N. Ernst, Mytchell A. Weaver, Aneesa Sekar, Shriram Liu, Chang |
author_facet | Jing, Xia Cimino, James J. Patel, Vimla L. Zhou, Yuchun Shubrook, Jay H. De Lacalle, Sonsoles Draghi, Brooke N. Ernst, Mytchell A. Weaver, Aneesa Sekar, Shriram Liu, Chang |
author_sort | Jing, Xia |
collection | PubMed |
description | OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical researchers and separated them into “experienced” and “inexperienced” groups. Participants were randomly assigned to a VIADS or control group within the groups. Each participant conducted a remote 2-hour study session for hypothesis generation with the same study facilitator on the same datasets by following a think-aloud protocol. Screen activities and audio were recorded, transcribed, coded, and analyzed. Hypotheses were evaluated by seven experts on their validity, significance, and feasibility. We conducted multilevel random effect modeling for statistical tests. RESULTS: Eighteen participants generated 227 hypotheses, of which 147 (65%) were valid. The VIADS and control groups generated a similar number of hypotheses. The VIADS group took a significantly shorter time to generate one hypothesis (e.g., among inexperienced clinical researchers, 258 seconds versus 379 seconds, p = 0.046, power = 0.437, ICC = 0.15). The VIADS group received significantly lower ratings than the control group on feasibility and the combination rating of validity, significance, and feasibility. CONCLUSION: The role of VIADS in hypothesis generation seems inconclusive. The VIADS group took a significantly shorter time to generate each hypothesis. However, the combined validity, significance, and feasibility ratings of their hypotheses were significantly lower. Further characterization of hypotheses, including specifics on how they might be improved, could guide future tool development. |
format | Online Article Text |
id | pubmed-10274969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-102749692023-06-17 Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools Jing, Xia Cimino, James J. Patel, Vimla L. Zhou, Yuchun Shubrook, Jay H. De Lacalle, Sonsoles Draghi, Brooke N. Ernst, Mytchell A. Weaver, Aneesa Sekar, Shriram Liu, Chang medRxiv Article OBJECTIVES: To compare how clinical researchers generate data-driven hypotheses with a visual interactive analytic tool (VIADS, a visual interactive analysis tool for filtering and summarizing large data sets coded with hierarchical terminologies) or other tools. METHODS: We recruited clinical researchers and separated them into “experienced” and “inexperienced” groups. Participants were randomly assigned to a VIADS or control group within the groups. Each participant conducted a remote 2-hour study session for hypothesis generation with the same study facilitator on the same datasets by following a think-aloud protocol. Screen activities and audio were recorded, transcribed, coded, and analyzed. Hypotheses were evaluated by seven experts on their validity, significance, and feasibility. We conducted multilevel random effect modeling for statistical tests. RESULTS: Eighteen participants generated 227 hypotheses, of which 147 (65%) were valid. The VIADS and control groups generated a similar number of hypotheses. The VIADS group took a significantly shorter time to generate one hypothesis (e.g., among inexperienced clinical researchers, 258 seconds versus 379 seconds, p = 0.046, power = 0.437, ICC = 0.15). The VIADS group received significantly lower ratings than the control group on feasibility and the combination rating of validity, significance, and feasibility. CONCLUSION: The role of VIADS in hypothesis generation seems inconclusive. The VIADS group took a significantly shorter time to generate each hypothesis. However, the combined validity, significance, and feasibility ratings of their hypotheses were significantly lower. Further characterization of hypotheses, including specifics on how they might be improved, could guide future tool development. Cold Spring Harbor Laboratory 2023-10-31 /pmc/articles/PMC10274969/ /pubmed/37333271 http://dx.doi.org/10.1101/2023.05.30.23290719 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Jing, Xia Cimino, James J. Patel, Vimla L. Zhou, Yuchun Shubrook, Jay H. De Lacalle, Sonsoles Draghi, Brooke N. Ernst, Mytchell A. Weaver, Aneesa Sekar, Shriram Liu, Chang Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title | Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title_full | Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title_fullStr | Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title_full_unstemmed | Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title_short | Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools |
title_sort | data-driven hypothesis generation among inexperienced clinical researchers: a comparison of secondary data analyses with visualization (viads) and other tools |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10274969/ https://www.ncbi.nlm.nih.gov/pubmed/37333271 http://dx.doi.org/10.1101/2023.05.30.23290719 |
work_keys_str_mv | AT jingxia datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT ciminojamesj datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT patelvimlal datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT zhouyuchun datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT shubrookjayh datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT delacallesonsoles datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT draghibrooken datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT ernstmytchella datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT weaveraneesa datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT sekarshriram datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools AT liuchang datadrivenhypothesisgenerationamonginexperiencedclinicalresearchersacomparisonofsecondarydataanalyseswithvisualizationviadsandothertools |