Cargando…
2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-v...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798981/ http://dx.doi.org/10.1017/cts.2017.78 |
_version_ | 1783460180569096192 |
---|---|
author | Gouripeddi, Ram Cummins, Mollie Madsen, Randy LaSalle, Bernie Middleton Redd, Andrew Paige Presson, Angela Ye, Xiangyang Facelli, Julio C. Green, Tom Harper, Steve |
author_facet | Gouripeddi, Ram Cummins, Mollie Madsen, Randy LaSalle, Bernie Middleton Redd, Andrew Paige Presson, Angela Ye, Xiangyang Facelli, Julio C. Green, Tom Harper, Steve |
author_sort | Gouripeddi, Ram |
collection | PubMed |
description | OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-values, low power, poor experimental design) and computational (data, code and software management) issues. These require understanding the processes and workflows practiced by an organization, and the development and use of metrics to quantify reproducibility. METHODS/STUDY POPULATION: Within the Foundation of Discovery – Population Health Research, Center for Clinical and Translational Science, University of Utah, we are undertaking a project to streamline the study design and statistical analysis workflows and processes. As a first step we met with key stakeholders to understand the current practices by eliciting example statistical projects, and then developed process information models for different types of statistical needs using Lucidchart. We then reviewed these with the Foundation’s leadership and the Standards Committee to come up with ideal workflows and model, and defined key measurement points (such as those around study design, analysis plan, final report, requirements for quality checks, and double coding) for assessing reproducibility. As next steps we are using our finding to embed analytical and infrastructural approaches within the statisticians’ workflows. This will include data and code dissemination platforms such as Box, Bitbucket, and GitHub, documentation platforms such as Confluence, and workflow tracking platforms such as Jira. These tools will simplify and automate the capture of communications as a statistician work through a project. Data-intensive process will use process-workflow management platforms such as Activiti, Pegasus, and Taverna. RESULTS/ANTICIPATED RESULTS: These strategies for sharing and publishing study protocols, data, code, and results across the spectrum, active collaboration with the research team, automation of key steps, along with decision support. DISCUSSION/SIGNIFICANCE OF IMPACT: This analysis of statistical methods and process and computational methods to automate them ensure quality of statistical methods and reproducibility of research. |
format | Online Article Text |
id | pubmed-6798981 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-67989812019-10-28 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility Gouripeddi, Ram Cummins, Mollie Madsen, Randy LaSalle, Bernie Middleton Redd, Andrew Paige Presson, Angela Ye, Xiangyang Facelli, Julio C. Green, Tom Harper, Steve J Clin Transl Sci Biomedical Informatics/Health Informatics OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-values, low power, poor experimental design) and computational (data, code and software management) issues. These require understanding the processes and workflows practiced by an organization, and the development and use of metrics to quantify reproducibility. METHODS/STUDY POPULATION: Within the Foundation of Discovery – Population Health Research, Center for Clinical and Translational Science, University of Utah, we are undertaking a project to streamline the study design and statistical analysis workflows and processes. As a first step we met with key stakeholders to understand the current practices by eliciting example statistical projects, and then developed process information models for different types of statistical needs using Lucidchart. We then reviewed these with the Foundation’s leadership and the Standards Committee to come up with ideal workflows and model, and defined key measurement points (such as those around study design, analysis plan, final report, requirements for quality checks, and double coding) for assessing reproducibility. As next steps we are using our finding to embed analytical and infrastructural approaches within the statisticians’ workflows. This will include data and code dissemination platforms such as Box, Bitbucket, and GitHub, documentation platforms such as Confluence, and workflow tracking platforms such as Jira. These tools will simplify and automate the capture of communications as a statistician work through a project. Data-intensive process will use process-workflow management platforms such as Activiti, Pegasus, and Taverna. RESULTS/ANTICIPATED RESULTS: These strategies for sharing and publishing study protocols, data, code, and results across the spectrum, active collaboration with the research team, automation of key steps, along with decision support. DISCUSSION/SIGNIFICANCE OF IMPACT: This analysis of statistical methods and process and computational methods to automate them ensure quality of statistical methods and reproducibility of research. Cambridge University Press 2018-05-10 /pmc/articles/PMC6798981/ http://dx.doi.org/10.1017/cts.2017.78 Text en © The Association for Clinical and Translational Science 2018 http://creativecommons.org/licenses/by/4.0/ This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Biomedical Informatics/Health Informatics Gouripeddi, Ram Cummins, Mollie Madsen, Randy LaSalle, Bernie Middleton Redd, Andrew Paige Presson, Angela Ye, Xiangyang Facelli, Julio C. Green, Tom Harper, Steve 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title | 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title_full | 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title_fullStr | 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title_full_unstemmed | 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title_short | 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility |
title_sort | 2469: streamlining study design and statistical analysis for quality improvement and research reproducibility |
topic | Biomedical Informatics/Health Informatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798981/ http://dx.doi.org/10.1017/cts.2017.78 |
work_keys_str_mv | AT gouripeddiram 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT cumminsmollie 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT madsenrandy 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT lasallebernie 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT middletonreddandrew 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT paigepressonangela 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT yexiangyang 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT facellijulioc 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT greentom 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility AT harpersteve 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility |