Cargando…

2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility

OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-v...

Descripción completa

Detalles Bibliográficos
Autores principales: Gouripeddi, Ram, Cummins, Mollie, Madsen, Randy, LaSalle, Bernie, Middleton Redd, Andrew, Paige Presson, Angela, Ye, Xiangyang, Facelli, Julio C., Green, Tom, Harper, Steve
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798981/
http://dx.doi.org/10.1017/cts.2017.78
_version_ 1783460180569096192
author Gouripeddi, Ram
Cummins, Mollie
Madsen, Randy
LaSalle, Bernie
Middleton Redd, Andrew
Paige Presson, Angela
Ye, Xiangyang
Facelli, Julio C.
Green, Tom
Harper, Steve
author_facet Gouripeddi, Ram
Cummins, Mollie
Madsen, Randy
LaSalle, Bernie
Middleton Redd, Andrew
Paige Presson, Angela
Ye, Xiangyang
Facelli, Julio C.
Green, Tom
Harper, Steve
author_sort Gouripeddi, Ram
collection PubMed
description OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-values, low power, poor experimental design) and computational (data, code and software management) issues. These require understanding the processes and workflows practiced by an organization, and the development and use of metrics to quantify reproducibility. METHODS/STUDY POPULATION: Within the Foundation of Discovery – Population Health Research, Center for Clinical and Translational Science, University of Utah, we are undertaking a project to streamline the study design and statistical analysis workflows and processes. As a first step we met with key stakeholders to understand the current practices by eliciting example statistical projects, and then developed process information models for different types of statistical needs using Lucidchart. We then reviewed these with the Foundation’s leadership and the Standards Committee to come up with ideal workflows and model, and defined key measurement points (such as those around study design, analysis plan, final report, requirements for quality checks, and double coding) for assessing reproducibility. As next steps we are using our finding to embed analytical and infrastructural approaches within the statisticians’ workflows. This will include data and code dissemination platforms such as Box, Bitbucket, and GitHub, documentation platforms such as Confluence, and workflow tracking platforms such as Jira. These tools will simplify and automate the capture of communications as a statistician work through a project. Data-intensive process will use process-workflow management platforms such as Activiti, Pegasus, and Taverna. RESULTS/ANTICIPATED RESULTS: These strategies for sharing and publishing study protocols, data, code, and results across the spectrum, active collaboration with the research team, automation of key steps, along with decision support. DISCUSSION/SIGNIFICANCE OF IMPACT: This analysis of statistical methods and process and computational methods to automate them ensure quality of statistical methods and reproducibility of research.
format Online
Article
Text
id pubmed-6798981
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-67989812019-10-28 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility Gouripeddi, Ram Cummins, Mollie Madsen, Randy LaSalle, Bernie Middleton Redd, Andrew Paige Presson, Angela Ye, Xiangyang Facelli, Julio C. Green, Tom Harper, Steve J Clin Transl Sci Biomedical Informatics/Health Informatics OBJECTIVES/SPECIFIC AIMS: Key factors causing irreproducibility of research include those related to inappropriate study design methodologies and statistical analysis. In modern statistical practice irreproducibility could arise due to statistical (false discoveries, p-hacking, overuse/misuse of p-values, low power, poor experimental design) and computational (data, code and software management) issues. These require understanding the processes and workflows practiced by an organization, and the development and use of metrics to quantify reproducibility. METHODS/STUDY POPULATION: Within the Foundation of Discovery – Population Health Research, Center for Clinical and Translational Science, University of Utah, we are undertaking a project to streamline the study design and statistical analysis workflows and processes. As a first step we met with key stakeholders to understand the current practices by eliciting example statistical projects, and then developed process information models for different types of statistical needs using Lucidchart. We then reviewed these with the Foundation’s leadership and the Standards Committee to come up with ideal workflows and model, and defined key measurement points (such as those around study design, analysis plan, final report, requirements for quality checks, and double coding) for assessing reproducibility. As next steps we are using our finding to embed analytical and infrastructural approaches within the statisticians’ workflows. This will include data and code dissemination platforms such as Box, Bitbucket, and GitHub, documentation platforms such as Confluence, and workflow tracking platforms such as Jira. These tools will simplify and automate the capture of communications as a statistician work through a project. Data-intensive process will use process-workflow management platforms such as Activiti, Pegasus, and Taverna. RESULTS/ANTICIPATED RESULTS: These strategies for sharing and publishing study protocols, data, code, and results across the spectrum, active collaboration with the research team, automation of key steps, along with decision support. DISCUSSION/SIGNIFICANCE OF IMPACT: This analysis of statistical methods and process and computational methods to automate them ensure quality of statistical methods and reproducibility of research. Cambridge University Press 2018-05-10 /pmc/articles/PMC6798981/ http://dx.doi.org/10.1017/cts.2017.78 Text en © The Association for Clinical and Translational Science 2018 http://creativecommons.org/licenses/by/4.0/ This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Biomedical Informatics/Health Informatics
Gouripeddi, Ram
Cummins, Mollie
Madsen, Randy
LaSalle, Bernie
Middleton Redd, Andrew
Paige Presson, Angela
Ye, Xiangyang
Facelli, Julio C.
Green, Tom
Harper, Steve
2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title_full 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title_fullStr 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title_full_unstemmed 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title_short 2469: Streamlining study design and statistical analysis for quality improvement and research reproducibility
title_sort 2469: streamlining study design and statistical analysis for quality improvement and research reproducibility
topic Biomedical Informatics/Health Informatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798981/
http://dx.doi.org/10.1017/cts.2017.78
work_keys_str_mv AT gouripeddiram 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT cumminsmollie 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT madsenrandy 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT lasallebernie 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT middletonreddandrew 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT paigepressonangela 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT yexiangyang 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT facellijulioc 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT greentom 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility
AT harpersteve 2469streamliningstudydesignandstatisticalanalysisforqualityimprovementandresearchreproducibility