Cargando…

Analytical code sharing practices in biomedical research

Data-driven computational analysis is becoming increasingly important in biomedical research, as the amount of data being generated continues to grow. However, the lack of practices of sharing research outputs, such as data, source code and methods, affects transparency and reproducibility of studie...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharma, Nitesh Kumar, Ayyala, Ram, Deshpande, Dhrithi, Patel, Yesha M, Munteanu, Viorel, Ciorba, Dumitru, Fiscutean, Andrada, Vahed, Mohammad, Sarkar, Aditya, Guo, Ruiwei, Moore, Andrew, Darci-Maher, Nicholas, Nogoy, Nicole A, Abedalthagafi, Malak S., Mangul, Serghei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10441317/
https://www.ncbi.nlm.nih.gov/pubmed/37609176
http://dx.doi.org/10.1101/2023.07.31.551384
_version_ 1785093352359723008
author Sharma, Nitesh Kumar
Ayyala, Ram
Deshpande, Dhrithi
Patel, Yesha M
Munteanu, Viorel
Ciorba, Dumitru
Fiscutean, Andrada
Vahed, Mohammad
Sarkar, Aditya
Guo, Ruiwei
Moore, Andrew
Darci-Maher, Nicholas
Nogoy, Nicole A
Abedalthagafi, Malak S.
Mangul, Serghei
author_facet Sharma, Nitesh Kumar
Ayyala, Ram
Deshpande, Dhrithi
Patel, Yesha M
Munteanu, Viorel
Ciorba, Dumitru
Fiscutean, Andrada
Vahed, Mohammad
Sarkar, Aditya
Guo, Ruiwei
Moore, Andrew
Darci-Maher, Nicholas
Nogoy, Nicole A
Abedalthagafi, Malak S.
Mangul, Serghei
author_sort Sharma, Nitesh Kumar
collection PubMed
description Data-driven computational analysis is becoming increasingly important in biomedical research, as the amount of data being generated continues to grow. However, the lack of practices of sharing research outputs, such as data, source code and methods, affects transparency and reproducibility of studies, which are critical to the advancement of science. Many published studies are not reproducible due to insufficient documentation, code, and data being shared. We conducted a comprehensive analysis of 453 manuscripts published between 2016–2021 and found that 50.1% of them fail to share the analytical code. Even among those that did disclose their code, a vast majority failed to offer additional research outputs, such as data. Furthermore, only one in ten papers organized their code in a structured and reproducible manner. We discovered a significant association between the presence of code availability statements and increased code availability (p=2.71×10(−9)). Additionally, a greater proportion of studies conducting secondary analyses were inclined to share their code compared to those conducting primary analyses (p=1.15*10(−07)). In light of our findings, we propose raising awareness of code sharing practices and taking immediate steps to enhance code availability to improve reproducibility in biomedical research. By increasing transparency and reproducibility, we can promote scientific rigor, encourage collaboration, and accelerate scientific discoveries. We must prioritize open science practices, including sharing code, data, and other research products, to ensure that biomedical research can be replicated and built upon by others in the scientific community.
format Online
Article
Text
id pubmed-10441317
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104413172023-08-22 Analytical code sharing practices in biomedical research Sharma, Nitesh Kumar Ayyala, Ram Deshpande, Dhrithi Patel, Yesha M Munteanu, Viorel Ciorba, Dumitru Fiscutean, Andrada Vahed, Mohammad Sarkar, Aditya Guo, Ruiwei Moore, Andrew Darci-Maher, Nicholas Nogoy, Nicole A Abedalthagafi, Malak S. Mangul, Serghei bioRxiv Article Data-driven computational analysis is becoming increasingly important in biomedical research, as the amount of data being generated continues to grow. However, the lack of practices of sharing research outputs, such as data, source code and methods, affects transparency and reproducibility of studies, which are critical to the advancement of science. Many published studies are not reproducible due to insufficient documentation, code, and data being shared. We conducted a comprehensive analysis of 453 manuscripts published between 2016–2021 and found that 50.1% of them fail to share the analytical code. Even among those that did disclose their code, a vast majority failed to offer additional research outputs, such as data. Furthermore, only one in ten papers organized their code in a structured and reproducible manner. We discovered a significant association between the presence of code availability statements and increased code availability (p=2.71×10(−9)). Additionally, a greater proportion of studies conducting secondary analyses were inclined to share their code compared to those conducting primary analyses (p=1.15*10(−07)). In light of our findings, we propose raising awareness of code sharing practices and taking immediate steps to enhance code availability to improve reproducibility in biomedical research. By increasing transparency and reproducibility, we can promote scientific rigor, encourage collaboration, and accelerate scientific discoveries. We must prioritize open science practices, including sharing code, data, and other research products, to ensure that biomedical research can be replicated and built upon by others in the scientific community. Cold Spring Harbor Laboratory 2023-08-07 /pmc/articles/PMC10441317/ /pubmed/37609176 http://dx.doi.org/10.1101/2023.07.31.551384 Text en https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (https://creativecommons.org/licenses/by-nc/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Sharma, Nitesh Kumar
Ayyala, Ram
Deshpande, Dhrithi
Patel, Yesha M
Munteanu, Viorel
Ciorba, Dumitru
Fiscutean, Andrada
Vahed, Mohammad
Sarkar, Aditya
Guo, Ruiwei
Moore, Andrew
Darci-Maher, Nicholas
Nogoy, Nicole A
Abedalthagafi, Malak S.
Mangul, Serghei
Analytical code sharing practices in biomedical research
title Analytical code sharing practices in biomedical research
title_full Analytical code sharing practices in biomedical research
title_fullStr Analytical code sharing practices in biomedical research
title_full_unstemmed Analytical code sharing practices in biomedical research
title_short Analytical code sharing practices in biomedical research
title_sort analytical code sharing practices in biomedical research
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10441317/
https://www.ncbi.nlm.nih.gov/pubmed/37609176
http://dx.doi.org/10.1101/2023.07.31.551384
work_keys_str_mv AT sharmaniteshkumar analyticalcodesharingpracticesinbiomedicalresearch
AT ayyalaram analyticalcodesharingpracticesinbiomedicalresearch
AT deshpandedhrithi analyticalcodesharingpracticesinbiomedicalresearch
AT patelyesham analyticalcodesharingpracticesinbiomedicalresearch
AT munteanuviorel analyticalcodesharingpracticesinbiomedicalresearch
AT ciorbadumitru analyticalcodesharingpracticesinbiomedicalresearch
AT fiscuteanandrada analyticalcodesharingpracticesinbiomedicalresearch
AT vahedmohammad analyticalcodesharingpracticesinbiomedicalresearch
AT sarkaraditya analyticalcodesharingpracticesinbiomedicalresearch
AT guoruiwei analyticalcodesharingpracticesinbiomedicalresearch
AT mooreandrew analyticalcodesharingpracticesinbiomedicalresearch
AT darcimahernicholas analyticalcodesharingpracticesinbiomedicalresearch
AT nogoynicolea analyticalcodesharingpracticesinbiomedicalresearch
AT abedalthagafimalaks analyticalcodesharingpracticesinbiomedicalresearch
AT mangulserghei analyticalcodesharingpracticesinbiomedicalresearch