Cargando…

EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive

Making raw data available to the research community is one of the pillars of Findability, Accessibility, Interoperability, and Reuse (FAIR) research. However, the submission of raw data to public databases still involves many manually operated procedures that are intrinsically time-consuming and err...

Descripción completa

Detalles Bibliográficos
Autores principales: Viviani, Marco, Montemurro, Marilisa, Trusolino, Livio, Bertotti, Andrea, Urgese, Gianvito, Grassi, Elena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098081/
https://www.ncbi.nlm.nih.gov/pubmed/37063647
http://dx.doi.org/10.3389/fbinf.2023.1143014
_version_ 1785024720989585408
author Viviani, Marco
Montemurro, Marilisa
Trusolino, Livio
Bertotti, Andrea
Urgese, Gianvito
Grassi, Elena
author_facet Viviani, Marco
Montemurro, Marilisa
Trusolino, Livio
Bertotti, Andrea
Urgese, Gianvito
Grassi, Elena
author_sort Viviani, Marco
collection PubMed
description Making raw data available to the research community is one of the pillars of Findability, Accessibility, Interoperability, and Reuse (FAIR) research. However, the submission of raw data to public databases still involves many manually operated procedures that are intrinsically time-consuming and error-prone, which raises potential reliability issues for both the data themselves and the ensuing metadata. For example, submitting sequencing data to the European Genome-phenome Archive (EGA) is estimated to take 1 month overall, and mainly relies on a web interface for metadata management that requires manual completion of forms and the upload of several comma separated values (CSV) files, which are not structured from a formal point of view. To tackle these limitations, here we present EGAsubmitter, a Snakemake-based pipeline that guides the user across all the submission steps, ranging from files encryption and upload, to metadata submission. EGASubmitter is expected to streamline the automated submission of sequencing data to EGA, minimizing user errors and ensuring higher end product fidelity.
format Online
Article
Text
id pubmed-10098081
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100980812023-04-14 EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive Viviani, Marco Montemurro, Marilisa Trusolino, Livio Bertotti, Andrea Urgese, Gianvito Grassi, Elena Front Bioinform Bioinformatics Making raw data available to the research community is one of the pillars of Findability, Accessibility, Interoperability, and Reuse (FAIR) research. However, the submission of raw data to public databases still involves many manually operated procedures that are intrinsically time-consuming and error-prone, which raises potential reliability issues for both the data themselves and the ensuing metadata. For example, submitting sequencing data to the European Genome-phenome Archive (EGA) is estimated to take 1 month overall, and mainly relies on a web interface for metadata management that requires manual completion of forms and the upload of several comma separated values (CSV) files, which are not structured from a formal point of view. To tackle these limitations, here we present EGAsubmitter, a Snakemake-based pipeline that guides the user across all the submission steps, ranging from files encryption and upload, to metadata submission. EGASubmitter is expected to streamline the automated submission of sequencing data to EGA, minimizing user errors and ensuring higher end product fidelity. Frontiers Media S.A. 2023-03-30 /pmc/articles/PMC10098081/ /pubmed/37063647 http://dx.doi.org/10.3389/fbinf.2023.1143014 Text en Copyright © 2023 Viviani, Montemurro, Trusolino, Bertotti, Urgese and Grassi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Viviani, Marco
Montemurro, Marilisa
Trusolino, Livio
Bertotti, Andrea
Urgese, Gianvito
Grassi, Elena
EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title_full EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title_fullStr EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title_full_unstemmed EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title_short EGAsubmitter: A software to automate submission of nucleic acid sequencing data to the European Genome-phenome Archive
title_sort egasubmitter: a software to automate submission of nucleic acid sequencing data to the european genome-phenome archive
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10098081/
https://www.ncbi.nlm.nih.gov/pubmed/37063647
http://dx.doi.org/10.3389/fbinf.2023.1143014
work_keys_str_mv AT vivianimarco egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive
AT montemurromarilisa egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive
AT trusolinolivio egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive
AT bertottiandrea egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive
AT urgesegianvito egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive
AT grassielena egasubmitterasoftwaretoautomatesubmissionofnucleicacidsequencingdatatotheeuropeangenomephenomearchive