Cargando…

Seq2science: an end-to-end workflow for functional genomics analysis

Sequencing databases contain enormous amounts of functional genomics data, making them an extensive resource for genome-scale analysis. Reanalyzing publicly available data, and integrating it with new, project-specific data sets, can be invaluable. With current technologies, genomic experiments have...

Descripción completa

Detalles Bibliográficos
Autores principales: van der Sande, Maarten, Frölich, Siebren, Schäfers, Tilman, Smits, Jos G.A., Snabel, Rebecca R., Rinzema, Sybren, van Heeringen, Simon J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656911/
https://www.ncbi.nlm.nih.gov/pubmed/38025697
http://dx.doi.org/10.7717/peerj.16380
_version_ 1785148097732542464
author van der Sande, Maarten
Frölich, Siebren
Schäfers, Tilman
Smits, Jos G.A.
Snabel, Rebecca R.
Rinzema, Sybren
van Heeringen, Simon J.
author_facet van der Sande, Maarten
Frölich, Siebren
Schäfers, Tilman
Smits, Jos G.A.
Snabel, Rebecca R.
Rinzema, Sybren
van Heeringen, Simon J.
author_sort van der Sande, Maarten
collection PubMed
description Sequencing databases contain enormous amounts of functional genomics data, making them an extensive resource for genome-scale analysis. Reanalyzing publicly available data, and integrating it with new, project-specific data sets, can be invaluable. With current technologies, genomic experiments have become feasible for virtually any species of interest. However, using and integrating this data comes with its challenges, such as standardized and reproducible analysis. Seq2science is a multi-purpose workflow that covers preprocessing, quality control, visualization, and analysis of functional genomics sequencing data. It facilitates the downloading of sequencing data from all major databases, including NCBI SRA, EBI ENA, DDBJ, GSA, and ENCODE. Furthermore, it automates the retrieval of any genome assembly available from Ensembl, NCBI, and UCSC. It has been tested on a variety of species, and includes diverse workflows such as ATAC-, RNA-, and ChIP-seq. It consists of both generic as well as advanced steps, such as differential gene expression or peak accessibility analysis and differential motif analysis. Seq2science is built on the Snakemake workflow language and thus can be run on a range of computing infrastructures. It is available at https://github.com/vanheeringen-lab/seq2science.
format Online
Article
Text
id pubmed-10656911
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-106569112023-11-15 Seq2science: an end-to-end workflow for functional genomics analysis van der Sande, Maarten Frölich, Siebren Schäfers, Tilman Smits, Jos G.A. Snabel, Rebecca R. Rinzema, Sybren van Heeringen, Simon J. PeerJ Bioinformatics Sequencing databases contain enormous amounts of functional genomics data, making them an extensive resource for genome-scale analysis. Reanalyzing publicly available data, and integrating it with new, project-specific data sets, can be invaluable. With current technologies, genomic experiments have become feasible for virtually any species of interest. However, using and integrating this data comes with its challenges, such as standardized and reproducible analysis. Seq2science is a multi-purpose workflow that covers preprocessing, quality control, visualization, and analysis of functional genomics sequencing data. It facilitates the downloading of sequencing data from all major databases, including NCBI SRA, EBI ENA, DDBJ, GSA, and ENCODE. Furthermore, it automates the retrieval of any genome assembly available from Ensembl, NCBI, and UCSC. It has been tested on a variety of species, and includes diverse workflows such as ATAC-, RNA-, and ChIP-seq. It consists of both generic as well as advanced steps, such as differential gene expression or peak accessibility analysis and differential motif analysis. Seq2science is built on the Snakemake workflow language and thus can be run on a range of computing infrastructures. It is available at https://github.com/vanheeringen-lab/seq2science. PeerJ Inc. 2023-11-15 /pmc/articles/PMC10656911/ /pubmed/38025697 http://dx.doi.org/10.7717/peerj.16380 Text en © 2023 van der Sande et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
van der Sande, Maarten
Frölich, Siebren
Schäfers, Tilman
Smits, Jos G.A.
Snabel, Rebecca R.
Rinzema, Sybren
van Heeringen, Simon J.
Seq2science: an end-to-end workflow for functional genomics analysis
title Seq2science: an end-to-end workflow for functional genomics analysis
title_full Seq2science: an end-to-end workflow for functional genomics analysis
title_fullStr Seq2science: an end-to-end workflow for functional genomics analysis
title_full_unstemmed Seq2science: an end-to-end workflow for functional genomics analysis
title_short Seq2science: an end-to-end workflow for functional genomics analysis
title_sort seq2science: an end-to-end workflow for functional genomics analysis
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656911/
https://www.ncbi.nlm.nih.gov/pubmed/38025697
http://dx.doi.org/10.7717/peerj.16380
work_keys_str_mv AT vandersandemaarten seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT frolichsiebren seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT schaferstilman seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT smitsjosga seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT snabelrebeccar seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT rinzemasybren seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis
AT vanheeringensimonj seq2scienceanendtoendworkflowforfunctionalgenomicsanalysis