Cargando…

Spacemake: processing and analysis of large-scale spatial transcriptomics data

BACKGROUND: Spatial sequencing methods increasingly gain popularity within RNA biology studies. State-of-the-art techniques quantify messenger RNA expression levels from tissue sections and at the same time register information about the original locations of the molecules in the tissue. The resulti...

Descripción completa

Detalles Bibliográficos
Autores principales: Sztanka-Toth, Tamas Ryszard, Jens, Marvin, Karaiskos, Nikos, Rajewsky, Nikolaus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9295369/
https://www.ncbi.nlm.nih.gov/pubmed/35852420
http://dx.doi.org/10.1093/gigascience/giac064
_version_ 1784750046027186176
author Sztanka-Toth, Tamas Ryszard
Jens, Marvin
Karaiskos, Nikos
Rajewsky, Nikolaus
author_facet Sztanka-Toth, Tamas Ryszard
Jens, Marvin
Karaiskos, Nikos
Rajewsky, Nikolaus
author_sort Sztanka-Toth, Tamas Ryszard
collection PubMed
description BACKGROUND: Spatial sequencing methods increasingly gain popularity within RNA biology studies. State-of-the-art techniques quantify messenger RNA expression levels from tissue sections and at the same time register information about the original locations of the molecules in the tissue. The resulting data sets are processed and analyzed by accompanying software that, however, is incompatible across inputs from different technologies. FINDINGS: Here, we present spacemake, a modular, robust, and scalable spatial transcriptomics pipeline built in Snakemake and Python. Spacemake is designed to handle all major spatial transcriptomics data sets and can be readily configured for other technologies. It can process and analyze several samples in parallel, even if they stem from different experimental methods. Spacemake's unified framework enables reproducible data processing from raw sequencing data to automatically generated downstream analysis reports. Spacemake is built with a modular design and offers additional functionality such as sample merging, saturation analysis, and analysis of long reads as separate modules. Moreover, spacemake employs novoSpaRc to integrate spatial and single-cell transcriptomics data, resulting in increased gene counts for the spatial data set. Spacemake is open source and extendable, and it can be seamlessly integrated with existing computational workflows.
format Online
Article
Text
id pubmed-9295369
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-92953692022-07-20 Spacemake: processing and analysis of large-scale spatial transcriptomics data Sztanka-Toth, Tamas Ryszard Jens, Marvin Karaiskos, Nikos Rajewsky, Nikolaus Gigascience Technical Note BACKGROUND: Spatial sequencing methods increasingly gain popularity within RNA biology studies. State-of-the-art techniques quantify messenger RNA expression levels from tissue sections and at the same time register information about the original locations of the molecules in the tissue. The resulting data sets are processed and analyzed by accompanying software that, however, is incompatible across inputs from different technologies. FINDINGS: Here, we present spacemake, a modular, robust, and scalable spatial transcriptomics pipeline built in Snakemake and Python. Spacemake is designed to handle all major spatial transcriptomics data sets and can be readily configured for other technologies. It can process and analyze several samples in parallel, even if they stem from different experimental methods. Spacemake's unified framework enables reproducible data processing from raw sequencing data to automatically generated downstream analysis reports. Spacemake is built with a modular design and offers additional functionality such as sample merging, saturation analysis, and analysis of long reads as separate modules. Moreover, spacemake employs novoSpaRc to integrate spatial and single-cell transcriptomics data, resulting in increased gene counts for the spatial data set. Spacemake is open source and extendable, and it can be seamlessly integrated with existing computational workflows. Oxford University Press 2022-07-19 /pmc/articles/PMC9295369/ /pubmed/35852420 http://dx.doi.org/10.1093/gigascience/giac064 Text en © The Author(s) 2022. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Sztanka-Toth, Tamas Ryszard
Jens, Marvin
Karaiskos, Nikos
Rajewsky, Nikolaus
Spacemake: processing and analysis of large-scale spatial transcriptomics data
title Spacemake: processing and analysis of large-scale spatial transcriptomics data
title_full Spacemake: processing and analysis of large-scale spatial transcriptomics data
title_fullStr Spacemake: processing and analysis of large-scale spatial transcriptomics data
title_full_unstemmed Spacemake: processing and analysis of large-scale spatial transcriptomics data
title_short Spacemake: processing and analysis of large-scale spatial transcriptomics data
title_sort spacemake: processing and analysis of large-scale spatial transcriptomics data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9295369/
https://www.ncbi.nlm.nih.gov/pubmed/35852420
http://dx.doi.org/10.1093/gigascience/giac064
work_keys_str_mv AT sztankatothtamasryszard spacemakeprocessingandanalysisoflargescalespatialtranscriptomicsdata
AT jensmarvin spacemakeprocessingandanalysisoflargescalespatialtranscriptomicsdata
AT karaiskosnikos spacemakeprocessingandanalysisoflargescalespatialtranscriptomicsdata
AT rajewskynikolaus spacemakeprocessingandanalysisoflargescalespatialtranscriptomicsdata