Cargando…

scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets

SUMMARY: A common bioinformatics task in single-cell data analysis is to purify a cell type or cell population of interest from heterogeneous datasets. Here, we present scGate, an algorithm that automatizes marker-based purification of specific cell populations, without requiring training data or re...

Descripción completa

Detalles Bibliográficos
Autores principales: Andreatta, Massimo, Berenstein, Ariel J, Carmona, Santiago J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048671/
https://www.ncbi.nlm.nih.gov/pubmed/35258562
http://dx.doi.org/10.1093/bioinformatics/btac141
_version_ 1784695981529366528
author Andreatta, Massimo
Berenstein, Ariel J
Carmona, Santiago J
author_facet Andreatta, Massimo
Berenstein, Ariel J
Carmona, Santiago J
author_sort Andreatta, Massimo
collection PubMed
description SUMMARY: A common bioinformatics task in single-cell data analysis is to purify a cell type or cell population of interest from heterogeneous datasets. Here, we present scGate, an algorithm that automatizes marker-based purification of specific cell populations, without requiring training data or reference gene expression profiles. scGate purifies a cell population of interest using a set of markers organized in a hierarchical structure, akin to gating strategies employed in flow cytometry. scGate outperforms state-of-the-art single-cell classifiers and it can be applied to multiple modalities of single-cell data (e.g. RNA-seq, ATAC-seq, CITE-seq). scGate is implemented as an R package and integrated with the Seurat framework, providing an intuitive tool to isolate cell populations of interest from heterogeneous single-cell datasets. AVAILABILITY AND IMPLEMENTATION: scGate is available as an R package at https://github.com/carmonalab/scGate (https://doi.org/10.5281/zenodo.6202614). Several reproducible workflows describing the main functions and usage of the package on different single-cell modalities, as well as the code to reproduce the benchmark, can be found at https://github.com/carmonalab/scGate.demo (https://doi.org/10.5281/zenodo.6202585) and https://github.com/carmonalab/scGate.benchmark. Test data are available at https://doi.org/10.6084/m9.figshare.16826071. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9048671
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90486712022-04-29 scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets Andreatta, Massimo Berenstein, Ariel J Carmona, Santiago J Bioinformatics Applications Notes SUMMARY: A common bioinformatics task in single-cell data analysis is to purify a cell type or cell population of interest from heterogeneous datasets. Here, we present scGate, an algorithm that automatizes marker-based purification of specific cell populations, without requiring training data or reference gene expression profiles. scGate purifies a cell population of interest using a set of markers organized in a hierarchical structure, akin to gating strategies employed in flow cytometry. scGate outperforms state-of-the-art single-cell classifiers and it can be applied to multiple modalities of single-cell data (e.g. RNA-seq, ATAC-seq, CITE-seq). scGate is implemented as an R package and integrated with the Seurat framework, providing an intuitive tool to isolate cell populations of interest from heterogeneous single-cell datasets. AVAILABILITY AND IMPLEMENTATION: scGate is available as an R package at https://github.com/carmonalab/scGate (https://doi.org/10.5281/zenodo.6202614). Several reproducible workflows describing the main functions and usage of the package on different single-cell modalities, as well as the code to reproduce the benchmark, can be found at https://github.com/carmonalab/scGate.demo (https://doi.org/10.5281/zenodo.6202585) and https://github.com/carmonalab/scGate.benchmark. Test data are available at https://doi.org/10.6084/m9.figshare.16826071. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-03-08 /pmc/articles/PMC9048671/ /pubmed/35258562 http://dx.doi.org/10.1093/bioinformatics/btac141 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Andreatta, Massimo
Berenstein, Ariel J
Carmona, Santiago J
scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title_full scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title_fullStr scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title_full_unstemmed scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title_short scGate: marker-based purification of cell types from heterogeneous single-cell RNA-seq datasets
title_sort scgate: marker-based purification of cell types from heterogeneous single-cell rna-seq datasets
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048671/
https://www.ncbi.nlm.nih.gov/pubmed/35258562
http://dx.doi.org/10.1093/bioinformatics/btac141
work_keys_str_mv AT andreattamassimo scgatemarkerbasedpurificationofcelltypesfromheterogeneoussinglecellrnaseqdatasets
AT berensteinarielj scgatemarkerbasedpurificationofcelltypesfromheterogeneoussinglecellrnaseqdatasets
AT carmonasantiagoj scgatemarkerbasedpurificationofcelltypesfromheterogeneoussinglecellrnaseqdatasets