Cargando…

Cytocipher determines significantly different populations of cells in single-cell RNA-seq data

MOTIVATION: Identification of cell types using single-cell RNA-seq is revolutionizing the study of multicellular organisms. However, typical single-cell RNA-seq analysis often involves post hoc manual curation to ensure clusters are transcriptionally distinct, which is time-consuming, error-prone, a...

Descripción completa

Detalles Bibliográficos
Autores principales: Balderson, Brad, Piper, Michael, Thor, Stefan, Bodén, Mikael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368802/
https://www.ncbi.nlm.nih.gov/pubmed/37449901
http://dx.doi.org/10.1093/bioinformatics/btad435
_version_ 1785077581735788544
author Balderson, Brad
Piper, Michael
Thor, Stefan
Bodén, Mikael
author_facet Balderson, Brad
Piper, Michael
Thor, Stefan
Bodén, Mikael
author_sort Balderson, Brad
collection PubMed
description MOTIVATION: Identification of cell types using single-cell RNA-seq is revolutionizing the study of multicellular organisms. However, typical single-cell RNA-seq analysis often involves post hoc manual curation to ensure clusters are transcriptionally distinct, which is time-consuming, error-prone, and irreproducible. RESULTS: To overcome these obstacles, we developed Cytocipher, a bioinformatics method and scverse compatible software package that statistically determines significant clusters. Application of Cytocipher to normal tissue, development, disease, and large-scale atlas data reveals the broad applicability and power of Cytocipher to generate biological insights in numerous contexts. This included the identification of cell types not previously described in the datasets analysed, such as CD8+ T cell subtypes in human peripheral blood mononuclear cells; cell lineage intermediate states during mouse pancreas development; and subpopulations of luminal epithelial cells over-represented in prostate cancer. Cytocipher also scales to large datasets with high-test performance, as shown by application to the Tabula Sapiens Atlas representing >480 000 cells. Cytocipher is a novel and generalizable method that statistically determines transcriptionally distinct and programmatically reproducible clusters from single-cell data. AVAILABILITY AND IMPLEMENTATION: The software version used for this manuscript has been deposited on Zenodo (https://doi.org/10.5281/zenodo.8089546), and is also available via github (https://github.com/BradBalderson/Cytocipher).
format Online
Article
Text
id pubmed-10368802
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-103688022023-07-27 Cytocipher determines significantly different populations of cells in single-cell RNA-seq data Balderson, Brad Piper, Michael Thor, Stefan Bodén, Mikael Bioinformatics Original Paper MOTIVATION: Identification of cell types using single-cell RNA-seq is revolutionizing the study of multicellular organisms. However, typical single-cell RNA-seq analysis often involves post hoc manual curation to ensure clusters are transcriptionally distinct, which is time-consuming, error-prone, and irreproducible. RESULTS: To overcome these obstacles, we developed Cytocipher, a bioinformatics method and scverse compatible software package that statistically determines significant clusters. Application of Cytocipher to normal tissue, development, disease, and large-scale atlas data reveals the broad applicability and power of Cytocipher to generate biological insights in numerous contexts. This included the identification of cell types not previously described in the datasets analysed, such as CD8+ T cell subtypes in human peripheral blood mononuclear cells; cell lineage intermediate states during mouse pancreas development; and subpopulations of luminal epithelial cells over-represented in prostate cancer. Cytocipher also scales to large datasets with high-test performance, as shown by application to the Tabula Sapiens Atlas representing >480 000 cells. Cytocipher is a novel and generalizable method that statistically determines transcriptionally distinct and programmatically reproducible clusters from single-cell data. AVAILABILITY AND IMPLEMENTATION: The software version used for this manuscript has been deposited on Zenodo (https://doi.org/10.5281/zenodo.8089546), and is also available via github (https://github.com/BradBalderson/Cytocipher). Oxford University Press 2023-07-14 /pmc/articles/PMC10368802/ /pubmed/37449901 http://dx.doi.org/10.1093/bioinformatics/btad435 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Balderson, Brad
Piper, Michael
Thor, Stefan
Bodén, Mikael
Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title_full Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title_fullStr Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title_full_unstemmed Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title_short Cytocipher determines significantly different populations of cells in single-cell RNA-seq data
title_sort cytocipher determines significantly different populations of cells in single-cell rna-seq data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368802/
https://www.ncbi.nlm.nih.gov/pubmed/37449901
http://dx.doi.org/10.1093/bioinformatics/btad435
work_keys_str_mv AT baldersonbrad cytocipherdeterminessignificantlydifferentpopulationsofcellsinsinglecellrnaseqdata
AT pipermichael cytocipherdeterminessignificantlydifferentpopulationsofcellsinsinglecellrnaseqdata
AT thorstefan cytocipherdeterminessignificantlydifferentpopulationsofcellsinsinglecellrnaseqdata
AT bodenmikael cytocipherdeterminessignificantlydifferentpopulationsofcellsinsinglecellrnaseqdata