Cargando…

SCA: recovering single-cell heterogeneity through information-based dimensionality reduction

Dimensionality reduction summarizes the complex transcriptomic landscape of single-cell datasets for downstream analyses. Current approaches favor large cellular populations defined by many genes, at the expense of smaller and more subtly defined populations. Here, we present surprisal component ana...

Descripción completa

Detalles Bibliográficos
Autores principales: DeMeo, Benjamin, Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10464206/
https://www.ncbi.nlm.nih.gov/pubmed/37626411
http://dx.doi.org/10.1186/s13059-023-02998-7
Descripción
Sumario:Dimensionality reduction summarizes the complex transcriptomic landscape of single-cell datasets for downstream analyses. Current approaches favor large cellular populations defined by many genes, at the expense of smaller and more subtly defined populations. Here, we present surprisal component analysis (SCA), a technique that newly leverages the information-theoretic notion of surprisal for dimensionality reduction to promote more meaningful signal extraction. For example, SCA uncovers clinically important cytotoxic T-cell subpopulations that are indistinguishable using existing pipelines. We also demonstrate that SCA substantially improves downstream imputation. SCA’s efficient information-theoretic paradigm has broad applications to the study of complex biological tissues in health and disease. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-02998-7.