GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership

Bibliographic Details
Main authors: Carbonetto, Peter; Luo, Kaixuan; Sarkar, Abhishek; Hung, Anthony; Tayeb, Karl; Pott, Sebastian; Stephens, Matthew
Format: Online Article Text
Language: English
Published: BioMed Central 2023
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588049/
https://www.ncbi.nlm.nih.gov/pubmed/37858253
http://dx.doi.org/10.1186/s13059-023-03067-9
Description
Summary: Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets.
Supplementary information: The online version contains supplementary material available at 10.1186/s13059-023-03067-9.