Cargando…

GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership

Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting t...

Descripción completa

Detalles Bibliográficos
Autores principales: Carbonetto, Peter, Luo, Kaixuan, Sarkar, Abhishek, Hung, Anthony, Tayeb, Karl, Pott, Sebastian, Stephens, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028846/
https://www.ncbi.nlm.nih.gov/pubmed/36945441
http://dx.doi.org/10.1101/2023.03.03.531029
_version_ 1784910030619803648
author Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
author_facet Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
author_sort Carbonetto, Peter
collection PubMed
description Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets.
format Online
Article
Text
id pubmed-10028846
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100288462023-03-22 GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership Carbonetto, Peter Luo, Kaixuan Sarkar, Abhishek Hung, Anthony Tayeb, Karl Pott, Sebastian Stephens, Matthew bioRxiv Article Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets. Cold Spring Harbor Laboratory 2023-09-14 /pmc/articles/PMC10028846/ /pubmed/36945441 http://dx.doi.org/10.1101/2023.03.03.531029 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_full GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_fullStr GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_full_unstemmed GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_short GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_sort gom de: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028846/
https://www.ncbi.nlm.nih.gov/pubmed/36945441
http://dx.doi.org/10.1101/2023.03.03.531029
work_keys_str_mv AT carbonettopeter gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT luokaixuan gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT sarkarabhishek gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT hunganthony gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT tayebkarl gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT pottsebastian gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT stephensmatthew gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership