Cargando…

GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership

Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting t...

Descripción completa

Detalles Bibliográficos
Autores principales: Carbonetto, Peter, Luo, Kaixuan, Sarkar, Abhishek, Hung, Anthony, Tayeb, Karl, Pott, Sebastian, Stephens, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588049/
https://www.ncbi.nlm.nih.gov/pubmed/37858253
http://dx.doi.org/10.1186/s13059-023-03067-9
_version_ 1785123495247609856
author Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
author_facet Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
author_sort Carbonetto, Peter
collection PubMed
description Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03067-9.
format Online
Article
Text
id pubmed-10588049
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105880492023-10-21 GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership Carbonetto, Peter Luo, Kaixuan Sarkar, Abhishek Hung, Anthony Tayeb, Karl Pott, Sebastian Stephens, Matthew Genome Biol Method Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03067-9. BioMed Central 2023-10-19 /pmc/articles/PMC10588049/ /pubmed/37858253 http://dx.doi.org/10.1186/s13059-023-03067-9 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Carbonetto, Peter
Luo, Kaixuan
Sarkar, Abhishek
Hung, Anthony
Tayeb, Karl
Pott, Sebastian
Stephens, Matthew
GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_full GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_fullStr GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_full_unstemmed GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_short GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
title_sort gom de: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588049/
https://www.ncbi.nlm.nih.gov/pubmed/37858253
http://dx.doi.org/10.1186/s13059-023-03067-9
work_keys_str_mv AT carbonettopeter gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT luokaixuan gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT sarkarabhishek gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT hunganthony gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT tayebkarl gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT pottsebastian gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership
AT stephensmatthew gomdeinterpretingstructureinsequencecountdatawithdifferentialexpressionanalysisallowingforgradesofmembership