Cargando…

sccomp: Robust differential composition and variability analysis for single-cell data

Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, su...

Descripción completa

Detalles Bibliográficos
Autores principales: Mangiola, Stefano, Roth-Schulze, Alexandra J., Trussart, Marie, Zozaya-Valdés, Enrique, Ma, Mengyao, Gao, Zijie, Rubin, Alan F., Speed, Terence P., Shim, Heejung, Papenfuss, Anthony T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438834/
https://www.ncbi.nlm.nih.gov/pubmed/37549298
http://dx.doi.org/10.1073/pnas.2203828120
_version_ 1785092812140707840
author Mangiola, Stefano
Roth-Schulze, Alexandra J.
Trussart, Marie
Zozaya-Valdés, Enrique
Ma, Mengyao
Gao, Zijie
Rubin, Alan F.
Speed, Terence P.
Shim, Heejung
Papenfuss, Anthony T.
author_facet Mangiola, Stefano
Roth-Schulze, Alexandra J.
Trussart, Marie
Zozaya-Valdés, Enrique
Ma, Mengyao
Gao, Zijie
Rubin, Alan F.
Speed, Terence P.
Shim, Heejung
Papenfuss, Anthony T.
author_sort Mangiola, Stefano
collection PubMed
description Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, such as cancer and pathogen infection. A dedicated statistical method for differential variability analysis is lacking for cellular omics data, and existing methods for differential composition analysis do not model some compositional data properties, suggesting there is room to improve model performance. Here, we introduce sccomp, a method for differential composition and variability analyses that jointly models data count distribution, compositionality, group-specific variability, and proportion mean–variability association, being aware of outliers. sccomp provides a comprehensive analysis framework that offers realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean–variability association is ubiquitous across technologies, highlighting the inadequacy of the very popular Dirichlet-multinomial distribution. We show that sccomp accurately fits experimental data, significantly improving performance over state-of-the-art algorithms. Using sccomp, we identified differential constraints and composition in the microenvironment of primary breast cancer.
format Online
Article
Text
id pubmed-10438834
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-104388342023-08-19 sccomp: Robust differential composition and variability analysis for single-cell data Mangiola, Stefano Roth-Schulze, Alexandra J. Trussart, Marie Zozaya-Valdés, Enrique Ma, Mengyao Gao, Zijie Rubin, Alan F. Speed, Terence P. Shim, Heejung Papenfuss, Anthony T. Proc Natl Acad Sci U S A Biological Sciences Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, such as cancer and pathogen infection. A dedicated statistical method for differential variability analysis is lacking for cellular omics data, and existing methods for differential composition analysis do not model some compositional data properties, suggesting there is room to improve model performance. Here, we introduce sccomp, a method for differential composition and variability analyses that jointly models data count distribution, compositionality, group-specific variability, and proportion mean–variability association, being aware of outliers. sccomp provides a comprehensive analysis framework that offers realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean–variability association is ubiquitous across technologies, highlighting the inadequacy of the very popular Dirichlet-multinomial distribution. We show that sccomp accurately fits experimental data, significantly improving performance over state-of-the-art algorithms. Using sccomp, we identified differential constraints and composition in the microenvironment of primary breast cancer. National Academy of Sciences 2023-08-07 2023-08-15 /pmc/articles/PMC10438834/ /pubmed/37549298 http://dx.doi.org/10.1073/pnas.2203828120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Biological Sciences
Mangiola, Stefano
Roth-Schulze, Alexandra J.
Trussart, Marie
Zozaya-Valdés, Enrique
Ma, Mengyao
Gao, Zijie
Rubin, Alan F.
Speed, Terence P.
Shim, Heejung
Papenfuss, Anthony T.
sccomp: Robust differential composition and variability analysis for single-cell data
title sccomp: Robust differential composition and variability analysis for single-cell data
title_full sccomp: Robust differential composition and variability analysis for single-cell data
title_fullStr sccomp: Robust differential composition and variability analysis for single-cell data
title_full_unstemmed sccomp: Robust differential composition and variability analysis for single-cell data
title_short sccomp: Robust differential composition and variability analysis for single-cell data
title_sort sccomp: robust differential composition and variability analysis for single-cell data
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438834/
https://www.ncbi.nlm.nih.gov/pubmed/37549298
http://dx.doi.org/10.1073/pnas.2203828120
work_keys_str_mv AT mangiolastefano sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT rothschulzealexandraj sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT trussartmarie sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT zozayavaldesenrique sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT mamengyao sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT gaozijie sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT rubinalanf sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT speedterencep sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT shimheejung sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata
AT papenfussanthonyt sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata