Cargando…
sccomp: Robust differential composition and variability analysis for single-cell data
Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, su...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438834/ https://www.ncbi.nlm.nih.gov/pubmed/37549298 http://dx.doi.org/10.1073/pnas.2203828120 |
_version_ | 1785092812140707840 |
---|---|
author | Mangiola, Stefano Roth-Schulze, Alexandra J. Trussart, Marie Zozaya-Valdés, Enrique Ma, Mengyao Gao, Zijie Rubin, Alan F. Speed, Terence P. Shim, Heejung Papenfuss, Anthony T. |
author_facet | Mangiola, Stefano Roth-Schulze, Alexandra J. Trussart, Marie Zozaya-Valdés, Enrique Ma, Mengyao Gao, Zijie Rubin, Alan F. Speed, Terence P. Shim, Heejung Papenfuss, Anthony T. |
author_sort | Mangiola, Stefano |
collection | PubMed |
description | Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, such as cancer and pathogen infection. A dedicated statistical method for differential variability analysis is lacking for cellular omics data, and existing methods for differential composition analysis do not model some compositional data properties, suggesting there is room to improve model performance. Here, we introduce sccomp, a method for differential composition and variability analyses that jointly models data count distribution, compositionality, group-specific variability, and proportion mean–variability association, being aware of outliers. sccomp provides a comprehensive analysis framework that offers realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean–variability association is ubiquitous across technologies, highlighting the inadequacy of the very popular Dirichlet-multinomial distribution. We show that sccomp accurately fits experimental data, significantly improving performance over state-of-the-art algorithms. Using sccomp, we identified differential constraints and composition in the microenvironment of primary breast cancer. |
format | Online Article Text |
id | pubmed-10438834 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-104388342023-08-19 sccomp: Robust differential composition and variability analysis for single-cell data Mangiola, Stefano Roth-Schulze, Alexandra J. Trussart, Marie Zozaya-Valdés, Enrique Ma, Mengyao Gao, Zijie Rubin, Alan F. Speed, Terence P. Shim, Heejung Papenfuss, Anthony T. Proc Natl Acad Sci U S A Biological Sciences Cellular omics such as single-cell genomics, proteomics, and microbiomics allow the characterization of tissue and microbial community composition, which can be compared between conditions to identify biological drivers. This strategy has been critical to revealing markers of disease progression, such as cancer and pathogen infection. A dedicated statistical method for differential variability analysis is lacking for cellular omics data, and existing methods for differential composition analysis do not model some compositional data properties, suggesting there is room to improve model performance. Here, we introduce sccomp, a method for differential composition and variability analyses that jointly models data count distribution, compositionality, group-specific variability, and proportion mean–variability association, being aware of outliers. sccomp provides a comprehensive analysis framework that offers realistic data simulation and cross-study knowledge transfer. Here, we demonstrate that mean–variability association is ubiquitous across technologies, highlighting the inadequacy of the very popular Dirichlet-multinomial distribution. We show that sccomp accurately fits experimental data, significantly improving performance over state-of-the-art algorithms. Using sccomp, we identified differential constraints and composition in the microenvironment of primary breast cancer. National Academy of Sciences 2023-08-07 2023-08-15 /pmc/articles/PMC10438834/ /pubmed/37549298 http://dx.doi.org/10.1073/pnas.2203828120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Biological Sciences Mangiola, Stefano Roth-Schulze, Alexandra J. Trussart, Marie Zozaya-Valdés, Enrique Ma, Mengyao Gao, Zijie Rubin, Alan F. Speed, Terence P. Shim, Heejung Papenfuss, Anthony T. sccomp: Robust differential composition and variability analysis for single-cell data |
title | sccomp: Robust differential composition and variability analysis for single-cell data |
title_full | sccomp: Robust differential composition and variability analysis for single-cell data |
title_fullStr | sccomp: Robust differential composition and variability analysis for single-cell data |
title_full_unstemmed | sccomp: Robust differential composition and variability analysis for single-cell data |
title_short | sccomp: Robust differential composition and variability analysis for single-cell data |
title_sort | sccomp: robust differential composition and variability analysis for single-cell data |
topic | Biological Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10438834/ https://www.ncbi.nlm.nih.gov/pubmed/37549298 http://dx.doi.org/10.1073/pnas.2203828120 |
work_keys_str_mv | AT mangiolastefano sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT rothschulzealexandraj sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT trussartmarie sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT zozayavaldesenrique sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT mamengyao sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT gaozijie sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT rubinalanf sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT speedterencep sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT shimheejung sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata AT papenfussanthonyt sccomprobustdifferentialcompositionandvariabilityanalysisforsinglecelldata |