Cargando…

Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics

As single-cell datasets grow in sample size, there is a critical need to characterize cell states that vary across samples and associate with sample attributes like clinical phenotypes. Current statistical approaches typically map cells to clusters then assess differences in cluster abundance. We pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Reshef, Yakir, Rumker, Laurie, Kang, Joyce B., Nathan, Aparna, Korsunsky, Ilya, Asgari, Samira, Murray, Megan B., Moody, D. Branch, Raychaudhuri, Soumya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8930733/
https://www.ncbi.nlm.nih.gov/pubmed/34675423
http://dx.doi.org/10.1038/s41587-021-01066-4
_version_ 1784671103189254144
author Reshef, Yakir
Rumker, Laurie
Kang, Joyce B.
Nathan, Aparna
Korsunsky, Ilya
Asgari, Samira
Murray, Megan B.
Moody, D. Branch
Raychaudhuri, Soumya
author_facet Reshef, Yakir
Rumker, Laurie
Kang, Joyce B.
Nathan, Aparna
Korsunsky, Ilya
Asgari, Samira
Murray, Megan B.
Moody, D. Branch
Raychaudhuri, Soumya
author_sort Reshef, Yakir
collection PubMed
description As single-cell datasets grow in sample size, there is a critical need to characterize cell states that vary across samples and associate with sample attributes like clinical phenotypes. Current statistical approaches typically map cells to clusters then assess differences in cluster abundance. We present covarying neighborhood analysis (CNA), an unbiased method to identify associated cell populations with greater flexibility than cluster-based approaches. CNA characterizes dominant axes of variation across samples by identifying groups of small regions in transcriptional space—termed neighborhoods—that covary in abundance across samples, suggesting shared function or regulation. CNA performs statistical testing for associations between any sample-level attribute and the abundances of these covarying neighborhood groups. Simulations show that CNA enables more sensitive and accurate identification of disease-associated cell states than a cluster-based approach. When applied to published datasets, CNA captures a Notch activation signature in rheumatoid arthritis, identifies monocyte populations expanded in sepsis, and identifies a novel T-cell population associated with progression to active tuberculosis.
format Online
Article
Text
id pubmed-8930733
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-89307332022-04-21 Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics Reshef, Yakir Rumker, Laurie Kang, Joyce B. Nathan, Aparna Korsunsky, Ilya Asgari, Samira Murray, Megan B. Moody, D. Branch Raychaudhuri, Soumya Nat Biotechnol Article As single-cell datasets grow in sample size, there is a critical need to characterize cell states that vary across samples and associate with sample attributes like clinical phenotypes. Current statistical approaches typically map cells to clusters then assess differences in cluster abundance. We present covarying neighborhood analysis (CNA), an unbiased method to identify associated cell populations with greater flexibility than cluster-based approaches. CNA characterizes dominant axes of variation across samples by identifying groups of small regions in transcriptional space—termed neighborhoods—that covary in abundance across samples, suggesting shared function or regulation. CNA performs statistical testing for associations between any sample-level attribute and the abundances of these covarying neighborhood groups. Simulations show that CNA enables more sensitive and accurate identification of disease-associated cell states than a cluster-based approach. When applied to published datasets, CNA captures a Notch activation signature in rheumatoid arthritis, identifies monocyte populations expanded in sepsis, and identifies a novel T-cell population associated with progression to active tuberculosis. 2022-03 2021-10-21 /pmc/articles/PMC8930733/ /pubmed/34675423 http://dx.doi.org/10.1038/s41587-021-01066-4 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms
spellingShingle Article
Reshef, Yakir
Rumker, Laurie
Kang, Joyce B.
Nathan, Aparna
Korsunsky, Ilya
Asgari, Samira
Murray, Megan B.
Moody, D. Branch
Raychaudhuri, Soumya
Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title_full Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title_fullStr Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title_full_unstemmed Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title_short Covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
title_sort covarying neighborhood analysis identifies cell populations associated with phenotypes of interest from single-cell transcriptomics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8930733/
https://www.ncbi.nlm.nih.gov/pubmed/34675423
http://dx.doi.org/10.1038/s41587-021-01066-4
work_keys_str_mv AT reshefyakir covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT rumkerlaurie covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT kangjoyceb covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT nathanaparna covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT korsunskyilya covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT asgarisamira covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT murraymeganb covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT moodydbranch covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics
AT raychaudhurisoumya covaryingneighborhoodanalysisidentifiescellpopulationsassociatedwithphenotypesofinterestfromsinglecelltranscriptomics