Cargando…

A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data

The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average g...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Minhui, Dahl, Andy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002707/
https://www.ncbi.nlm.nih.gov/pubmed/36909553
http://dx.doi.org/10.1101/2023.02.24.529987
_version_ 1784904446014128128
author Chen, Minhui
Dahl, Andy
author_facet Chen, Minhui
Dahl, Andy
author_sort Chen, Minhui
collection PubMed
description The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average gene expression profiles, most studies ignore variation across individual donors. Modelling this inter-individual variation could improve statistical power to detect cell type-specific biology and inform the genes and cell types that underlying complex traits. We therefore develop a new model to detect and quantify cell type-specific variation across individuals called CTMM (Cell Type-specific linear Mixed Model). CTMM operates on cell type-specific pseudobulk expression and is fit with efficient methods that scale to hundreds of samples. We use extensive simulations to show that CTMM is powerful and unbiased in realistic settings. We also derive calibrated tests for cell type-specific interindividual variation, which is challenging given the modest sample sizes in scRNA-seq data. We apply CTMM to scRNA-seq data from human induced pluripotent stem cells to characterize the transcriptomic variation across donors as cells differentiate into endoderm. We find that almost 100% of transcriptome-wide variability between donors is differentiation stage-specific. CTMM also identifies individual genes with statistically significant stage-specific variability across samples, including 61 genes that do not have significant stage-specific mean expression. Finally, we extend CTMM to partition interindividual covariance between stages, which recapitulates the overall differentiation trajectory. Overall, CTMM is a powerful tool to characterize a novel dimension of cell type-specific biology in scRNA-seq.
format Online
Article
Text
id pubmed-10002707
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100027072023-03-11 A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data Chen, Minhui Dahl, Andy bioRxiv Article The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average gene expression profiles, most studies ignore variation across individual donors. Modelling this inter-individual variation could improve statistical power to detect cell type-specific biology and inform the genes and cell types that underlying complex traits. We therefore develop a new model to detect and quantify cell type-specific variation across individuals called CTMM (Cell Type-specific linear Mixed Model). CTMM operates on cell type-specific pseudobulk expression and is fit with efficient methods that scale to hundreds of samples. We use extensive simulations to show that CTMM is powerful and unbiased in realistic settings. We also derive calibrated tests for cell type-specific interindividual variation, which is challenging given the modest sample sizes in scRNA-seq data. We apply CTMM to scRNA-seq data from human induced pluripotent stem cells to characterize the transcriptomic variation across donors as cells differentiate into endoderm. We find that almost 100% of transcriptome-wide variability between donors is differentiation stage-specific. CTMM also identifies individual genes with statistically significant stage-specific variability across samples, including 61 genes that do not have significant stage-specific mean expression. Finally, we extend CTMM to partition interindividual covariance between stages, which recapitulates the overall differentiation trajectory. Overall, CTMM is a powerful tool to characterize a novel dimension of cell type-specific biology in scRNA-seq. Cold Spring Harbor Laboratory 2023-02-27 /pmc/articles/PMC10002707/ /pubmed/36909553 http://dx.doi.org/10.1101/2023.02.24.529987 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Chen, Minhui
Dahl, Andy
A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title_full A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title_fullStr A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title_full_unstemmed A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title_short A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
title_sort robust model for cell type-specific interindividual variation in single-cell rna sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002707/
https://www.ncbi.nlm.nih.gov/pubmed/36909553
http://dx.doi.org/10.1101/2023.02.24.529987
work_keys_str_mv AT chenminhui arobustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata
AT dahlandy arobustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata
AT chenminhui robustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata
AT dahlandy robustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata