Cargando…
A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data
The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average g...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002707/ https://www.ncbi.nlm.nih.gov/pubmed/36909553 http://dx.doi.org/10.1101/2023.02.24.529987 |
_version_ | 1784904446014128128 |
---|---|
author | Chen, Minhui Dahl, Andy |
author_facet | Chen, Minhui Dahl, Andy |
author_sort | Chen, Minhui |
collection | PubMed |
description | The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average gene expression profiles, most studies ignore variation across individual donors. Modelling this inter-individual variation could improve statistical power to detect cell type-specific biology and inform the genes and cell types that underlying complex traits. We therefore develop a new model to detect and quantify cell type-specific variation across individuals called CTMM (Cell Type-specific linear Mixed Model). CTMM operates on cell type-specific pseudobulk expression and is fit with efficient methods that scale to hundreds of samples. We use extensive simulations to show that CTMM is powerful and unbiased in realistic settings. We also derive calibrated tests for cell type-specific interindividual variation, which is challenging given the modest sample sizes in scRNA-seq data. We apply CTMM to scRNA-seq data from human induced pluripotent stem cells to characterize the transcriptomic variation across donors as cells differentiate into endoderm. We find that almost 100% of transcriptome-wide variability between donors is differentiation stage-specific. CTMM also identifies individual genes with statistically significant stage-specific variability across samples, including 61 genes that do not have significant stage-specific mean expression. Finally, we extend CTMM to partition interindividual covariance between stages, which recapitulates the overall differentiation trajectory. Overall, CTMM is a powerful tool to characterize a novel dimension of cell type-specific biology in scRNA-seq. |
format | Online Article Text |
id | pubmed-10002707 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100027072023-03-11 A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data Chen, Minhui Dahl, Andy bioRxiv Article The development of single-cell RNA sequencing (scRNA-seq) offers opportunities to characterize cellular heterogeneity at unprecedented resolution. Although scRNA-seq has been widely used to identify and characterize gene expression variation across cell types and cell states based on their average gene expression profiles, most studies ignore variation across individual donors. Modelling this inter-individual variation could improve statistical power to detect cell type-specific biology and inform the genes and cell types that underlying complex traits. We therefore develop a new model to detect and quantify cell type-specific variation across individuals called CTMM (Cell Type-specific linear Mixed Model). CTMM operates on cell type-specific pseudobulk expression and is fit with efficient methods that scale to hundreds of samples. We use extensive simulations to show that CTMM is powerful and unbiased in realistic settings. We also derive calibrated tests for cell type-specific interindividual variation, which is challenging given the modest sample sizes in scRNA-seq data. We apply CTMM to scRNA-seq data from human induced pluripotent stem cells to characterize the transcriptomic variation across donors as cells differentiate into endoderm. We find that almost 100% of transcriptome-wide variability between donors is differentiation stage-specific. CTMM also identifies individual genes with statistically significant stage-specific variability across samples, including 61 genes that do not have significant stage-specific mean expression. Finally, we extend CTMM to partition interindividual covariance between stages, which recapitulates the overall differentiation trajectory. Overall, CTMM is a powerful tool to characterize a novel dimension of cell type-specific biology in scRNA-seq. Cold Spring Harbor Laboratory 2023-02-27 /pmc/articles/PMC10002707/ /pubmed/36909553 http://dx.doi.org/10.1101/2023.02.24.529987 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Chen, Minhui Dahl, Andy A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title | A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title_full | A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title_fullStr | A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title_full_unstemmed | A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title_short | A robust model for cell type-specific interindividual variation in single-cell RNA sequencing data |
title_sort | robust model for cell type-specific interindividual variation in single-cell rna sequencing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10002707/ https://www.ncbi.nlm.nih.gov/pubmed/36909553 http://dx.doi.org/10.1101/2023.02.24.529987 |
work_keys_str_mv | AT chenminhui arobustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata AT dahlandy arobustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata AT chenminhui robustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata AT dahlandy robustmodelforcelltypespecificinterindividualvariationinsinglecellrnasequencingdata |