Cargando…

Robustness of single-cell RNA-seq for identifying differentially expressed genes

BACKGROUND: A common feature of single-cell RNA-seq (scRNA-seq) data is that the number of cells in a cell cluster may vary widely, ranging from a few dozen to several thousand. It is not clear whether scRNA-seq data from a small number of cells allow robust identification of differentially expresse...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yong, Huang, Jing, Pandey, Rajan, Liu, Pengyuan, Therani, Bhavika, Qiu, Qiongzi, Rao, Sridhar, Geurts, Aron M., Cowley, Allen W., Greene, Andrew S., Liang, Mingyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10316566/
https://www.ncbi.nlm.nih.gov/pubmed/37394518
http://dx.doi.org/10.1186/s12864-023-09487-y
Descripción
Sumario:BACKGROUND: A common feature of single-cell RNA-seq (scRNA-seq) data is that the number of cells in a cell cluster may vary widely, ranging from a few dozen to several thousand. It is not clear whether scRNA-seq data from a small number of cells allow robust identification of differentially expressed genes (DEGs) with various characteristics. RESULTS: We addressed this question by performing scRNA-seq and poly(A)-dependent bulk RNA-seq in comparable aliquots of human induced pluripotent stem cells-derived, purified vascular endothelial and smooth muscle cells. We found that scRNA-seq data needed to have 2,000 or more cells in a cluster to identify the majority of DEGs that would show modest differences in a bulk RNA-seq analysis. On the other hand, clusters with as few as 50–100 cells may be sufficient for identifying the majority of DEGs that would have extremely small p values or transcript abundance greater than a few hundred transcripts per million in a bulk RNA-seq analysis. CONCLUSION: Findings of the current study provide a quantitative reference for designing studies that aim for identifying DEGs for specific cell clusters using scRNA-seq data and for interpreting results of such studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-023-09487-y.