Cargando…

Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references

The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we in...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Pengfei, Shi, Xiaoying, Yu, Zhiguang, Dong, Xin, Ding, Xuanxin, Wang, Jin, Sun, Liangdong, Yan, Yilv, Hu, Junjie, Zhang, Peng, Chen, Qianming, Zhang, Jing, Li, Taiwen, Wang, Chenfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10545911/
https://www.ncbi.nlm.nih.gov/pubmed/37751689
http://dx.doi.org/10.1016/j.crmeth.2023.100577
Descripción
Sumario:The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we introduce SELINA (single-cell identity navigator), an integrative and automatic cell-type annotation framework based on a pre-curated reference atlas spanning various tissues. SELINA employs a multiple-adversarial domain adaptation network to remove batch effects within the reference dataset. Additionally, it enhances the annotation of less frequent cell types by synthetic minority oversampling and fits query data with the reference data using an autoencoder. SELINA culminates in the creation of a comprehensive and uniform reference atlas, encompassing 1.7 million cells covering 230 distinct human cell types. We substantiate its robustness and superiority across a multitude of human tissues. Notably, SELINA could accurately annotate cells within diverse disease contexts. SELINA provides a complete solution for human single-cell RNA-seq data annotation with both python and R packages.