Cargando…

Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references

The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we in...

Descripción completa

Detalles Bibliográficos
Autores principales: Ren, Pengfei, Shi, Xiaoying, Yu, Zhiguang, Dong, Xin, Ding, Xuanxin, Wang, Jin, Sun, Liangdong, Yan, Yilv, Hu, Junjie, Zhang, Peng, Chen, Qianming, Zhang, Jing, Li, Taiwen, Wang, Chenfei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10545911/
https://www.ncbi.nlm.nih.gov/pubmed/37751689
http://dx.doi.org/10.1016/j.crmeth.2023.100577
_version_ 1785114764817465344
author Ren, Pengfei
Shi, Xiaoying
Yu, Zhiguang
Dong, Xin
Ding, Xuanxin
Wang, Jin
Sun, Liangdong
Yan, Yilv
Hu, Junjie
Zhang, Peng
Chen, Qianming
Zhang, Jing
Li, Taiwen
Wang, Chenfei
author_facet Ren, Pengfei
Shi, Xiaoying
Yu, Zhiguang
Dong, Xin
Ding, Xuanxin
Wang, Jin
Sun, Liangdong
Yan, Yilv
Hu, Junjie
Zhang, Peng
Chen, Qianming
Zhang, Jing
Li, Taiwen
Wang, Chenfei
author_sort Ren, Pengfei
collection PubMed
description The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we introduce SELINA (single-cell identity navigator), an integrative and automatic cell-type annotation framework based on a pre-curated reference atlas spanning various tissues. SELINA employs a multiple-adversarial domain adaptation network to remove batch effects within the reference dataset. Additionally, it enhances the annotation of less frequent cell types by synthetic minority oversampling and fits query data with the reference data using an autoencoder. SELINA culminates in the creation of a comprehensive and uniform reference atlas, encompassing 1.7 million cells covering 230 distinct human cell types. We substantiate its robustness and superiority across a multitude of human tissues. Notably, SELINA could accurately annotate cells within diverse disease contexts. SELINA provides a complete solution for human single-cell RNA-seq data annotation with both python and R packages.
format Online
Article
Text
id pubmed-10545911
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105459112023-10-04 Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references Ren, Pengfei Shi, Xiaoying Yu, Zhiguang Dong, Xin Ding, Xuanxin Wang, Jin Sun, Liangdong Yan, Yilv Hu, Junjie Zhang, Peng Chen, Qianming Zhang, Jing Li, Taiwen Wang, Chenfei Cell Rep Methods Article The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we introduce SELINA (single-cell identity navigator), an integrative and automatic cell-type annotation framework based on a pre-curated reference atlas spanning various tissues. SELINA employs a multiple-adversarial domain adaptation network to remove batch effects within the reference dataset. Additionally, it enhances the annotation of less frequent cell types by synthetic minority oversampling and fits query data with the reference data using an autoencoder. SELINA culminates in the creation of a comprehensive and uniform reference atlas, encompassing 1.7 million cells covering 230 distinct human cell types. We substantiate its robustness and superiority across a multitude of human tissues. Notably, SELINA could accurately annotate cells within diverse disease contexts. SELINA provides a complete solution for human single-cell RNA-seq data annotation with both python and R packages. Elsevier 2023-08-31 /pmc/articles/PMC10545911/ /pubmed/37751689 http://dx.doi.org/10.1016/j.crmeth.2023.100577 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Ren, Pengfei
Shi, Xiaoying
Yu, Zhiguang
Dong, Xin
Ding, Xuanxin
Wang, Jin
Sun, Liangdong
Yan, Yilv
Hu, Junjie
Zhang, Peng
Chen, Qianming
Zhang, Jing
Li, Taiwen
Wang, Chenfei
Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title_full Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title_fullStr Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title_full_unstemmed Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title_short Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
title_sort single-cell assignment using multiple-adversarial domain adaptation network with large-scale references
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10545911/
https://www.ncbi.nlm.nih.gov/pubmed/37751689
http://dx.doi.org/10.1016/j.crmeth.2023.100577
work_keys_str_mv AT renpengfei singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT shixiaoying singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT yuzhiguang singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT dongxin singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT dingxuanxin singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT wangjin singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT sunliangdong singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT yanyilv singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT hujunjie singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT zhangpeng singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT chenqianming singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT zhangjing singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT litaiwen singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences
AT wangchenfei singlecellassignmentusingmultipleadversarialdomainadaptationnetworkwithlargescalereferences