Cargando…

scMAGIC: accurately annotating single cells using two rounds of reference-based classification

Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yu, Zhang, Feng, Wang, Zekun, Wu, Siyi, Tian, Weidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071478/
https://www.ncbi.nlm.nih.gov/pubmed/34986249
http://dx.doi.org/10.1093/nar/gkab1275
_version_ 1784700851072270336
author Zhang, Yu
Zhang, Feng
Wang, Zekun
Wu, Siyi
Tian, Weidong
author_facet Zhang, Yu
Zhang, Feng
Wang, Zekun
Wu, Siyi
Tian, Weidong
author_sort Zhang, Yu
collection PubMed
description Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again classify query cells, therefore eliminating the batch effects between the reference and the query data. scMAGIC significantly outperforms 13 competing RBC methods with their optimal parameter settings across 86 benchmark tests, especially when the cell types in the query dataset are not completely covered by the reference dataset and when there exist significant batch effects between the reference and the query datasets. Moreover, when no reference dataset is available, scMAGIC can annotate query cells with reasonably high accuracy by using an atlas dataset as the reference.
format Online
Article
Text
id pubmed-9071478
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90714782022-05-06 scMAGIC: accurately annotating single cells using two rounds of reference-based classification Zhang, Yu Zhang, Feng Wang, Zekun Wu, Siyi Tian, Weidong Nucleic Acids Res Methods Online Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again classify query cells, therefore eliminating the batch effects between the reference and the query data. scMAGIC significantly outperforms 13 competing RBC methods with their optimal parameter settings across 86 benchmark tests, especially when the cell types in the query dataset are not completely covered by the reference dataset and when there exist significant batch effects between the reference and the query datasets. Moreover, when no reference dataset is available, scMAGIC can annotate query cells with reasonably high accuracy by using an atlas dataset as the reference. Oxford University Press 2022-01-05 /pmc/articles/PMC9071478/ /pubmed/34986249 http://dx.doi.org/10.1093/nar/gkab1275 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Zhang, Yu
Zhang, Feng
Wang, Zekun
Wu, Siyi
Tian, Weidong
scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title_full scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title_fullStr scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title_full_unstemmed scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title_short scMAGIC: accurately annotating single cells using two rounds of reference-based classification
title_sort scmagic: accurately annotating single cells using two rounds of reference-based classification
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071478/
https://www.ncbi.nlm.nih.gov/pubmed/34986249
http://dx.doi.org/10.1093/nar/gkab1275
work_keys_str_mv AT zhangyu scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification
AT zhangfeng scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification
AT wangzekun scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification
AT wusiyi scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification
AT tianweidong scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification