Cargando…
scMAGIC: accurately annotating single cells using two rounds of reference-based classification
Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data....
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071478/ https://www.ncbi.nlm.nih.gov/pubmed/34986249 http://dx.doi.org/10.1093/nar/gkab1275 |
_version_ | 1784700851072270336 |
---|---|
author | Zhang, Yu Zhang, Feng Wang, Zekun Wu, Siyi Tian, Weidong |
author_facet | Zhang, Yu Zhang, Feng Wang, Zekun Wu, Siyi Tian, Weidong |
author_sort | Zhang, Yu |
collection | PubMed |
description | Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again classify query cells, therefore eliminating the batch effects between the reference and the query data. scMAGIC significantly outperforms 13 competing RBC methods with their optimal parameter settings across 86 benchmark tests, especially when the cell types in the query dataset are not completely covered by the reference dataset and when there exist significant batch effects between the reference and the query datasets. Moreover, when no reference dataset is available, scMAGIC can annotate query cells with reasonably high accuracy by using an atlas dataset as the reference. |
format | Online Article Text |
id | pubmed-9071478 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-90714782022-05-06 scMAGIC: accurately annotating single cells using two rounds of reference-based classification Zhang, Yu Zhang, Feng Wang, Zekun Wu, Siyi Tian, Weidong Nucleic Acids Res Methods Online Here, we introduce scMAGIC (Single Cell annotation using MArker Genes Identification and two rounds of reference-based Classification [RBC]), a novel method that uses well-annotated single-cell RNA sequencing (scRNA-seq) data as the reference to assist in the classification of query scRNA-seq data. A key innovation in scMAGIC is the introduction of a second-round RBC in which those query cells whose cell identities are confidently validated in the first round are used as a new reference to again classify query cells, therefore eliminating the batch effects between the reference and the query data. scMAGIC significantly outperforms 13 competing RBC methods with their optimal parameter settings across 86 benchmark tests, especially when the cell types in the query dataset are not completely covered by the reference dataset and when there exist significant batch effects between the reference and the query datasets. Moreover, when no reference dataset is available, scMAGIC can annotate query cells with reasonably high accuracy by using an atlas dataset as the reference. Oxford University Press 2022-01-05 /pmc/articles/PMC9071478/ /pubmed/34986249 http://dx.doi.org/10.1093/nar/gkab1275 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Zhang, Yu Zhang, Feng Wang, Zekun Wu, Siyi Tian, Weidong scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title | scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title_full | scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title_fullStr | scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title_full_unstemmed | scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title_short | scMAGIC: accurately annotating single cells using two rounds of reference-based classification |
title_sort | scmagic: accurately annotating single cells using two rounds of reference-based classification |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071478/ https://www.ncbi.nlm.nih.gov/pubmed/34986249 http://dx.doi.org/10.1093/nar/gkab1275 |
work_keys_str_mv | AT zhangyu scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification AT zhangfeng scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification AT wangzekun scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification AT wusiyi scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification AT tianweidong scmagicaccuratelyannotatingsinglecellsusingtworoundsofreferencebasedclassification |