Cargando…

gCAnno: a graph-based single cell type annotation method

BACKGROUND: Current single cell analysis methods annotate cell types at cluster-level rather than ideally at single cell level. Multiple exchangeable clustering methods and many tunable parameters have a substantial impact on the clustering outcome, often leading to incorrect cluster-level annotatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Xiaofei, Gao, Shenghan, Wang, Tingjie, Yang, Boyu, Dang, Ningxin, Ye, Kai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7686723/
https://www.ncbi.nlm.nih.gov/pubmed/33228535
http://dx.doi.org/10.1186/s12864-020-07223-4
_version_ 1783613388583075840
author Yang, Xiaofei
Gao, Shenghan
Wang, Tingjie
Yang, Boyu
Dang, Ningxin
Ye, Kai
author_facet Yang, Xiaofei
Gao, Shenghan
Wang, Tingjie
Yang, Boyu
Dang, Ningxin
Ye, Kai
author_sort Yang, Xiaofei
collection PubMed
description BACKGROUND: Current single cell analysis methods annotate cell types at cluster-level rather than ideally at single cell level. Multiple exchangeable clustering methods and many tunable parameters have a substantial impact on the clustering outcome, often leading to incorrect cluster-level annotation or multiple runs of subsequent clustering steps. To address these limitations, methods based on well-annotated reference atlas has been proposed. However, these methods are currently not robust enough to handle datasets with different noise levels or from different platforms. RESULTS: Here, we present gCAnno, a graph-based Cell type Annotation method. First, gCAnno constructs cell type-gene bipartite graph and adopts graph embedding to obtain cell type specific genes. Then, naïve Bayes (gCAnno-Bayes) and SVM (gCAnno-SVM) classifiers are built for annotation. We compared the performance of gCAnno to other state-of-art methods on multiple single cell datasets, either with various noise levels or from different platforms. The results showed that gCAnno outperforms other state-of-art methods with higher accuracy and robustness. CONCLUSIONS: gCAnno is a robust and accurate cell type annotation tool for single cell RNA analysis. The source code of gCAnno is publicly available at https://github.com/xjtu-omics/gCAnno. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-020-07223-4.
format Online
Article
Text
id pubmed-7686723
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-76867232020-11-25 gCAnno: a graph-based single cell type annotation method Yang, Xiaofei Gao, Shenghan Wang, Tingjie Yang, Boyu Dang, Ningxin Ye, Kai BMC Genomics Methodology Article BACKGROUND: Current single cell analysis methods annotate cell types at cluster-level rather than ideally at single cell level. Multiple exchangeable clustering methods and many tunable parameters have a substantial impact on the clustering outcome, often leading to incorrect cluster-level annotation or multiple runs of subsequent clustering steps. To address these limitations, methods based on well-annotated reference atlas has been proposed. However, these methods are currently not robust enough to handle datasets with different noise levels or from different platforms. RESULTS: Here, we present gCAnno, a graph-based Cell type Annotation method. First, gCAnno constructs cell type-gene bipartite graph and adopts graph embedding to obtain cell type specific genes. Then, naïve Bayes (gCAnno-Bayes) and SVM (gCAnno-SVM) classifiers are built for annotation. We compared the performance of gCAnno to other state-of-art methods on multiple single cell datasets, either with various noise levels or from different platforms. The results showed that gCAnno outperforms other state-of-art methods with higher accuracy and robustness. CONCLUSIONS: gCAnno is a robust and accurate cell type annotation tool for single cell RNA analysis. The source code of gCAnno is publicly available at https://github.com/xjtu-omics/gCAnno. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-020-07223-4. BioMed Central 2020-11-23 /pmc/articles/PMC7686723/ /pubmed/33228535 http://dx.doi.org/10.1186/s12864-020-07223-4 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Yang, Xiaofei
Gao, Shenghan
Wang, Tingjie
Yang, Boyu
Dang, Ningxin
Ye, Kai
gCAnno: a graph-based single cell type annotation method
title gCAnno: a graph-based single cell type annotation method
title_full gCAnno: a graph-based single cell type annotation method
title_fullStr gCAnno: a graph-based single cell type annotation method
title_full_unstemmed gCAnno: a graph-based single cell type annotation method
title_short gCAnno: a graph-based single cell type annotation method
title_sort gcanno: a graph-based single cell type annotation method
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7686723/
https://www.ncbi.nlm.nih.gov/pubmed/33228535
http://dx.doi.org/10.1186/s12864-020-07223-4
work_keys_str_mv AT yangxiaofei gcannoagraphbasedsinglecelltypeannotationmethod
AT gaoshenghan gcannoagraphbasedsinglecelltypeannotationmethod
AT wangtingjie gcannoagraphbasedsinglecelltypeannotationmethod
AT yangboyu gcannoagraphbasedsinglecelltypeannotationmethod
AT dangningxin gcannoagraphbasedsinglecelltypeannotationmethod
AT yekai gcannoagraphbasedsinglecelltypeannotationmethod