Cargando…

EasyCellType: marker-based cell-type annotation by automatically querying multiple databases

MOTIVATION: Cell label annotation is a challenging step in the analysis of single-cell RNA sequencing (scRNA-seq) data, especially for tissue types that are less commonly studied. The accumulation of scRNA-seq studies and biological knowledge leads to several well-maintained cell marker databases. M...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ruoxing, Zhang, Jianjun, Li, Ziyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10049754/
https://www.ncbi.nlm.nih.gov/pubmed/36998720
http://dx.doi.org/10.1093/bioadv/vbad029
_version_ 1785014532862640128
author Li, Ruoxing
Zhang, Jianjun
Li, Ziyi
author_facet Li, Ruoxing
Zhang, Jianjun
Li, Ziyi
author_sort Li, Ruoxing
collection PubMed
description MOTIVATION: Cell label annotation is a challenging step in the analysis of single-cell RNA sequencing (scRNA-seq) data, especially for tissue types that are less commonly studied. The accumulation of scRNA-seq studies and biological knowledge leads to several well-maintained cell marker databases. Manually examining the cell marker lists against these databases can be difficult due to the large amount of available information. Additionally, simply overlapping the two lists without considering gene ranking might lead to unreliable results. Thus, an automated method with careful statistical testing is needed to facilitate the usage of these databases. RESULTS: We develop a user-friendly computational tool, EasyCellType, which automatically checks an input marker list obtained by differential expression analysis against the databases and provides annotation recommendations in graphical outcomes. The package provides two statistical tests, gene set enrichment analysis and a modified version of Fisher’s exact test, as well as customized database and tissue type choices. We also provide an interactive shiny application to annotate cells in a user-friendly graphical user interface. The simulation study and real-data applications demonstrate favorable results by the proposed method. AVAILABILITY AND IMPLEMENTATION: https://biostatistics.mdanderson.org/shinyapps/EasyCellType/; https://bioconductor.org/packages/devel/bioc/html/EasyCellType.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online.
format Online
Article
Text
id pubmed-10049754
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-100497542023-03-29 EasyCellType: marker-based cell-type annotation by automatically querying multiple databases Li, Ruoxing Zhang, Jianjun Li, Ziyi Bioinform Adv Original Paper MOTIVATION: Cell label annotation is a challenging step in the analysis of single-cell RNA sequencing (scRNA-seq) data, especially for tissue types that are less commonly studied. The accumulation of scRNA-seq studies and biological knowledge leads to several well-maintained cell marker databases. Manually examining the cell marker lists against these databases can be difficult due to the large amount of available information. Additionally, simply overlapping the two lists without considering gene ranking might lead to unreliable results. Thus, an automated method with careful statistical testing is needed to facilitate the usage of these databases. RESULTS: We develop a user-friendly computational tool, EasyCellType, which automatically checks an input marker list obtained by differential expression analysis against the databases and provides annotation recommendations in graphical outcomes. The package provides two statistical tests, gene set enrichment analysis and a modified version of Fisher’s exact test, as well as customized database and tissue type choices. We also provide an interactive shiny application to annotate cells in a user-friendly graphical user interface. The simulation study and real-data applications demonstrate favorable results by the proposed method. AVAILABILITY AND IMPLEMENTATION: https://biostatistics.mdanderson.org/shinyapps/EasyCellType/; https://bioconductor.org/packages/devel/bioc/html/EasyCellType.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics Advances online. Oxford University Press 2023-03-24 /pmc/articles/PMC10049754/ /pubmed/36998720 http://dx.doi.org/10.1093/bioadv/vbad029 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Li, Ruoxing
Zhang, Jianjun
Li, Ziyi
EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title_full EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title_fullStr EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title_full_unstemmed EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title_short EasyCellType: marker-based cell-type annotation by automatically querying multiple databases
title_sort easycelltype: marker-based cell-type annotation by automatically querying multiple databases
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10049754/
https://www.ncbi.nlm.nih.gov/pubmed/36998720
http://dx.doi.org/10.1093/bioadv/vbad029
work_keys_str_mv AT liruoxing easycelltypemarkerbasedcelltypeannotationbyautomaticallyqueryingmultipledatabases
AT zhangjianjun easycelltypemarkerbasedcelltypeannotationbyautomaticallyqueryingmultipledatabases
AT liziyi easycelltypemarkerbasedcelltypeannotationbyautomaticallyqueryingmultipledatabases