Cargando…

scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings

Single-cell transcriptomics is rapidly advancing our understanding of the composition of complex tissues and biological cells, and single-cell RNA sequencing (scRNA-seq) holds great potential for identifying and characterizing the cell composition of complex tissues. Cell type identification by anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiao, Linfang, Wang, Gan, Dai, Huanhuan, Li, Xue, Wang, Shuang, Song, Tao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136153/
https://www.ncbi.nlm.nih.gov/pubmed/37189359
http://dx.doi.org/10.3390/biom13040611
_version_ 1785032148068073472
author Jiao, Linfang
Wang, Gan
Dai, Huanhuan
Li, Xue
Wang, Shuang
Song, Tao
author_facet Jiao, Linfang
Wang, Gan
Dai, Huanhuan
Li, Xue
Wang, Shuang
Song, Tao
author_sort Jiao, Linfang
collection PubMed
description Single-cell transcriptomics is rapidly advancing our understanding of the composition of complex tissues and biological cells, and single-cell RNA sequencing (scRNA-seq) holds great potential for identifying and characterizing the cell composition of complex tissues. Cell type identification by analyzing scRNA-seq data is mostly limited by time-consuming and irreproducible manual annotation. As scRNA-seq technology scales to thousands of cells per experiment, the exponential increase in the number of cell samples makes manual annotation more difficult. On the other hand, the sparsity of gene transcriptome data remains a major challenge. This paper applied the idea of the transformer to single-cell classification tasks based on scRNA-seq data. We propose scTransSort, a cell-type annotation method pretrained with single-cell transcriptomics data. The scTransSort incorporates a method of representing genes as gene expression embedding blocks to reduce the sparsity of data used for cell type identification and reduce the computational complexity. The feature of scTransSort is that its implementation of intelligent information extraction for unordered data, automatically extracting valid features of cell types without the need for manually labeled features and additional references. In experiments on cells from 35 human and 26 mouse tissues, scTransSort successfully elucidated its high accuracy and high performance for cell type identification, and demonstrated its own high robustness and generalization ability.
format Online
Article
Text
id pubmed-10136153
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101361532023-04-28 scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings Jiao, Linfang Wang, Gan Dai, Huanhuan Li, Xue Wang, Shuang Song, Tao Biomolecules Article Single-cell transcriptomics is rapidly advancing our understanding of the composition of complex tissues and biological cells, and single-cell RNA sequencing (scRNA-seq) holds great potential for identifying and characterizing the cell composition of complex tissues. Cell type identification by analyzing scRNA-seq data is mostly limited by time-consuming and irreproducible manual annotation. As scRNA-seq technology scales to thousands of cells per experiment, the exponential increase in the number of cell samples makes manual annotation more difficult. On the other hand, the sparsity of gene transcriptome data remains a major challenge. This paper applied the idea of the transformer to single-cell classification tasks based on scRNA-seq data. We propose scTransSort, a cell-type annotation method pretrained with single-cell transcriptomics data. The scTransSort incorporates a method of representing genes as gene expression embedding blocks to reduce the sparsity of data used for cell type identification and reduce the computational complexity. The feature of scTransSort is that its implementation of intelligent information extraction for unordered data, automatically extracting valid features of cell types without the need for manually labeled features and additional references. In experiments on cells from 35 human and 26 mouse tissues, scTransSort successfully elucidated its high accuracy and high performance for cell type identification, and demonstrated its own high robustness and generalization ability. MDPI 2023-03-28 /pmc/articles/PMC10136153/ /pubmed/37189359 http://dx.doi.org/10.3390/biom13040611 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jiao, Linfang
Wang, Gan
Dai, Huanhuan
Li, Xue
Wang, Shuang
Song, Tao
scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title_full scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title_fullStr scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title_full_unstemmed scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title_short scTransSort: Transformers for Intelligent Annotation of Cell Types by Gene Embeddings
title_sort sctranssort: transformers for intelligent annotation of cell types by gene embeddings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10136153/
https://www.ncbi.nlm.nih.gov/pubmed/37189359
http://dx.doi.org/10.3390/biom13040611
work_keys_str_mv AT jiaolinfang sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings
AT wanggan sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings
AT daihuanhuan sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings
AT lixue sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings
AT wangshuang sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings
AT songtao sctranssorttransformersforintelligentannotationofcelltypesbygeneembeddings