Cargando…

Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network

Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However,...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xingyan, Shen, Qunlun, Zhang, Shihua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977153/
https://www.ncbi.nlm.nih.gov/pubmed/36526433
http://dx.doi.org/10.1101/gr.276868.122
_version_ 1784899233084604416
author Liu, Xingyan
Shen, Qunlun
Zhang, Shihua
author_facet Liu, Xingyan
Shen, Qunlun
Zhang, Shihua
author_sort Liu, Xingyan
collection PubMed
description Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model, CAME, to learn aligned and interpretable cell and gene embeddings for cross-species cell-type assignment and gene module extraction from scRNA-seq data. CAME achieves significant improvements in cell-type characterization across distant species owing to the utilization of non-one-to-one homologous gene mapping ignored by early methods. Our large-scale benchmarking study shows that CAME significantly outperforms five classical methods in terms of cell-type assignment and model robustness to insufficiency and inconsistency of sequencing depths. CAME can transfer the major cell types and interneuron subtypes of human brains to mouse and discover shared cell-type-specific functions in homologous gene modules. CAME can align the trajectories of human and macaque spermatogenesis and reveal their conservative expression dynamics. In short, CAME can make accurate cross-species cell-type assignments even for nonmodel species and uncover shared and divergent characteristics between two species from scRNA-seq data.
format Online
Article
Text
id pubmed-9977153
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-99771532023-07-01 Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network Liu, Xingyan Shen, Qunlun Zhang, Shihua Genome Res Method Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model, CAME, to learn aligned and interpretable cell and gene embeddings for cross-species cell-type assignment and gene module extraction from scRNA-seq data. CAME achieves significant improvements in cell-type characterization across distant species owing to the utilization of non-one-to-one homologous gene mapping ignored by early methods. Our large-scale benchmarking study shows that CAME significantly outperforms five classical methods in terms of cell-type assignment and model robustness to insufficiency and inconsistency of sequencing depths. CAME can transfer the major cell types and interneuron subtypes of human brains to mouse and discover shared cell-type-specific functions in homologous gene modules. CAME can align the trajectories of human and macaque spermatogenesis and reveal their conservative expression dynamics. In short, CAME can make accurate cross-species cell-type assignments even for nonmodel species and uncover shared and divergent characteristics between two species from scRNA-seq data. Cold Spring Harbor Laboratory Press 2023-01 /pmc/articles/PMC9977153/ /pubmed/36526433 http://dx.doi.org/10.1101/gr.276868.122 Text en © 2023 Liu et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Method
Liu, Xingyan
Shen, Qunlun
Zhang, Shihua
Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title_full Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title_fullStr Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title_full_unstemmed Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title_short Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
title_sort cross-species cell-type assignment from single-cell rna-seq data by a heterogeneous graph neural network
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977153/
https://www.ncbi.nlm.nih.gov/pubmed/36526433
http://dx.doi.org/10.1101/gr.276868.122
work_keys_str_mv AT liuxingyan crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork
AT shenqunlun crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork
AT zhangshihua crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork