Cargando…
Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network
Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977153/ https://www.ncbi.nlm.nih.gov/pubmed/36526433 http://dx.doi.org/10.1101/gr.276868.122 |
_version_ | 1784899233084604416 |
---|---|
author | Liu, Xingyan Shen, Qunlun Zhang, Shihua |
author_facet | Liu, Xingyan Shen, Qunlun Zhang, Shihua |
author_sort | Liu, Xingyan |
collection | PubMed |
description | Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model, CAME, to learn aligned and interpretable cell and gene embeddings for cross-species cell-type assignment and gene module extraction from scRNA-seq data. CAME achieves significant improvements in cell-type characterization across distant species owing to the utilization of non-one-to-one homologous gene mapping ignored by early methods. Our large-scale benchmarking study shows that CAME significantly outperforms five classical methods in terms of cell-type assignment and model robustness to insufficiency and inconsistency of sequencing depths. CAME can transfer the major cell types and interneuron subtypes of human brains to mouse and discover shared cell-type-specific functions in homologous gene modules. CAME can align the trajectories of human and macaque spermatogenesis and reveal their conservative expression dynamics. In short, CAME can make accurate cross-species cell-type assignments even for nonmodel species and uncover shared and divergent characteristics between two species from scRNA-seq data. |
format | Online Article Text |
id | pubmed-9977153 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-99771532023-07-01 Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network Liu, Xingyan Shen, Qunlun Zhang, Shihua Genome Res Method Cross-species comparative analyses of single-cell RNA sequencing (scRNA-seq) data allow us to explore, at single-cell resolution, the origins of the cellular diversity and evolutionary mechanisms that shape cellular form and function. Cell-type assignment is a crucial step to achieve that. However, the poorly annotated genome and limited known biomarkers hinder us from assigning cell identities for nonmodel species. Here, we design a heterogeneous graph neural network model, CAME, to learn aligned and interpretable cell and gene embeddings for cross-species cell-type assignment and gene module extraction from scRNA-seq data. CAME achieves significant improvements in cell-type characterization across distant species owing to the utilization of non-one-to-one homologous gene mapping ignored by early methods. Our large-scale benchmarking study shows that CAME significantly outperforms five classical methods in terms of cell-type assignment and model robustness to insufficiency and inconsistency of sequencing depths. CAME can transfer the major cell types and interneuron subtypes of human brains to mouse and discover shared cell-type-specific functions in homologous gene modules. CAME can align the trajectories of human and macaque spermatogenesis and reveal their conservative expression dynamics. In short, CAME can make accurate cross-species cell-type assignments even for nonmodel species and uncover shared and divergent characteristics between two species from scRNA-seq data. Cold Spring Harbor Laboratory Press 2023-01 /pmc/articles/PMC9977153/ /pubmed/36526433 http://dx.doi.org/10.1101/gr.276868.122 Text en © 2023 Liu et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by-nc/4.0/This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) . |
spellingShingle | Method Liu, Xingyan Shen, Qunlun Zhang, Shihua Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title | Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title_full | Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title_fullStr | Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title_full_unstemmed | Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title_short | Cross-species cell-type assignment from single-cell RNA-seq data by a heterogeneous graph neural network |
title_sort | cross-species cell-type assignment from single-cell rna-seq data by a heterogeneous graph neural network |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977153/ https://www.ncbi.nlm.nih.gov/pubmed/36526433 http://dx.doi.org/10.1101/gr.276868.122 |
work_keys_str_mv | AT liuxingyan crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork AT shenqunlun crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork AT zhangshihua crossspeciescelltypeassignmentfromsinglecellrnaseqdatabyaheterogeneousgraphneuralnetwork |