Cargando…

Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN

Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homol...

Descripción completa

Detalles Bibliográficos
Autores principales: Rosen, Yanay, Brbić, Maria, Roohani, Yusuf, Swanson, Kyle, Li, Ziang, Leskovec, Jure
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9915700/
https://www.ncbi.nlm.nih.gov/pubmed/36778387
http://dx.doi.org/10.1101/2023.02.03.526939
_version_ 1784885952301236224
author Rosen, Yanay
Brbić, Maria
Roohani, Yusuf
Swanson, Kyle
Li, Ziang
Leskovec, Jure
author_facet Rosen, Yanay
Brbić, Maria
Roohani, Yusuf
Swanson, Kyle
Li, Ziang
Leskovec, Jure
author_sort Rosen, Yanay
collection PubMed
description Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here, we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes’ biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN has a unique ability to detect functionally related genes co-expressed across species, redefining differential expression for cross-species analysis. We apply SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets. We show that cell embeddings learnt in SATURN can be effectively used to transfer annotations across species and identify both homologous and species-specific cell types, even across evolutionarily remote species. Finally, we use SATURN to reannotate the five species Cell Atlas of Human Trabecular Meshwork and Aqueous Outflow Structures and find evidence of potentially divergent functions between glaucoma associated genes in humans and other species.
format Online
Article
Text
id pubmed-9915700
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-99157002023-02-11 Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN Rosen, Yanay Brbić, Maria Roohani, Yusuf Swanson, Kyle Li, Ziang Leskovec, Jure bioRxiv Article Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here, we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes’ biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN has a unique ability to detect functionally related genes co-expressed across species, redefining differential expression for cross-species analysis. We apply SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets. We show that cell embeddings learnt in SATURN can be effectively used to transfer annotations across species and identify both homologous and species-specific cell types, even across evolutionarily remote species. Finally, we use SATURN to reannotate the five species Cell Atlas of Human Trabecular Meshwork and Aqueous Outflow Structures and find evidence of potentially divergent functions between glaucoma associated genes in humans and other species. Cold Spring Harbor Laboratory 2023-09-24 /pmc/articles/PMC9915700/ /pubmed/36778387 http://dx.doi.org/10.1101/2023.02.03.526939 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Rosen, Yanay
Brbić, Maria
Roohani, Yusuf
Swanson, Kyle
Li, Ziang
Leskovec, Jure
Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title_full Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title_fullStr Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title_full_unstemmed Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title_short Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
title_sort towards universal cell embeddings: integrating single-cell rna-seq datasets across species with saturn
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9915700/
https://www.ncbi.nlm.nih.gov/pubmed/36778387
http://dx.doi.org/10.1101/2023.02.03.526939
work_keys_str_mv AT rosenyanay towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn
AT brbicmaria towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn
AT roohaniyusuf towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn
AT swansonkyle towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn
AT liziang towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn
AT leskovecjure towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn