Cargando…
Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN
Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homol...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9915700/ https://www.ncbi.nlm.nih.gov/pubmed/36778387 http://dx.doi.org/10.1101/2023.02.03.526939 |
_version_ | 1784885952301236224 |
---|---|
author | Rosen, Yanay Brbić, Maria Roohani, Yusuf Swanson, Kyle Li, Ziang Leskovec, Jure |
author_facet | Rosen, Yanay Brbić, Maria Roohani, Yusuf Swanson, Kyle Li, Ziang Leskovec, Jure |
author_sort | Rosen, Yanay |
collection | PubMed |
description | Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here, we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes’ biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN has a unique ability to detect functionally related genes co-expressed across species, redefining differential expression for cross-species analysis. We apply SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets. We show that cell embeddings learnt in SATURN can be effectively used to transfer annotations across species and identify both homologous and species-specific cell types, even across evolutionarily remote species. Finally, we use SATURN to reannotate the five species Cell Atlas of Human Trabecular Meshwork and Aqueous Outflow Structures and find evidence of potentially divergent functions between glaucoma associated genes in humans and other species. |
format | Online Article Text |
id | pubmed-9915700 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-99157002023-02-11 Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN Rosen, Yanay Brbić, Maria Roohani, Yusuf Swanson, Kyle Li, Ziang Leskovec, Jure bioRxiv Article Analysis of single-cell datasets generated from diverse organisms offers unprecedented opportunities to unravel fundamental evolutionary processes of conservation and diversification of cell types. However, inter-species genomic differences limit the joint analysis of cross-species datasets to homologous genes. Here, we present SATURN, a deep learning method for learning universal cell embeddings that encodes genes’ biological properties using protein language models. By coupling protein embeddings from language models with RNA expression, SATURN integrates datasets profiled from different species regardless of their genomic similarity. SATURN has a unique ability to detect functionally related genes co-expressed across species, redefining differential expression for cross-species analysis. We apply SATURN to three species whole-organism atlases and frog and zebrafish embryogenesis datasets. We show that cell embeddings learnt in SATURN can be effectively used to transfer annotations across species and identify both homologous and species-specific cell types, even across evolutionarily remote species. Finally, we use SATURN to reannotate the five species Cell Atlas of Human Trabecular Meshwork and Aqueous Outflow Structures and find evidence of potentially divergent functions between glaucoma associated genes in humans and other species. Cold Spring Harbor Laboratory 2023-09-24 /pmc/articles/PMC9915700/ /pubmed/36778387 http://dx.doi.org/10.1101/2023.02.03.526939 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Rosen, Yanay Brbić, Maria Roohani, Yusuf Swanson, Kyle Li, Ziang Leskovec, Jure Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title | Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title_full | Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title_fullStr | Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title_full_unstemmed | Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title_short | Towards Universal Cell Embeddings: Integrating Single-cell RNA-seq Datasets across Species with SATURN |
title_sort | towards universal cell embeddings: integrating single-cell rna-seq datasets across species with saturn |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9915700/ https://www.ncbi.nlm.nih.gov/pubmed/36778387 http://dx.doi.org/10.1101/2023.02.03.526939 |
work_keys_str_mv | AT rosenyanay towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn AT brbicmaria towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn AT roohaniyusuf towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn AT swansonkyle towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn AT liziang towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn AT leskovecjure towardsuniversalcellembeddingsintegratingsinglecellrnaseqdatasetsacrossspecieswithsaturn |