Cargando…

Multi-omics single-cell data integration and regulatory inference with graph-linked embedding

Despite the emergence of experimental methods for simultaneous measurement of multiple omics modalities in single cells, most single-cell datasets include only one modality. A major obstacle in integrating omics data from multiple modalities is that different omics layers typically have distinct fea...

Descripción completa

Detalles Bibliográficos
Autores principales: Cao, Zhi-Jie, Gao, Ge
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546775/
https://www.ncbi.nlm.nih.gov/pubmed/35501393
http://dx.doi.org/10.1038/s41587-022-01284-4
_version_ 1784805118753898496
author Cao, Zhi-Jie
Gao, Ge
author_facet Cao, Zhi-Jie
Gao, Ge
author_sort Cao, Zhi-Jie
collection PubMed
description Despite the emergence of experimental methods for simultaneous measurement of multiple omics modalities in single cells, most single-cell datasets include only one modality. A major obstacle in integrating omics data from multiple modalities is that different omics layers typically have distinct feature spaces. Here, we propose a computational framework called GLUE (graph-linked unified embedding), which bridges the gap by modeling regulatory interactions across omics layers explicitly. Systematic benchmarking demonstrated that GLUE is more accurate, robust and scalable than state-of-the-art tools for heterogeneous single-cell multi-omics data. We applied GLUE to various challenging tasks, including triple-omics integration, integrative regulatory inference and multi-omics human cell atlas construction over millions of cells, where GLUE was able to correct previous annotations. GLUE features a modular design that can be flexibly extended and enhanced for new analysis tasks. The full package is available online at https://github.com/gao-lab/GLUE.
format Online
Article
Text
id pubmed-9546775
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-95467752022-10-09 Multi-omics single-cell data integration and regulatory inference with graph-linked embedding Cao, Zhi-Jie Gao, Ge Nat Biotechnol Article Despite the emergence of experimental methods for simultaneous measurement of multiple omics modalities in single cells, most single-cell datasets include only one modality. A major obstacle in integrating omics data from multiple modalities is that different omics layers typically have distinct feature spaces. Here, we propose a computational framework called GLUE (graph-linked unified embedding), which bridges the gap by modeling regulatory interactions across omics layers explicitly. Systematic benchmarking demonstrated that GLUE is more accurate, robust and scalable than state-of-the-art tools for heterogeneous single-cell multi-omics data. We applied GLUE to various challenging tasks, including triple-omics integration, integrative regulatory inference and multi-omics human cell atlas construction over millions of cells, where GLUE was able to correct previous annotations. GLUE features a modular design that can be flexibly extended and enhanced for new analysis tasks. The full package is available online at https://github.com/gao-lab/GLUE. Nature Publishing Group US 2022-05-02 2022 /pmc/articles/PMC9546775/ /pubmed/35501393 http://dx.doi.org/10.1038/s41587-022-01284-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Cao, Zhi-Jie
Gao, Ge
Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title_full Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title_fullStr Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title_full_unstemmed Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title_short Multi-omics single-cell data integration and regulatory inference with graph-linked embedding
title_sort multi-omics single-cell data integration and regulatory inference with graph-linked embedding
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9546775/
https://www.ncbi.nlm.nih.gov/pubmed/35501393
http://dx.doi.org/10.1038/s41587-022-01284-4
work_keys_str_mv AT caozhijie multiomicssinglecelldataintegrationandregulatoryinferencewithgraphlinkedembedding
AT gaoge multiomicssinglecelldataintegrationandregulatoryinferencewithgraphlinkedembedding