Cargando…

A universal framework for single-cell multi-omics data integration with graph convolutional networks

Single-cell omics data are growing at an unprecedented rate, whereas effective integration of them remains challenging due to different sequencing methods, quality, and expression pattern of each omics data. In this study, we propose a universal framework for the integration of single-cell multi-omi...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Hongli, Zhang, Bin, Liu, Long, Li, Shan, Gao, Xin, Yu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199767/
https://www.ncbi.nlm.nih.gov/pubmed/36929841
http://dx.doi.org/10.1093/bib/bbad081
_version_ 1785045001037676544
author Gao, Hongli
Zhang, Bin
Liu, Long
Li, Shan
Gao, Xin
Yu, Bin
author_facet Gao, Hongli
Zhang, Bin
Liu, Long
Li, Shan
Gao, Xin
Yu, Bin
author_sort Gao, Hongli
collection PubMed
description Single-cell omics data are growing at an unprecedented rate, whereas effective integration of them remains challenging due to different sequencing methods, quality, and expression pattern of each omics data. In this study, we propose a universal framework for the integration of single-cell multi-omics data based on graph convolutional network (GCN-SC). Among the multiple single-cell data, GCN-SC usually selects one data with the largest number of cells as the reference and the rest as the query dataset. It utilizes mutual nearest neighbor algorithm to identify cell-pairs, which provide connections between cells both within and across the reference and query datasets. A GCN algorithm further takes the mixed graph constructed from these cell-pairs to adjust count matrices from the query datasets. Finally, dimension reduction is performed by using non-negative matrix factorization before visualization. By applying GCN-SC on six datasets, we show that GCN-SC can effectively integrate sequencing data from multiple single-cell sequencing technologies, species or different omics, which outperforms the state-of-the-art methods, including Seurat, LIGER, GLUER and Pamona.
format Online
Article
Text
id pubmed-10199767
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101997672023-05-21 A universal framework for single-cell multi-omics data integration with graph convolutional networks Gao, Hongli Zhang, Bin Liu, Long Li, Shan Gao, Xin Yu, Bin Brief Bioinform Problem Solving Protocol Single-cell omics data are growing at an unprecedented rate, whereas effective integration of them remains challenging due to different sequencing methods, quality, and expression pattern of each omics data. In this study, we propose a universal framework for the integration of single-cell multi-omics data based on graph convolutional network (GCN-SC). Among the multiple single-cell data, GCN-SC usually selects one data with the largest number of cells as the reference and the rest as the query dataset. It utilizes mutual nearest neighbor algorithm to identify cell-pairs, which provide connections between cells both within and across the reference and query datasets. A GCN algorithm further takes the mixed graph constructed from these cell-pairs to adjust count matrices from the query datasets. Finally, dimension reduction is performed by using non-negative matrix factorization before visualization. By applying GCN-SC on six datasets, we show that GCN-SC can effectively integrate sequencing data from multiple single-cell sequencing technologies, species or different omics, which outperforms the state-of-the-art methods, including Seurat, LIGER, GLUER and Pamona. Oxford University Press 2023-03-17 /pmc/articles/PMC10199767/ /pubmed/36929841 http://dx.doi.org/10.1093/bib/bbad081 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Gao, Hongli
Zhang, Bin
Liu, Long
Li, Shan
Gao, Xin
Yu, Bin
A universal framework for single-cell multi-omics data integration with graph convolutional networks
title A universal framework for single-cell multi-omics data integration with graph convolutional networks
title_full A universal framework for single-cell multi-omics data integration with graph convolutional networks
title_fullStr A universal framework for single-cell multi-omics data integration with graph convolutional networks
title_full_unstemmed A universal framework for single-cell multi-omics data integration with graph convolutional networks
title_short A universal framework for single-cell multi-omics data integration with graph convolutional networks
title_sort universal framework for single-cell multi-omics data integration with graph convolutional networks
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199767/
https://www.ncbi.nlm.nih.gov/pubmed/36929841
http://dx.doi.org/10.1093/bib/bbad081
work_keys_str_mv AT gaohongli auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT zhangbin auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT liulong auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT lishan auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT gaoxin auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT yubin auniversalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT gaohongli universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT zhangbin universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT liulong universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT lishan universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT gaoxin universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks
AT yubin universalframeworkforsinglecellmultiomicsdataintegrationwithgraphconvolutionalnetworks