Cargando…

MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis

In light of the rapid accumulation of large-scale omics datasets, numerous studies have attempted to characterize the molecular and clinical features of cancers from a multi-omics perspective. However, there are great challenges in integrating multi-omics using machine learning methods for cancer su...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xiao, Ma, Jie, Leng, Ling, Han, Mingfei, Li, Mansheng, He, Fuchu, Zhu, Yunping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847688/
https://www.ncbi.nlm.nih.gov/pubmed/35186034
http://dx.doi.org/10.3389/fgene.2022.806842
_version_ 1784652098115207168
author Li, Xiao
Ma, Jie
Leng, Ling
Han, Mingfei
Li, Mansheng
He, Fuchu
Zhu, Yunping
author_facet Li, Xiao
Ma, Jie
Leng, Ling
Han, Mingfei
Li, Mansheng
He, Fuchu
Zhu, Yunping
author_sort Li, Xiao
collection PubMed
description In light of the rapid accumulation of large-scale omics datasets, numerous studies have attempted to characterize the molecular and clinical features of cancers from a multi-omics perspective. However, there are great challenges in integrating multi-omics using machine learning methods for cancer subtype classification. In this study, MoGCN, a multi-omics integration model based on graph convolutional network (GCN) was developed for cancer subtype classification and analysis. Genomics, transcriptomics and proteomics datasets for 511 breast invasive carcinoma (BRCA) samples were downloaded from the Cancer Genome Atlas (TCGA). The autoencoder (AE) and the similarity network fusion (SNF) methods were used to reduce dimensionality and construct the patient similarity network (PSN), respectively. Then the vector features and the PSN were input into the GCN for training and testing. Feature extraction and network visualization were used for further biological knowledge discovery and subtype classification. In the analysis of multi-dimensional omics data of the BRCA samples in TCGA, MoGCN achieved the highest accuracy in cancer subtype classification compared with several popular algorithms. Moreover, MoGCN can extract the most significant features of each omics layer and provide candidate functional molecules for further analysis of their biological effects. And network visualization showed that MoGCN could make clinically intuitive diagnosis. The generality of MoGCN was proven on the TCGA pan-kidney cancer datasets. MoGCN and datasets are public available at https://github.com/Lifoof/MoGCN. Our study shows that MoGCN performs well for heterogeneous data integration and the interpretability of classification results, which confers great potential for applications in biomarker identification and clinical diagnosis.
format Online
Article
Text
id pubmed-8847688
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88476882022-02-17 MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis Li, Xiao Ma, Jie Leng, Ling Han, Mingfei Li, Mansheng He, Fuchu Zhu, Yunping Front Genet Genetics In light of the rapid accumulation of large-scale omics datasets, numerous studies have attempted to characterize the molecular and clinical features of cancers from a multi-omics perspective. However, there are great challenges in integrating multi-omics using machine learning methods for cancer subtype classification. In this study, MoGCN, a multi-omics integration model based on graph convolutional network (GCN) was developed for cancer subtype classification and analysis. Genomics, transcriptomics and proteomics datasets for 511 breast invasive carcinoma (BRCA) samples were downloaded from the Cancer Genome Atlas (TCGA). The autoencoder (AE) and the similarity network fusion (SNF) methods were used to reduce dimensionality and construct the patient similarity network (PSN), respectively. Then the vector features and the PSN were input into the GCN for training and testing. Feature extraction and network visualization were used for further biological knowledge discovery and subtype classification. In the analysis of multi-dimensional omics data of the BRCA samples in TCGA, MoGCN achieved the highest accuracy in cancer subtype classification compared with several popular algorithms. Moreover, MoGCN can extract the most significant features of each omics layer and provide candidate functional molecules for further analysis of their biological effects. And network visualization showed that MoGCN could make clinically intuitive diagnosis. The generality of MoGCN was proven on the TCGA pan-kidney cancer datasets. MoGCN and datasets are public available at https://github.com/Lifoof/MoGCN. Our study shows that MoGCN performs well for heterogeneous data integration and the interpretability of classification results, which confers great potential for applications in biomarker identification and clinical diagnosis. Frontiers Media S.A. 2022-02-02 /pmc/articles/PMC8847688/ /pubmed/35186034 http://dx.doi.org/10.3389/fgene.2022.806842 Text en Copyright © 2022 Li, Ma, Leng, Han, Li, He and Zhu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Li, Xiao
Ma, Jie
Leng, Ling
Han, Mingfei
Li, Mansheng
He, Fuchu
Zhu, Yunping
MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title_full MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title_fullStr MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title_full_unstemmed MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title_short MoGCN: A Multi-Omics Integration Method Based on Graph Convolutional Network for Cancer Subtype Analysis
title_sort mogcn: a multi-omics integration method based on graph convolutional network for cancer subtype analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8847688/
https://www.ncbi.nlm.nih.gov/pubmed/35186034
http://dx.doi.org/10.3389/fgene.2022.806842
work_keys_str_mv AT lixiao mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT majie mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT lengling mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT hanmingfei mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT limansheng mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT hefuchu mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis
AT zhuyunping mogcnamultiomicsintegrationmethodbasedongraphconvolutionalnetworkforcancersubtypeanalysis