Cargando…

Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling

How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sa...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Hongwei, Wang, Mingyang, Zhang, Xinyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955212/
https://www.ncbi.nlm.nih.gov/pubmed/36832662
http://dx.doi.org/10.3390/e25020297
_version_ 1784894299354169344
author Zhong, Hongwei
Wang, Mingyang
Zhang, Xinyue
author_facet Zhong, Hongwei
Wang, Mingyang
Zhang, Xinyue
author_sort Zhong, Hongwei
collection PubMed
description How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sampling technology under the guidance of metapaths, which can compress the network and retain the semantic information in the network as much as possible. At the same time, LHGI adopts the idea of contrastive learning, and takes the mutual information between normal/negative node vectors and the global graph vector as the objective function to guide the learning process. By maximizing the mutual information, LHGI solves the problem of how to train the network without supervised information. The experimental results show that, compared with the baseline models, the LHGI model shows a better feature extraction capability both in medium-scale unsupervised heterogeneous networks and in large-scale unsupervised heterogeneous networks. The node vectors generated by the LHGI model achieve better performance in the downstream mining tasks.
format Online
Article
Text
id pubmed-9955212
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99552122023-02-25 Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling Zhong, Hongwei Wang, Mingyang Zhang, Xinyue Entropy (Basel) Article How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sampling technology under the guidance of metapaths, which can compress the network and retain the semantic information in the network as much as possible. At the same time, LHGI adopts the idea of contrastive learning, and takes the mutual information between normal/negative node vectors and the global graph vector as the objective function to guide the learning process. By maximizing the mutual information, LHGI solves the problem of how to train the network without supervised information. The experimental results show that, compared with the baseline models, the LHGI model shows a better feature extraction capability both in medium-scale unsupervised heterogeneous networks and in large-scale unsupervised heterogeneous networks. The node vectors generated by the LHGI model achieve better performance in the downstream mining tasks. MDPI 2023-02-04 /pmc/articles/PMC9955212/ /pubmed/36832662 http://dx.doi.org/10.3390/e25020297 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhong, Hongwei
Wang, Mingyang
Zhang, Xinyue
Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title_full Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title_fullStr Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title_full_unstemmed Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title_short Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
title_sort unsupervised embedding learning for large-scale heterogeneous networks based on metapath graph sampling
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955212/
https://www.ncbi.nlm.nih.gov/pubmed/36832662
http://dx.doi.org/10.3390/e25020297
work_keys_str_mv AT zhonghongwei unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling
AT wangmingyang unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling
AT zhangxinyue unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling