Cargando…
Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling
How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955212/ https://www.ncbi.nlm.nih.gov/pubmed/36832662 http://dx.doi.org/10.3390/e25020297 |
_version_ | 1784894299354169344 |
---|---|
author | Zhong, Hongwei Wang, Mingyang Zhang, Xinyue |
author_facet | Zhong, Hongwei Wang, Mingyang Zhang, Xinyue |
author_sort | Zhong, Hongwei |
collection | PubMed |
description | How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sampling technology under the guidance of metapaths, which can compress the network and retain the semantic information in the network as much as possible. At the same time, LHGI adopts the idea of contrastive learning, and takes the mutual information between normal/negative node vectors and the global graph vector as the objective function to guide the learning process. By maximizing the mutual information, LHGI solves the problem of how to train the network without supervised information. The experimental results show that, compared with the baseline models, the LHGI model shows a better feature extraction capability both in medium-scale unsupervised heterogeneous networks and in large-scale unsupervised heterogeneous networks. The node vectors generated by the LHGI model achieve better performance in the downstream mining tasks. |
format | Online Article Text |
id | pubmed-9955212 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99552122023-02-25 Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling Zhong, Hongwei Wang, Mingyang Zhang, Xinyue Entropy (Basel) Article How to learn the embedding vectors of nodes in unsupervised large-scale heterogeneous networks is a key problem in heterogeneous network embedding research. This paper proposes an unsupervised embedding learning model, named LHGI (Large-scale Heterogeneous Graph Infomax). LHGI adopts the subgraph sampling technology under the guidance of metapaths, which can compress the network and retain the semantic information in the network as much as possible. At the same time, LHGI adopts the idea of contrastive learning, and takes the mutual information between normal/negative node vectors and the global graph vector as the objective function to guide the learning process. By maximizing the mutual information, LHGI solves the problem of how to train the network without supervised information. The experimental results show that, compared with the baseline models, the LHGI model shows a better feature extraction capability both in medium-scale unsupervised heterogeneous networks and in large-scale unsupervised heterogeneous networks. The node vectors generated by the LHGI model achieve better performance in the downstream mining tasks. MDPI 2023-02-04 /pmc/articles/PMC9955212/ /pubmed/36832662 http://dx.doi.org/10.3390/e25020297 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhong, Hongwei Wang, Mingyang Zhang, Xinyue Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title | Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title_full | Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title_fullStr | Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title_full_unstemmed | Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title_short | Unsupervised Embedding Learning for Large-Scale Heterogeneous Networks Based on Metapath Graph Sampling |
title_sort | unsupervised embedding learning for large-scale heterogeneous networks based on metapath graph sampling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955212/ https://www.ncbi.nlm.nih.gov/pubmed/36832662 http://dx.doi.org/10.3390/e25020297 |
work_keys_str_mv | AT zhonghongwei unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling AT wangmingyang unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling AT zhangxinyue unsupervisedembeddinglearningforlargescaleheterogeneousnetworksbasedonmetapathgraphsampling |