Cargando…

Survey on graph embeddings and their applications to machine learning problems on graphs

Dealing with relational data always required significant computational resources, domain expertise and task-dependent feature engineering to incorporate structural information into a predictive model. Nowadays, a family of automated graph feature engineering techniques has been proposed in different...

Descripción completa

Detalles Bibliográficos
Autores principales:	Makarov, Ilya, Kiselev, Dmitrii, Nikitinsky, Nikita, Subelj, Lovro
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2021
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959646/ https://www.ncbi.nlm.nih.gov/pubmed/33817007 http://dx.doi.org/10.7717/peerj-cs.357

_version_	1783664994971287552
author	Makarov, Ilya Kiselev, Dmitrii Nikitinsky, Nikita Subelj, Lovro
author_facet	Makarov, Ilya Kiselev, Dmitrii Nikitinsky, Nikita Subelj, Lovro
author_sort	Makarov, Ilya
collection	PubMed
description	Dealing with relational data always required significant computational resources, domain expertise and task-dependent feature engineering to incorporate structural information into a predictive model. Nowadays, a family of automated graph feature engineering techniques has been proposed in different streams of literature. So-called graph embeddings provide a powerful tool to construct vectorized feature spaces for graphs and their components, such as nodes, edges and subgraphs under preserving inner graph properties. Using the constructed feature spaces, many machine learning problems on graphs can be solved via standard frameworks suitable for vectorized feature representation. Our survey aims to describe the core concepts of graph embeddings and provide several taxonomies for their description. First, we start with the methodological approach and extract three types of graph embedding models based on matrix factorization, random-walks and deep learning approaches. Next, we describe how different types of networks impact the ability of models to incorporate structural and attributed data into a unified embedding. Going further, we perform a thorough evaluation of graph embedding applications to machine learning problems on graphs, among which are node classification, link prediction, clustering, visualization, compression, and a family of the whole graph embedding algorithms suitable for graph classification, similarity and alignment problems. Finally, we overview the existing applications of graph embeddings to computer science domains, formulate open problems and provide experiment results, explaining how different networks properties result in graph embeddings quality in the four classic machine learning problems on graphs, such as node classification, link prediction, clustering and graph visualization. As a result, our survey covers a new rapidly growing field of network feature engineering, presents an in-depth analysis of models based on network types, and overviews a wide range of applications to machine learning problems on graphs.
format	Online Article Text
id	pubmed-7959646
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-79596462021-04-02 Survey on graph embeddings and their applications to machine learning problems on graphs Makarov, Ilya Kiselev, Dmitrii Nikitinsky, Nikita Subelj, Lovro PeerJ Comput Sci Artificial Intelligence Dealing with relational data always required significant computational resources, domain expertise and task-dependent feature engineering to incorporate structural information into a predictive model. Nowadays, a family of automated graph feature engineering techniques has been proposed in different streams of literature. So-called graph embeddings provide a powerful tool to construct vectorized feature spaces for graphs and their components, such as nodes, edges and subgraphs under preserving inner graph properties. Using the constructed feature spaces, many machine learning problems on graphs can be solved via standard frameworks suitable for vectorized feature representation. Our survey aims to describe the core concepts of graph embeddings and provide several taxonomies for their description. First, we start with the methodological approach and extract three types of graph embedding models based on matrix factorization, random-walks and deep learning approaches. Next, we describe how different types of networks impact the ability of models to incorporate structural and attributed data into a unified embedding. Going further, we perform a thorough evaluation of graph embedding applications to machine learning problems on graphs, among which are node classification, link prediction, clustering, visualization, compression, and a family of the whole graph embedding algorithms suitable for graph classification, similarity and alignment problems. Finally, we overview the existing applications of graph embeddings to computer science domains, formulate open problems and provide experiment results, explaining how different networks properties result in graph embeddings quality in the four classic machine learning problems on graphs, such as node classification, link prediction, clustering and graph visualization. As a result, our survey covers a new rapidly growing field of network feature engineering, presents an in-depth analysis of models based on network types, and overviews a wide range of applications to machine learning problems on graphs. PeerJ Inc. 2021-02-04 /pmc/articles/PMC7959646/ /pubmed/33817007 http://dx.doi.org/10.7717/peerj-cs.357 Text en © 2021 Makarov et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Makarov, Ilya Kiselev, Dmitrii Nikitinsky, Nikita Subelj, Lovro Survey on graph embeddings and their applications to machine learning problems on graphs
title	Survey on graph embeddings and their applications to machine learning problems on graphs
title_full	Survey on graph embeddings and their applications to machine learning problems on graphs
title_fullStr	Survey on graph embeddings and their applications to machine learning problems on graphs
title_full_unstemmed	Survey on graph embeddings and their applications to machine learning problems on graphs
title_short	Survey on graph embeddings and their applications to machine learning problems on graphs
title_sort	survey on graph embeddings and their applications to machine learning problems on graphs
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959646/ https://www.ncbi.nlm.nih.gov/pubmed/33817007 http://dx.doi.org/10.7717/peerj-cs.357
work_keys_str_mv	AT makarovilya surveyongraphembeddingsandtheirapplicationstomachinelearningproblemsongraphs AT kiselevdmitrii surveyongraphembeddingsandtheirapplicationstomachinelearningproblemsongraphs AT nikitinskynikita surveyongraphembeddingsandtheirapplicationstomachinelearningproblemsongraphs AT subeljlovro surveyongraphembeddingsandtheirapplicationstomachinelearningproblemsongraphs

Survey on graph embeddings and their applications to machine learning problems on graphs

Ejemplares similares