Cargando…

A Scalable Similarity-Popularity Link Prediction Method

Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable rese...

Descripción completa

Detalles Bibliográficos
Autores principales: Kerrache, Said, Alharbi, Ruwayda, Benhidour, Hafida
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156691/
https://www.ncbi.nlm.nih.gov/pubmed/32286363
http://dx.doi.org/10.1038/s41598-020-62636-1
_version_ 1783522265767346176
author Kerrache, Said
Alharbi, Ruwayda
Benhidour, Hafida
author_facet Kerrache, Said
Alharbi, Ruwayda
Benhidour, Hafida
author_sort Kerrache, Said
collection PubMed
description Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable research efforts have been invested into the development of increasingly accurate prediction methods. Most of the proposed algorithms, however, have limited use in practice because of their high computational requirements. The aim of this work is to develop a scalable link prediction algorithm that offers a higher overall predictive power than existing methods. The proposed solution falls into the class of global, parameter-free similarity-popularity-based methods, and in it, we assume that network topology is governed by three factors: popularity of the nodes, their similarity and the attraction induced by local neighbourhood. In our approach, popularity and neighbourhood-caused attraction are computed directly from the network topology and factored out by introducing a specific weight map, which is then used to estimate the dissimilarity between non-adjacent nodes through shortest path distances. We show through extensive experimental testing that the proposed method produces highly accurate predictions at a fraction of the computational cost required by existing global methods and at a low additional cost compared to local methods. The scalability of the proposed algorithm is demonstrated on several large networks having hundreds of thousands of nodes.
format Online
Article
Text
id pubmed-7156691
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-71566912020-04-19 A Scalable Similarity-Popularity Link Prediction Method Kerrache, Said Alharbi, Ruwayda Benhidour, Hafida Sci Rep Article Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable research efforts have been invested into the development of increasingly accurate prediction methods. Most of the proposed algorithms, however, have limited use in practice because of their high computational requirements. The aim of this work is to develop a scalable link prediction algorithm that offers a higher overall predictive power than existing methods. The proposed solution falls into the class of global, parameter-free similarity-popularity-based methods, and in it, we assume that network topology is governed by three factors: popularity of the nodes, their similarity and the attraction induced by local neighbourhood. In our approach, popularity and neighbourhood-caused attraction are computed directly from the network topology and factored out by introducing a specific weight map, which is then used to estimate the dissimilarity between non-adjacent nodes through shortest path distances. We show through extensive experimental testing that the proposed method produces highly accurate predictions at a fraction of the computational cost required by existing global methods and at a low additional cost compared to local methods. The scalability of the proposed algorithm is demonstrated on several large networks having hundreds of thousands of nodes. Nature Publishing Group UK 2020-04-14 /pmc/articles/PMC7156691/ /pubmed/32286363 http://dx.doi.org/10.1038/s41598-020-62636-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Kerrache, Said
Alharbi, Ruwayda
Benhidour, Hafida
A Scalable Similarity-Popularity Link Prediction Method
title A Scalable Similarity-Popularity Link Prediction Method
title_full A Scalable Similarity-Popularity Link Prediction Method
title_fullStr A Scalable Similarity-Popularity Link Prediction Method
title_full_unstemmed A Scalable Similarity-Popularity Link Prediction Method
title_short A Scalable Similarity-Popularity Link Prediction Method
title_sort scalable similarity-popularity link prediction method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156691/
https://www.ncbi.nlm.nih.gov/pubmed/32286363
http://dx.doi.org/10.1038/s41598-020-62636-1
work_keys_str_mv AT kerrachesaid ascalablesimilaritypopularitylinkpredictionmethod
AT alharbiruwayda ascalablesimilaritypopularitylinkpredictionmethod
AT benhidourhafida ascalablesimilaritypopularitylinkpredictionmethod
AT kerrachesaid scalablesimilaritypopularitylinkpredictionmethod
AT alharbiruwayda scalablesimilaritypopularitylinkpredictionmethod
AT benhidourhafida scalablesimilaritypopularitylinkpredictionmethod