Cargando…
A Scalable Similarity-Popularity Link Prediction Method
Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable rese...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156691/ https://www.ncbi.nlm.nih.gov/pubmed/32286363 http://dx.doi.org/10.1038/s41598-020-62636-1 |
_version_ | 1783522265767346176 |
---|---|
author | Kerrache, Said Alharbi, Ruwayda Benhidour, Hafida |
author_facet | Kerrache, Said Alharbi, Ruwayda Benhidour, Hafida |
author_sort | Kerrache, Said |
collection | PubMed |
description | Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable research efforts have been invested into the development of increasingly accurate prediction methods. Most of the proposed algorithms, however, have limited use in practice because of their high computational requirements. The aim of this work is to develop a scalable link prediction algorithm that offers a higher overall predictive power than existing methods. The proposed solution falls into the class of global, parameter-free similarity-popularity-based methods, and in it, we assume that network topology is governed by three factors: popularity of the nodes, their similarity and the attraction induced by local neighbourhood. In our approach, popularity and neighbourhood-caused attraction are computed directly from the network topology and factored out by introducing a specific weight map, which is then used to estimate the dissimilarity between non-adjacent nodes through shortest path distances. We show through extensive experimental testing that the proposed method produces highly accurate predictions at a fraction of the computational cost required by existing global methods and at a low additional cost compared to local methods. The scalability of the proposed algorithm is demonstrated on several large networks having hundreds of thousands of nodes. |
format | Online Article Text |
id | pubmed-7156691 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-71566912020-04-19 A Scalable Similarity-Popularity Link Prediction Method Kerrache, Said Alharbi, Ruwayda Benhidour, Hafida Sci Rep Article Link prediction is the task of computing the likelihood that a link exists between two given nodes in a network. With countless applications in different areas of science and engineering, link prediction has received the attention of many researchers working in various disciplines. Considerable research efforts have been invested into the development of increasingly accurate prediction methods. Most of the proposed algorithms, however, have limited use in practice because of their high computational requirements. The aim of this work is to develop a scalable link prediction algorithm that offers a higher overall predictive power than existing methods. The proposed solution falls into the class of global, parameter-free similarity-popularity-based methods, and in it, we assume that network topology is governed by three factors: popularity of the nodes, their similarity and the attraction induced by local neighbourhood. In our approach, popularity and neighbourhood-caused attraction are computed directly from the network topology and factored out by introducing a specific weight map, which is then used to estimate the dissimilarity between non-adjacent nodes through shortest path distances. We show through extensive experimental testing that the proposed method produces highly accurate predictions at a fraction of the computational cost required by existing global methods and at a low additional cost compared to local methods. The scalability of the proposed algorithm is demonstrated on several large networks having hundreds of thousands of nodes. Nature Publishing Group UK 2020-04-14 /pmc/articles/PMC7156691/ /pubmed/32286363 http://dx.doi.org/10.1038/s41598-020-62636-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Kerrache, Said Alharbi, Ruwayda Benhidour, Hafida A Scalable Similarity-Popularity Link Prediction Method |
title | A Scalable Similarity-Popularity Link Prediction Method |
title_full | A Scalable Similarity-Popularity Link Prediction Method |
title_fullStr | A Scalable Similarity-Popularity Link Prediction Method |
title_full_unstemmed | A Scalable Similarity-Popularity Link Prediction Method |
title_short | A Scalable Similarity-Popularity Link Prediction Method |
title_sort | scalable similarity-popularity link prediction method |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7156691/ https://www.ncbi.nlm.nih.gov/pubmed/32286363 http://dx.doi.org/10.1038/s41598-020-62636-1 |
work_keys_str_mv | AT kerrachesaid ascalablesimilaritypopularitylinkpredictionmethod AT alharbiruwayda ascalablesimilaritypopularitylinkpredictionmethod AT benhidourhafida ascalablesimilaritypopularitylinkpredictionmethod AT kerrachesaid scalablesimilaritypopularitylinkpredictionmethod AT alharbiruwayda scalablesimilaritypopularitylinkpredictionmethod AT benhidourhafida scalablesimilaritypopularitylinkpredictionmethod |