Link-Based Similarity Measures Using Reachability Vectors

We present a novel approach for computing link-based similarities among objects accurately by utilizing the link information pertaining to the objects involved. We discuss the problems with previous link-based similarity measures and propose a novel approach for computing link-based similarities tha...

Bibliographic Details
Main Authors: Yoon, Seok-Ho, Kim, Ji-Soo, Ha, Jiwoon, Kim, Sang-Wook, Ryu, Minsoo, Choi, Ho-Jin
Format: Online Article Text
Language: English
Published: Hindawi Publishing Corporation 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3948467/
https://www.ncbi.nlm.nih.gov/pubmed/24701188
http://dx.doi.org/10.1155/2014/741608
author Yoon, Seok-Ho
Kim, Ji-Soo
Ha, Jiwoon
Kim, Sang-Wook
Ryu, Minsoo
Choi, Ho-Jin
collection PubMed
description We present a novel approach for accurately computing link-based similarities among objects by using the link information pertaining to the objects involved. We discuss the problems with previous link-based similarity measures and propose a novel approach for computing link-based similarities that does not suffer from these problems. In the proposed approach, each target object is represented by a vector. The vector has one element for each object in the given data, and the value of each element denotes the weight for the corresponding object. For this weight, we propose to use the probability of reaching the specific object from the target object, computed using the “Random Walk with Restart” strategy. We then define the similarity between two objects as the cosine similarity of their two vectors. In this paper, we provide examples to show that our approach does not suffer from the aforementioned problems. We also evaluate the performance of the proposed methods in comparison with existing link-based measures, qualitatively and quantitatively, on two kinds of data sets: scientific papers and Web documents. Our experimental results indicate that the proposed methods significantly outperform the existing measures.
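The description above outlines two steps: build a reachability vector for each object with Random Walk with Restart, then compare two objects by the cosine similarity of their vectors. The following is a minimal sketch of that idea, not the authors' implementation. The toy graph, the restart probability of 0.15, the fixed iteration count, the handling of objects without outgoing links, and the function names rwr_vector and cosine are all assumptions made for illustration; this record does not state the paper's actual settings.

```python
import math

def rwr_vector(graph, start, restart=0.15, iters=100):
    """Approximate Random Walk with Restart scores from `start` by power iteration.

    `graph` maps each node to a list of nodes it links to; every linked node
    must also appear as a key. `restart` is an assumed value, not taken from
    the paper.
    """
    nodes = sorted(graph)
    scores = {n: 0.0 for n in nodes}
    scores[start] = 1.0
    for _ in range(iters):
        nxt = {n: 0.0 for n in nodes}
        for n in nodes:
            out = graph[n]
            if out:
                # Spread the non-restarting mass evenly over outgoing links.
                share = (1.0 - restart) * scores[n] / len(out)
                for m in out:
                    nxt[m] += share
            else:
                # Assumption: a walker at a dead end jumps back to the start.
                nxt[start] += (1.0 - restart) * scores[n]
        # With probability `restart` the walker returns to the start node.
        nxt[start] += restart
        scores = nxt
    return [scores[n] for n in nodes]

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

# Hypothetical toy link graph: "a" and "b" link to the same objects,
# while "e" links to an unrelated one.
graph = {
    "a": ["c", "d"],
    "b": ["c", "d"],
    "c": [],
    "d": [],
    "e": ["f"],
    "f": [],
}

va = rwr_vector(graph, "a")
vb = rwr_vector(graph, "b")
ve = rwr_vector(graph, "e")

print("sim(a, b) =", cosine(va, vb))
print("sim(a, e) =", cosine(va, ve))
```

In this toy graph, "a" and "b" reach the same objects and so receive a positive similarity, whereas "a" and "e" reach disjoint sets of objects and their vectors share no nonzero element.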
format Online
Article
Text
id pubmed-3948467
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-39484672014-04-03 Link-Based Similarity Measures Using Reachability Vectors Yoon, Seok-Ho Kim, Ji-Soo Ha, Jiwoon Kim, Sang-Wook Ryu, Minsoo Choi, Ho-Jin ScientificWorldJournal Research Article We present a novel approach for computing link-based similarities among objects accurately by utilizing the link information pertaining to the objects involved. We discuss the problems with previous link-based similarity measures and propose a novel approach for computing link based similarities that does not suffer from these problems. In the proposed approach each target object is represented by a vector. Each element of the vector corresponds to all the objects in the given data, and the value of each element denotes the weight for the corresponding object. As for this weight value, we propose to utilize the probability of reaching from the target object to the specific object, computed using the “Random Walk with Restart” strategy. Then, we define the similarity between two objects as the cosine similarity of the two vectors. In this paper, we provide examples to show that our approach does not suffer from the aforementioned problems. We also evaluate the performance of the proposed methods in comparison with existing link-based measures, qualitatively and quantitatively, with respect to two kinds of data sets, scientific papers and Web documents. Our experimental results indicate that the proposed methods significantly outperform the existing measures. Hindawi Publishing Corporation 2014-02-18 /pmc/articles/PMC3948467/ /pubmed/24701188 http://dx.doi.org/10.1155/2014/741608 Text en Copyright © 2014 Seok-Ho Yoon et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Link-Based Similarity Measures Using Reachability Vectors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3948467/
https://www.ncbi.nlm.nih.gov/pubmed/24701188
http://dx.doi.org/10.1155/2014/741608