Cargando…

An iterative approach for the global estimation of sentence similarity

Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kajiwara, Tomoyuki, Bollegala, Danushka, Yoshida, Yuichi, Kawarabayashi, Ken-ichi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5595307/ https://www.ncbi.nlm.nih.gov/pubmed/28898242 http://dx.doi.org/10.1371/journal.pone.0180885

_version_	1783263348045905920
author	Kajiwara, Tomoyuki Bollegala, Danushka Yoshida, Yuichi Kawarabayashi, Ken-ichi
author_facet	Kajiwara, Tomoyuki Bollegala, Danushka Yoshida, Yuichi Kawarabayashi, Ken-ichi
author_sort	Kajiwara, Tomoyuki
collection	PubMed
description	Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts.
format	Online Article Text
id	pubmed-5595307
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-55953072017-09-15 An iterative approach for the global estimation of sentence similarity Kajiwara, Tomoyuki Bollegala, Danushka Yoshida, Yuichi Kawarabayashi, Ken-ichi PLoS One Research Article Measuring the similarity between two sentences is often difficult due to their small lexical overlap. Instead of focusing on the sets of features in two given sentences between which we must measure similarity, we propose a sentence similarity method that considers two types of constraints that must be satisfied by all pairs of sentences in a given corpus. Namely, (a) if two sentences share many features in common, then it is likely that the remaining features in each sentence are also related, and (b) if two sentences contain many related features, then those two sentences are themselves similar. The two constraints are utilized in an iterative bootstrapping procedure that simultaneously updates both word and sentence similarity scores. Experimental results on SemEval 2015 Task 2 dataset show that the proposed iterative approach for measuring sentence semantic similarity is significantly better than the non-iterative counterparts. Public Library of Science 2017-09-12 /pmc/articles/PMC5595307/ /pubmed/28898242 http://dx.doi.org/10.1371/journal.pone.0180885 Text en © 2017 Kajiwara et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Kajiwara, Tomoyuki Bollegala, Danushka Yoshida, Yuichi Kawarabayashi, Ken-ichi An iterative approach for the global estimation of sentence similarity
title	An iterative approach for the global estimation of sentence similarity
title_full	An iterative approach for the global estimation of sentence similarity
title_fullStr	An iterative approach for the global estimation of sentence similarity
title_full_unstemmed	An iterative approach for the global estimation of sentence similarity
title_short	An iterative approach for the global estimation of sentence similarity
title_sort	iterative approach for the global estimation of sentence similarity
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5595307/ https://www.ncbi.nlm.nih.gov/pubmed/28898242 http://dx.doi.org/10.1371/journal.pone.0180885
work_keys_str_mv	AT kajiwaratomoyuki aniterativeapproachfortheglobalestimationofsentencesimilarity AT bollegaladanushka aniterativeapproachfortheglobalestimationofsentencesimilarity AT yoshidayuichi aniterativeapproachfortheglobalestimationofsentencesimilarity AT kawarabayashikenichi aniterativeapproachfortheglobalestimationofsentencesimilarity AT kajiwaratomoyuki iterativeapproachfortheglobalestimationofsentencesimilarity AT bollegaladanushka iterativeapproachfortheglobalestimationofsentencesimilarity AT yoshidayuichi iterativeapproachfortheglobalestimationofsentencesimilarity AT kawarabayashikenichi iterativeapproachfortheglobalestimationofsentencesimilarity

An iterative approach for the global estimation of sentence similarity

Ejemplares similares