Cargando…

Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification

Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for represen...

Descripción completa

Detalles Bibliográficos
Autores principales: Ulu, Yasar Baris, Altingovde, Ismail Sengor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148034/
http://dx.doi.org/10.1007/978-3-030-45442-5_51
_version_ 1783520516424859648
author Ulu, Yasar Baris
Altingovde, Ismail Sengor
author_facet Ulu, Yasar Baris
Altingovde, Ismail Sengor
author_sort Ulu, Yasar Baris
collection PubMed
description Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for representing documents to improve diversification, and show a positive result. As a second improvement, we propose to automatically predict the size of candidate set on per query basis. Experimental evaluations using our BM25 runs as well as the best-performing ad hoc runs submitted to TREC (2009–2012) show that our approach improves the performance of implicit diversification up to 5.4% wrt. initial ranking.
format Online
Article
Text
id pubmed-7148034
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71480342020-04-13 Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification Ulu, Yasar Baris Altingovde, Ismail Sengor Advances in Information Retrieval Article Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for representing documents to improve diversification, and show a positive result. As a second improvement, we propose to automatically predict the size of candidate set on per query basis. Experimental evaluations using our BM25 runs as well as the best-performing ad hoc runs submitted to TREC (2009–2012) show that our approach improves the performance of implicit diversification up to 5.4% wrt. initial ranking. 2020-03-24 /pmc/articles/PMC7148034/ http://dx.doi.org/10.1007/978-3-030-45442-5_51 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ulu, Yasar Baris
Altingovde, Ismail Sengor
Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title_full Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title_fullStr Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title_full_unstemmed Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title_short Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
title_sort predicting the size of candidate document set for implicit web search result diversification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148034/
http://dx.doi.org/10.1007/978-3-030-45442-5_51
work_keys_str_mv AT uluyasarbaris predictingthesizeofcandidatedocumentsetforimplicitwebsearchresultdiversification
AT altingovdeismailsengor predictingthesizeofcandidatedocumentsetforimplicitwebsearchresultdiversification