Cargando…
Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification
Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for represen...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148034/ http://dx.doi.org/10.1007/978-3-030-45442-5_51 |
_version_ | 1783520516424859648 |
---|---|
author | Ulu, Yasar Baris Altingovde, Ismail Sengor |
author_facet | Ulu, Yasar Baris Altingovde, Ismail Sengor |
author_sort | Ulu, Yasar Baris |
collection | PubMed |
description | Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for representing documents to improve diversification, and show a positive result. As a second improvement, we propose to automatically predict the size of candidate set on per query basis. Experimental evaluations using our BM25 runs as well as the best-performing ad hoc runs submitted to TREC (2009–2012) show that our approach improves the performance of implicit diversification up to 5.4% wrt. initial ranking. |
format | Online Article Text |
id | pubmed-7148034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-71480342020-04-13 Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification Ulu, Yasar Baris Altingovde, Ismail Sengor Advances in Information Retrieval Article Implicit result diversification methods exploit the content of the documents in the candidate set, i.e., the initial retrieval results of a query, to obtain a relevant and diverse ranking. As our first contribution, we explore whether recently introduced word embeddings can be exploited for representing documents to improve diversification, and show a positive result. As a second improvement, we propose to automatically predict the size of candidate set on per query basis. Experimental evaluations using our BM25 runs as well as the best-performing ad hoc runs submitted to TREC (2009–2012) show that our approach improves the performance of implicit diversification up to 5.4% wrt. initial ranking. 2020-03-24 /pmc/articles/PMC7148034/ http://dx.doi.org/10.1007/978-3-030-45442-5_51 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Ulu, Yasar Baris Altingovde, Ismail Sengor Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title | Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title_full | Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title_fullStr | Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title_full_unstemmed | Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title_short | Predicting the Size of Candidate Document Set for Implicit Web Search Result Diversification |
title_sort | predicting the size of candidate document set for implicit web search result diversification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148034/ http://dx.doi.org/10.1007/978-3-030-45442-5_51 |
work_keys_str_mv | AT uluyasarbaris predictingthesizeofcandidatedocumentsetforimplicitwebsearchresultdiversification AT altingovdeismailsengor predictingthesizeofcandidatedocumentsetforimplicitwebsearchresultdiversification |