The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines

Current best practices for the evaluation of search engines do not take duplicate documents into account. Depending on their prevalence, not discounting duplicates during evaluation artificially inflates performance scores and penalizes those whose search systems diligently filter them. Althoug...

Bibliographic Details
Main authors: Fröbe, Maik; Bittner, Jan Philipp; Potthast, Martin; Hagen, Matthias
Format: Online Article Text
Language: English
Published: 2020
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148013/
http://dx.doi.org/10.1007/978-3-030-45442-5_2