Cargando…

Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels

Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display i...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferrari Dacrema, Maurizio, Felicioni, Nicolò, Cremonesi, Paolo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218726/
https://www.ncbi.nlm.nih.gov/pubmed/35754557
http://dx.doi.org/10.3389/fdata.2022.910030
_version_ 1784731955407880192
author Ferrari Dacrema, Maurizio
Felicioni, Nicolò
Cremonesi, Paolo
author_facet Ferrari Dacrema, Maurizio
Felicioni, Nicolò
Cremonesi, Paolo
author_sort Ferrari Dacrema, Maurizio
collection PubMed
description Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display is an active research topic of great industrial interest. In this setting, the overall quality of the recommendations of a new algorithm cannot be assessed by measuring solely its individual recommendation quality. Rather, it should be evaluated in a context where other recommendation lists are already available, to account for how they complement each other. The traditional offline evaluation protocol however does not take this into account. To address this limitation, we propose an offline evaluation protocol for a carousel setting in which the recommendation quality of a model is measured by how much it improves upon that of an already available set of carousels. We also propose to extend ranking metrics to the two-dimensional carousel setting in order to account for a known position bias, i.e., users will not explore the lists sequentially, but rather concentrate on the top-left corner of the screen. Finally, we describe and evaluate two strategies for the ranking of carousels in a scenario where the technique used to generate the two-dimensional layout is agnostic on the algorithms used to generate each carousel. We report experiments on publicly available datasets in the movie domain to show how the relative effectiveness of several recommendation models compares. Our results indicate that under a carousel setting the ranking of the algorithms changes sometimes significantly. Furthermore, when selecting the optimal carousel layout accounting for the two dimensional layout of the user interface leads to very different selections.
format Online
Article
Text
id pubmed-9218726
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92187262022-06-24 Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels Ferrari Dacrema, Maurizio Felicioni, Nicolò Cremonesi, Paolo Front Big Data Big Data Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display is an active research topic of great industrial interest. In this setting, the overall quality of the recommendations of a new algorithm cannot be assessed by measuring solely its individual recommendation quality. Rather, it should be evaluated in a context where other recommendation lists are already available, to account for how they complement each other. The traditional offline evaluation protocol however does not take this into account. To address this limitation, we propose an offline evaluation protocol for a carousel setting in which the recommendation quality of a model is measured by how much it improves upon that of an already available set of carousels. We also propose to extend ranking metrics to the two-dimensional carousel setting in order to account for a known position bias, i.e., users will not explore the lists sequentially, but rather concentrate on the top-left corner of the screen. Finally, we describe and evaluate two strategies for the ranking of carousels in a scenario where the technique used to generate the two-dimensional layout is agnostic on the algorithms used to generate each carousel. We report experiments on publicly available datasets in the movie domain to show how the relative effectiveness of several recommendation models compares. Our results indicate that under a carousel setting the ranking of the algorithms changes sometimes significantly. Furthermore, when selecting the optimal carousel layout accounting for the two dimensional layout of the user interface leads to very different selections. Frontiers Media S.A. 2022-06-09 /pmc/articles/PMC9218726/ /pubmed/35754557 http://dx.doi.org/10.3389/fdata.2022.910030 Text en Copyright © 2022 Ferrari Dacrema, Felicioni and Cremonesi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Ferrari Dacrema, Maurizio
Felicioni, Nicolò
Cremonesi, Paolo
Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title_full Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title_fullStr Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title_full_unstemmed Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title_short Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
title_sort offline evaluation of recommender systems in a user interface with multiple carousels
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218726/
https://www.ncbi.nlm.nih.gov/pubmed/35754557
http://dx.doi.org/10.3389/fdata.2022.910030
work_keys_str_mv AT ferraridacremamaurizio offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels
AT felicioninicolo offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels
AT cremonesipaolo offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels