Cargando…
Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels
Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display i...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218726/ https://www.ncbi.nlm.nih.gov/pubmed/35754557 http://dx.doi.org/10.3389/fdata.2022.910030 |
_version_ | 1784731955407880192 |
---|---|
author | Ferrari Dacrema, Maurizio Felicioni, Nicolò Cremonesi, Paolo |
author_facet | Ferrari Dacrema, Maurizio Felicioni, Nicolò Cremonesi, Paolo |
author_sort | Ferrari Dacrema, Maurizio |
collection | PubMed |
description | Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display is an active research topic of great industrial interest. In this setting, the overall quality of the recommendations of a new algorithm cannot be assessed by measuring solely its individual recommendation quality. Rather, it should be evaluated in a context where other recommendation lists are already available, to account for how they complement each other. The traditional offline evaluation protocol however does not take this into account. To address this limitation, we propose an offline evaluation protocol for a carousel setting in which the recommendation quality of a model is measured by how much it improves upon that of an already available set of carousels. We also propose to extend ranking metrics to the two-dimensional carousel setting in order to account for a known position bias, i.e., users will not explore the lists sequentially, but rather concentrate on the top-left corner of the screen. Finally, we describe and evaluate two strategies for the ranking of carousels in a scenario where the technique used to generate the two-dimensional layout is agnostic on the algorithms used to generate each carousel. We report experiments on publicly available datasets in the movie domain to show how the relative effectiveness of several recommendation models compares. Our results indicate that under a carousel setting the ranking of the algorithms changes sometimes significantly. Furthermore, when selecting the optimal carousel layout accounting for the two dimensional layout of the user interface leads to very different selections. |
format | Online Article Text |
id | pubmed-9218726 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92187262022-06-24 Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels Ferrari Dacrema, Maurizio Felicioni, Nicolò Cremonesi, Paolo Front Big Data Big Data Many video-on-demand and music streaming services provide the user with a page consisting of several recommendation lists, i.e., widgets or swipeable carousels, each built with specific criteria (e.g., most recent, TV series, etc.). Finding efficient strategies to select which carousels to display is an active research topic of great industrial interest. In this setting, the overall quality of the recommendations of a new algorithm cannot be assessed by measuring solely its individual recommendation quality. Rather, it should be evaluated in a context where other recommendation lists are already available, to account for how they complement each other. The traditional offline evaluation protocol however does not take this into account. To address this limitation, we propose an offline evaluation protocol for a carousel setting in which the recommendation quality of a model is measured by how much it improves upon that of an already available set of carousels. We also propose to extend ranking metrics to the two-dimensional carousel setting in order to account for a known position bias, i.e., users will not explore the lists sequentially, but rather concentrate on the top-left corner of the screen. Finally, we describe and evaluate two strategies for the ranking of carousels in a scenario where the technique used to generate the two-dimensional layout is agnostic on the algorithms used to generate each carousel. We report experiments on publicly available datasets in the movie domain to show how the relative effectiveness of several recommendation models compares. Our results indicate that under a carousel setting the ranking of the algorithms changes sometimes significantly. Furthermore, when selecting the optimal carousel layout accounting for the two dimensional layout of the user interface leads to very different selections. Frontiers Media S.A. 2022-06-09 /pmc/articles/PMC9218726/ /pubmed/35754557 http://dx.doi.org/10.3389/fdata.2022.910030 Text en Copyright © 2022 Ferrari Dacrema, Felicioni and Cremonesi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Ferrari Dacrema, Maurizio Felicioni, Nicolò Cremonesi, Paolo Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title | Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title_full | Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title_fullStr | Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title_full_unstemmed | Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title_short | Offline Evaluation of Recommender Systems in a User Interface With Multiple Carousels |
title_sort | offline evaluation of recommender systems in a user interface with multiple carousels |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9218726/ https://www.ncbi.nlm.nih.gov/pubmed/35754557 http://dx.doi.org/10.3389/fdata.2022.910030 |
work_keys_str_mv | AT ferraridacremamaurizio offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels AT felicioninicolo offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels AT cremonesipaolo offlineevaluationofrecommendersystemsinauserinterfacewithmultiplecarousels |