Cargando…

Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy

Determination of accurate neighbourhood of an active user (a user to whom recommendations are generated) is one of the essential problems that collaborative filtering based recommender systems encounter. Properly adjusted neighbourhood leads to more accurate recommendation generated by a recommender...

Descripción completa

Detalles Bibliográficos
Autor principal: Kużelewska, Urszula
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304038/
http://dx.doi.org/10.1007/978-3-030-50420-5_25
_version_ 1783548185752371200
author Kużelewska, Urszula
author_facet Kużelewska, Urszula
author_sort Kużelewska, Urszula
collection PubMed
description Determination of accurate neighbourhood of an active user (a user to whom recommendations are generated) is one of the essential problems that collaborative filtering based recommender systems encounter. Properly adjusted neighbourhood leads to more accurate recommendation generated by a recommender system. In classical collaborative filtering technique, the neighbourhood is modelled by kNN algorithm, but this approach has poor scalability. Clustering techniques, although improved time efficiency of recommender systems, can negatively affect the quality (precision or accuracy) of recommendations. This article presents a new approach to collaborative filtering recommender systems that focuses on the problem of an active user’s neighbourhood modelling. Instead of one clustering scheme, it works on a set of partitions, therefore it selects the most appropriate one that models the neighbourhood precisely. This article presents the results of the experiments validating the advantage of multi-clustering approach, [Formula: see text], over the traditional methods based on single-scheme clustering. The experiments particularly focus on the effect of great size of datasets concerning overall recommendation performance including accuracy and coverage.
format Online
Article
Text
id pubmed-7304038
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73040382020-06-19 Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy Kużelewska, Urszula Computational Science – ICCS 2020 Article Determination of accurate neighbourhood of an active user (a user to whom recommendations are generated) is one of the essential problems that collaborative filtering based recommender systems encounter. Properly adjusted neighbourhood leads to more accurate recommendation generated by a recommender system. In classical collaborative filtering technique, the neighbourhood is modelled by kNN algorithm, but this approach has poor scalability. Clustering techniques, although improved time efficiency of recommender systems, can negatively affect the quality (precision or accuracy) of recommendations. This article presents a new approach to collaborative filtering recommender systems that focuses on the problem of an active user’s neighbourhood modelling. Instead of one clustering scheme, it works on a set of partitions, therefore it selects the most appropriate one that models the neighbourhood precisely. This article presents the results of the experiments validating the advantage of multi-clustering approach, [Formula: see text], over the traditional methods based on single-scheme clustering. The experiments particularly focus on the effect of great size of datasets concerning overall recommendation performance including accuracy and coverage. 2020-05-22 /pmc/articles/PMC7304038/ http://dx.doi.org/10.1007/978-3-030-50420-5_25 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Kużelewska, Urszula
Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title_full Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title_fullStr Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title_full_unstemmed Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title_short Effect of Dataset Size on Efficiency of Collaborative Filtering Recommender Systems with Multi-clustering as a Neighbourhood Identification Strategy
title_sort effect of dataset size on efficiency of collaborative filtering recommender systems with multi-clustering as a neighbourhood identification strategy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7304038/
http://dx.doi.org/10.1007/978-3-030-50420-5_25
work_keys_str_mv AT kuzelewskaurszula effectofdatasetsizeonefficiencyofcollaborativefilteringrecommendersystemswithmulticlusteringasaneighbourhoodidentificationstrategy