Cargando…
A Comparative Study of Data-Driven Models for Travel Destination Characterization
Characterizing items for content-based recommender systems is a challenging task in complex domains such as travel and tourism. In the case of destination recommendation, no feature set can be readily used as a similarity ground truth, which makes it hard to evaluate the quality of destination chara...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9022027/ https://www.ncbi.nlm.nih.gov/pubmed/35464121 http://dx.doi.org/10.3389/fdata.2022.829939 |
_version_ | 1784689986037088256 |
---|---|
author | Dietz, Linus W. Sertkan, Mete Myftija, Saadi Thimbiri Palage, Sameera Neidhardt, Julia Wörndl, Wolfgang |
author_facet | Dietz, Linus W. Sertkan, Mete Myftija, Saadi Thimbiri Palage, Sameera Neidhardt, Julia Wörndl, Wolfgang |
author_sort | Dietz, Linus W. |
collection | PubMed |
description | Characterizing items for content-based recommender systems is a challenging task in complex domains such as travel and tourism. In the case of destination recommendation, no feature set can be readily used as a similarity ground truth, which makes it hard to evaluate the quality of destination characterization approaches. Furthermore, the process should scale well for many items, be cost-efficient, and most importantly correct. To evaluate which data sources are most suitable, we investigate 18 characterization methods that fall into three categories: venue data, textual data, and factual data. We make these data models comparable using rank agreement metrics and reveal which data sources capture similar underlying concepts. To support choosing more suitable data models, we capture a desired concept using an expert survey and evaluate our characterization methods toward it. We find that the textual models to characterize cities perform best overall, with data models based on factual and venue data being less competitive. However, we show that data models with explicit features can be optimized by learning weights for their features. |
format | Online Article Text |
id | pubmed-9022027 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90220272022-04-22 A Comparative Study of Data-Driven Models for Travel Destination Characterization Dietz, Linus W. Sertkan, Mete Myftija, Saadi Thimbiri Palage, Sameera Neidhardt, Julia Wörndl, Wolfgang Front Big Data Big Data Characterizing items for content-based recommender systems is a challenging task in complex domains such as travel and tourism. In the case of destination recommendation, no feature set can be readily used as a similarity ground truth, which makes it hard to evaluate the quality of destination characterization approaches. Furthermore, the process should scale well for many items, be cost-efficient, and most importantly correct. To evaluate which data sources are most suitable, we investigate 18 characterization methods that fall into three categories: venue data, textual data, and factual data. We make these data models comparable using rank agreement metrics and reveal which data sources capture similar underlying concepts. To support choosing more suitable data models, we capture a desired concept using an expert survey and evaluate our characterization methods toward it. We find that the textual models to characterize cities perform best overall, with data models based on factual and venue data being less competitive. However, we show that data models with explicit features can be optimized by learning weights for their features. Frontiers Media S.A. 2022-04-07 /pmc/articles/PMC9022027/ /pubmed/35464121 http://dx.doi.org/10.3389/fdata.2022.829939 Text en Copyright © 2022 Dietz, Sertkan, Myftija, Thimbiri Palage, Neidhardt and Wörndl. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Big Data Dietz, Linus W. Sertkan, Mete Myftija, Saadi Thimbiri Palage, Sameera Neidhardt, Julia Wörndl, Wolfgang A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title | A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title_full | A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title_fullStr | A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title_full_unstemmed | A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title_short | A Comparative Study of Data-Driven Models for Travel Destination Characterization |
title_sort | comparative study of data-driven models for travel destination characterization |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9022027/ https://www.ncbi.nlm.nih.gov/pubmed/35464121 http://dx.doi.org/10.3389/fdata.2022.829939 |
work_keys_str_mv | AT dietzlinusw acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT sertkanmete acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT myftijasaadi acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT thimbiripalagesameera acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT neidhardtjulia acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT worndlwolfgang acomparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT dietzlinusw comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT sertkanmete comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT myftijasaadi comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT thimbiripalagesameera comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT neidhardtjulia comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization AT worndlwolfgang comparativestudyofdatadrivenmodelsfortraveldestinationcharacterization |