Cargando…

XSiteTraj: A cross-site user trajectory dataset

With the development of mobile networks, social networking plays an increasingly important role in people's daily life. User identification, which aims to match user cross-site accounts, has been becoming an important issue for user supervision and recommendation system design in social network...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Jiazheng, Li, Yongjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694039/
http://dx.doi.org/10.1016/j.dib.2023.109783
_version_ 1785153288461615104
author Fu, Jiazheng
Li, Yongjun
author_facet Fu, Jiazheng
Li, Yongjun
author_sort Fu, Jiazheng
collection PubMed
description With the development of mobile networks, social networking plays an increasingly important role in people's daily life. User identification, which aims to match user cross-site accounts, has been becoming an important issue for user supervision and recommendation system design in social networks. At present, many different user identification methods have emerged, such as DPLink, HFUL, etc. However, compared with the continuous development of user identification methods, the open-source work of datasets is very slow, and the datasets of most of the work are not public. The shortage of datasets has greatly hindered the development of this research field. At present, the academic urgently needs a large-scale social network user linkage dataset. In this paper, we publicize a new social network user linkage dataset, XSiteTraj v1.0 [2]. This dataset has good spatio-temporal coverage, including more than 27,000 users and more than one million check-in records from all over the world crawled from Facebook, Foursquare, and Twitter. Our dataset labels the identical users from different social websites, and each check-in record includes a timestamp, point of interest (PoI), and latitude and longitude of PoI. Through our dataset, we can conduct research on user behaviour habits and apply the dataset to the experiments and evaluation of social network user identification and other algorithms.
format Online
Article
Text
id pubmed-10694039
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106940392023-12-05 XSiteTraj: A cross-site user trajectory dataset Fu, Jiazheng Li, Yongjun Data Brief Data Article With the development of mobile networks, social networking plays an increasingly important role in people's daily life. User identification, which aims to match user cross-site accounts, has been becoming an important issue for user supervision and recommendation system design in social networks. At present, many different user identification methods have emerged, such as DPLink, HFUL, etc. However, compared with the continuous development of user identification methods, the open-source work of datasets is very slow, and the datasets of most of the work are not public. The shortage of datasets has greatly hindered the development of this research field. At present, the academic urgently needs a large-scale social network user linkage dataset. In this paper, we publicize a new social network user linkage dataset, XSiteTraj v1.0 [2]. This dataset has good spatio-temporal coverage, including more than 27,000 users and more than one million check-in records from all over the world crawled from Facebook, Foursquare, and Twitter. Our dataset labels the identical users from different social websites, and each check-in record includes a timestamp, point of interest (PoI), and latitude and longitude of PoI. Through our dataset, we can conduct research on user behaviour habits and apply the dataset to the experiments and evaluation of social network user identification and other algorithms. Elsevier 2023-11-07 /pmc/articles/PMC10694039/ http://dx.doi.org/10.1016/j.dib.2023.109783 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Fu, Jiazheng
Li, Yongjun
XSiteTraj: A cross-site user trajectory dataset
title XSiteTraj: A cross-site user trajectory dataset
title_full XSiteTraj: A cross-site user trajectory dataset
title_fullStr XSiteTraj: A cross-site user trajectory dataset
title_full_unstemmed XSiteTraj: A cross-site user trajectory dataset
title_short XSiteTraj: A cross-site user trajectory dataset
title_sort xsitetraj: a cross-site user trajectory dataset
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694039/
http://dx.doi.org/10.1016/j.dib.2023.109783
work_keys_str_mv AT fujiazheng xsitetrajacrosssiteusertrajectorydataset
AT liyongjun xsitetrajacrosssiteusertrajectorydataset