Cargando…

HTTP-level e-commerce data based on server access logs for an online store

Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in real...

Descripción completa

Detalles Bibliográficos
Autores principales: Chodak, Grzegorz, Suchacka, Grażyna, Chawla, Yash
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier B.V. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7540248/
https://www.ncbi.nlm.nih.gov/pubmed/35023998
http://dx.doi.org/10.1016/j.comnet.2020.107589
_version_ 1783591165016145920
author Chodak, Grzegorz
Suchacka, Grażyna
Chawla, Yash
author_facet Chodak, Grzegorz
Suchacka, Grażyna
Chawla, Yash
author_sort Chodak, Grzegorz
collection PubMed
description Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in reality, it is hard to obtain logs from actual online stores and there is no common dataset that can be used across different studies. Moreover, there is a lack of studies exploring Web traffic over a longer period of time, due to the unavailability of long-term data from server logs. The need to develop reliable models of Web traffic, Web user navigation, and e-customer behaviour calls for an up-to-date, large-volume e-commerce dataset on Web traffic. Similarly, AI problems require a sufficient amount of solid, real-life data to train and validate new models and methods. Thus, to meet a demand of a publicly available long-term e-commerce dataset, we collected access log data describing the operation of an online store over a six-month period. Using a program written in the C# language, data were aggregated, transformed, and anonymized. As a result, we release this EClog dataset in CSV format, which covers 183 days of HTTP-level e-commerce traffic. The data will be beneficial for research in many areas, including computer science, data science, management, and sociology.
format Online
Article
Text
id pubmed-7540248
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher The Authors. Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-75402482020-10-08 HTTP-level e-commerce data based on server access logs for an online store Chodak, Grzegorz Suchacka, Grażyna Chawla, Yash Computer Networks Data Article Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in reality, it is hard to obtain logs from actual online stores and there is no common dataset that can be used across different studies. Moreover, there is a lack of studies exploring Web traffic over a longer period of time, due to the unavailability of long-term data from server logs. The need to develop reliable models of Web traffic, Web user navigation, and e-customer behaviour calls for an up-to-date, large-volume e-commerce dataset on Web traffic. Similarly, AI problems require a sufficient amount of solid, real-life data to train and validate new models and methods. Thus, to meet a demand of a publicly available long-term e-commerce dataset, we collected access log data describing the operation of an online store over a six-month period. Using a program written in the C# language, data were aggregated, transformed, and anonymized. As a result, we release this EClog dataset in CSV format, which covers 183 days of HTTP-level e-commerce traffic. The data will be beneficial for research in many areas, including computer science, data science, management, and sociology. The Authors. Published by Elsevier B.V. 2020-12-24 2020-10-07 /pmc/articles/PMC7540248/ /pubmed/35023998 http://dx.doi.org/10.1016/j.comnet.2020.107589 Text en © 2020 The Authors. Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Data Article
Chodak, Grzegorz
Suchacka, Grażyna
Chawla, Yash
HTTP-level e-commerce data based on server access logs for an online store
title HTTP-level e-commerce data based on server access logs for an online store
title_full HTTP-level e-commerce data based on server access logs for an online store
title_fullStr HTTP-level e-commerce data based on server access logs for an online store
title_full_unstemmed HTTP-level e-commerce data based on server access logs for an online store
title_short HTTP-level e-commerce data based on server access logs for an online store
title_sort http-level e-commerce data based on server access logs for an online store
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7540248/
https://www.ncbi.nlm.nih.gov/pubmed/35023998
http://dx.doi.org/10.1016/j.comnet.2020.107589
work_keys_str_mv AT chodakgrzegorz httplevelecommercedatabasedonserveraccesslogsforanonlinestore
AT suchackagrazyna httplevelecommercedatabasedonserveraccesslogsforanonlinestore
AT chawlayash httplevelecommercedatabasedonserveraccesslogsforanonlinestore