Cargando…
HTTP-level e-commerce data based on server access logs for an online store
Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in real...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Authors. Published by Elsevier B.V.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7540248/ https://www.ncbi.nlm.nih.gov/pubmed/35023998 http://dx.doi.org/10.1016/j.comnet.2020.107589 |
_version_ | 1783591165016145920 |
---|---|
author | Chodak, Grzegorz Suchacka, Grażyna Chawla, Yash |
author_facet | Chodak, Grzegorz Suchacka, Grażyna Chawla, Yash |
author_sort | Chodak, Grzegorz |
collection | PubMed |
description | Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in reality, it is hard to obtain logs from actual online stores and there is no common dataset that can be used across different studies. Moreover, there is a lack of studies exploring Web traffic over a longer period of time, due to the unavailability of long-term data from server logs. The need to develop reliable models of Web traffic, Web user navigation, and e-customer behaviour calls for an up-to-date, large-volume e-commerce dataset on Web traffic. Similarly, AI problems require a sufficient amount of solid, real-life data to train and validate new models and methods. Thus, to meet a demand of a publicly available long-term e-commerce dataset, we collected access log data describing the operation of an online store over a six-month period. Using a program written in the C# language, data were aggregated, transformed, and anonymized. As a result, we release this EClog dataset in CSV format, which covers 183 days of HTTP-level e-commerce traffic. The data will be beneficial for research in many areas, including computer science, data science, management, and sociology. |
format | Online Article Text |
id | pubmed-7540248 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | The Authors. Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75402482020-10-08 HTTP-level e-commerce data based on server access logs for an online store Chodak, Grzegorz Suchacka, Grażyna Chawla, Yash Computer Networks Data Article Web server logs have been extensively used as a source of data on the characteristics of Web traffic and users’ navigational patterns. In particular, Web bot detection and online purchase prediction using methods from artificial intelligence (AI) are currently key areas of research. However, in reality, it is hard to obtain logs from actual online stores and there is no common dataset that can be used across different studies. Moreover, there is a lack of studies exploring Web traffic over a longer period of time, due to the unavailability of long-term data from server logs. The need to develop reliable models of Web traffic, Web user navigation, and e-customer behaviour calls for an up-to-date, large-volume e-commerce dataset on Web traffic. Similarly, AI problems require a sufficient amount of solid, real-life data to train and validate new models and methods. Thus, to meet a demand of a publicly available long-term e-commerce dataset, we collected access log data describing the operation of an online store over a six-month period. Using a program written in the C# language, data were aggregated, transformed, and anonymized. As a result, we release this EClog dataset in CSV format, which covers 183 days of HTTP-level e-commerce traffic. The data will be beneficial for research in many areas, including computer science, data science, management, and sociology. The Authors. Published by Elsevier B.V. 2020-12-24 2020-10-07 /pmc/articles/PMC7540248/ /pubmed/35023998 http://dx.doi.org/10.1016/j.comnet.2020.107589 Text en © 2020 The Authors. Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Data Article Chodak, Grzegorz Suchacka, Grażyna Chawla, Yash HTTP-level e-commerce data based on server access logs for an online store |
title | HTTP-level e-commerce data based on server access logs for an online store |
title_full | HTTP-level e-commerce data based on server access logs for an online store |
title_fullStr | HTTP-level e-commerce data based on server access logs for an online store |
title_full_unstemmed | HTTP-level e-commerce data based on server access logs for an online store |
title_short | HTTP-level e-commerce data based on server access logs for an online store |
title_sort | http-level e-commerce data based on server access logs for an online store |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7540248/ https://www.ncbi.nlm.nih.gov/pubmed/35023998 http://dx.doi.org/10.1016/j.comnet.2020.107589 |
work_keys_str_mv | AT chodakgrzegorz httplevelecommercedatabasedonserveraccesslogsforanonlinestore AT suchackagrazyna httplevelecommercedatabasedonserveraccesslogsforanonlinestore AT chawlayash httplevelecommercedatabasedonserveraccesslogsforanonlinestore |