
Using Entropy in Web Usage Data Preprocessing

The paper examines the use of entropy in web usage mining. Entropy offers an alternative way of determining the ratio of auxiliary pages for session identification using the Reference Length method. The experiment was conducted on two different web por...


Bibliographic Details
Main authors: Munk, Michal, Benko, Lubomir
Format: Online Article Text
Language: English
Published: MDPI 2018
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512266/
https://www.ncbi.nlm.nih.gov/pubmed/33265164
http://dx.doi.org/10.3390/e20010067
collection PubMed
description The paper examines the use of entropy in web usage mining. Entropy offers an alternative way of determining the ratio of auxiliary pages for session identification using the Reference Length method. The experiment was conducted on two different web portals. The first log file was obtained from a course in a virtual learning environment web portal. The second log file came from a web portal with anonymous access. A comparison of the entropy-based estimate and the sitemap-based estimate of the ratio of auxiliary pages showed that, in the case of sitemap abundance, entropy could be a full-valued substitute for estimating the ratio of auxiliary pages.
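The record does not spell out how the Reference Length method uses the ratio of auxiliary pages. As a rough sketch only, the commonly cited formulation of the method assumes that time spent on auxiliary pages is exponentially distributed and derives a cutoff time from the ratio and the mean page-view time. The function names, the sample values, and the use of this formulation are assumptions made for illustration; they are not taken from the paper, and the entropy-based estimate of the ratio itself is not reproduced here.

```python
import math


def reference_length_cutoff(mean_reference_length: float, auxiliary_ratio: float) -> float:
    """Cutoff time separating auxiliary pages from content pages.

    Assumes (hypothetically, for this sketch) an exponential distribution of
    time spent on auxiliary pages, with rate = 1 / mean_reference_length.
    auxiliary_ratio is the estimated share of auxiliary pages, however it was
    obtained (sitemap-based or entropy-based).
    """
    if mean_reference_length <= 0:
        raise ValueError("mean_reference_length must be positive")
    if not 0 < auxiliary_ratio < 1:
        raise ValueError("auxiliary_ratio must lie strictly between 0 and 1")
    rate = 1.0 / mean_reference_length
    return -math.log(1.0 - auxiliary_ratio) / rate


def classify_page(time_on_page: float, cutoff: float) -> str:
    """Label a page view as 'auxiliary' (at or below the cutoff) or 'content'."""
    return "auxiliary" if time_on_page <= cutoff else "content"


# Illustrative values only: a 60-second mean page-view time and a 50% ratio.
cutoff = reference_length_cutoff(mean_reference_length=60.0, auxiliary_ratio=0.5)
labels = [classify_page(t, cutoff) for t in (10.0, 45.0, 120.0)]
```

Under this reading, a session would end at the first page view whose duration exceeds the cutoff, so a different estimate of the ratio shifts the cutoff and therefore the session boundaries.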
id pubmed-7512266
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-7512266 2020-11-09 Entropy (Basel) Article MDPI 2018-01-22 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).