Cargando…

Collection of datasets with DNS over HTTPS traffic

Recently, the Internet has adopted the DNS over HTTPS (DoH) resolution mechanism for privacy-aware network applications. As DoH becomes more disseminated, it has also become a network monitoring research topic. For comprehensive evaluation and comparison of developed classifiers, real-world datasets...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeřábek, Kamil, Hynek, Karel, Čejka, Tomáš, Ryšavý, Ondřej
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9168479/
https://www.ncbi.nlm.nih.gov/pubmed/35677460
http://dx.doi.org/10.1016/j.dib.2022.108310
_version_ 1784721019119861760
author Jeřábek, Kamil
Hynek, Karel
Čejka, Tomáš
Ryšavý, Ondřej
author_facet Jeřábek, Kamil
Hynek, Karel
Čejka, Tomáš
Ryšavý, Ondřej
author_sort Jeřábek, Kamil
collection PubMed
description Recently, the Internet has adopted the DNS over HTTPS (DoH) resolution mechanism for privacy-aware network applications. As DoH becomes more disseminated, it has also become a network monitoring research topic. For comprehensive evaluation and comparison of developed classifiers, real-world datasets are needed, motivating this contribution. We created a new large-scale collection of datasets consisting of two classes of traffic: i) DoH HTTPS communication and ii) non-DoH HTTPS connections. The DoH traffic is captured for multiple DoH providers and clients to include nuances of various DoH implementations and configurations. The non-DoH HTTPS connections complement the DoH communication aiming to include a wide range of existing network applications. The dataset collection consists of network traffic generated in a controlled environment and traffic captured from a real ISP network. The resulting datasets thus provide real-world network traffic data suitable for evaluating existing classifiers and the development of new methods.
format Online
Article
Text
id pubmed-9168479
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91684792022-06-07 Collection of datasets with DNS over HTTPS traffic Jeřábek, Kamil Hynek, Karel Čejka, Tomáš Ryšavý, Ondřej Data Brief Data Article Recently, the Internet has adopted the DNS over HTTPS (DoH) resolution mechanism for privacy-aware network applications. As DoH becomes more disseminated, it has also become a network monitoring research topic. For comprehensive evaluation and comparison of developed classifiers, real-world datasets are needed, motivating this contribution. We created a new large-scale collection of datasets consisting of two classes of traffic: i) DoH HTTPS communication and ii) non-DoH HTTPS connections. The DoH traffic is captured for multiple DoH providers and clients to include nuances of various DoH implementations and configurations. The non-DoH HTTPS connections complement the DoH communication aiming to include a wide range of existing network applications. The dataset collection consists of network traffic generated in a controlled environment and traffic captured from a real ISP network. The resulting datasets thus provide real-world network traffic data suitable for evaluating existing classifiers and the development of new methods. Elsevier 2022-05-27 /pmc/articles/PMC9168479/ /pubmed/35677460 http://dx.doi.org/10.1016/j.dib.2022.108310 Text en © 2022 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Jeřábek, Kamil
Hynek, Karel
Čejka, Tomáš
Ryšavý, Ondřej
Collection of datasets with DNS over HTTPS traffic
title Collection of datasets with DNS over HTTPS traffic
title_full Collection of datasets with DNS over HTTPS traffic
title_fullStr Collection of datasets with DNS over HTTPS traffic
title_full_unstemmed Collection of datasets with DNS over HTTPS traffic
title_short Collection of datasets with DNS over HTTPS traffic
title_sort collection of datasets with dns over https traffic
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9168479/
https://www.ncbi.nlm.nih.gov/pubmed/35677460
http://dx.doi.org/10.1016/j.dib.2022.108310
work_keys_str_mv AT jerabekkamil collectionofdatasetswithdnsoverhttpstraffic
AT hynekkarel collectionofdatasetswithdnsoverhttpstraffic
AT cejkatomas collectionofdatasetswithdnsoverhttpstraffic
AT rysavyondrej collectionofdatasetswithdnsoverhttpstraffic