Cargando…

Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors

The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these...

Descripción completa

Detalles Bibliográficos
Autores principales: Vigoya, Laura, Fernandez, Diego, Carneiro, Victor, Cacheda, Fidel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374380/
https://www.ncbi.nlm.nih.gov/pubmed/32635443
http://dx.doi.org/10.3390/s20133745
_version_ 1783561685462679552
author Vigoya, Laura
Fernandez, Diego
Carneiro, Victor
Cacheda, Fidel
author_facet Vigoya, Laura
Fernandez, Diego
Carneiro, Victor
Cacheda, Fidel
author_sort Vigoya, Laura
collection PubMed
description The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these networks behave and detect traffic anomalies. This work presents DAD: a complete and labeled IoT dataset containing a reproduction of certain real-world behaviors as seen from the network. To approximate the dataset to a real environment, the data were obtained from a physical data center, with temperature sensors based on NFC smart passive sensor technology. Having carried out different approaches, performing mathematical modeling using time series was finally chosen. The virtual infrastructure necessary for the creation of the dataset is formed by five virtual machines, a MQTT broker and four client nodes, each of them with four sensors of the refrigeration units connected to the internal IoT network. DAD presents a seven day network activity with three types of anomalies: duplication, interception and modification on the MQTT message, spread over 5 days. Finally, a feature description is performed, so it can be used for the application of the various techniques of prediction or automatic classification.
format Online
Article
Text
id pubmed-7374380
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73743802020-08-06 Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors Vigoya, Laura Fernandez, Diego Carneiro, Victor Cacheda, Fidel Sensors (Basel) Article The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these networks behave and detect traffic anomalies. This work presents DAD: a complete and labeled IoT dataset containing a reproduction of certain real-world behaviors as seen from the network. To approximate the dataset to a real environment, the data were obtained from a physical data center, with temperature sensors based on NFC smart passive sensor technology. Having carried out different approaches, performing mathematical modeling using time series was finally chosen. The virtual infrastructure necessary for the creation of the dataset is formed by five virtual machines, a MQTT broker and four client nodes, each of them with four sensors of the refrigeration units connected to the internal IoT network. DAD presents a seven day network activity with three types of anomalies: duplication, interception and modification on the MQTT message, spread over 5 days. Finally, a feature description is performed, so it can be used for the application of the various techniques of prediction or automatic classification. MDPI 2020-07-04 /pmc/articles/PMC7374380/ /pubmed/32635443 http://dx.doi.org/10.3390/s20133745 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Vigoya, Laura
Fernandez, Diego
Carneiro, Victor
Cacheda, Fidel
Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title_full Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title_fullStr Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title_full_unstemmed Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title_short Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
title_sort annotated dataset for anomaly detection in a data center with iot sensors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374380/
https://www.ncbi.nlm.nih.gov/pubmed/32635443
http://dx.doi.org/10.3390/s20133745
work_keys_str_mv AT vigoyalaura annotateddatasetforanomalydetectioninadatacenterwithiotsensors
AT fernandezdiego annotateddatasetforanomalydetectioninadatacenterwithiotsensors
AT carneirovictor annotateddatasetforanomalydetectioninadatacenterwithiotsensors
AT cachedafidel annotateddatasetforanomalydetectioninadatacenterwithiotsensors