Cargando…
Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors
The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374380/ https://www.ncbi.nlm.nih.gov/pubmed/32635443 http://dx.doi.org/10.3390/s20133745 |
_version_ | 1783561685462679552 |
---|---|
author | Vigoya, Laura Fernandez, Diego Carneiro, Victor Cacheda, Fidel |
author_facet | Vigoya, Laura Fernandez, Diego Carneiro, Victor Cacheda, Fidel |
author_sort | Vigoya, Laura |
collection | PubMed |
description | The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these networks behave and detect traffic anomalies. This work presents DAD: a complete and labeled IoT dataset containing a reproduction of certain real-world behaviors as seen from the network. To approximate the dataset to a real environment, the data were obtained from a physical data center, with temperature sensors based on NFC smart passive sensor technology. Having carried out different approaches, performing mathematical modeling using time series was finally chosen. The virtual infrastructure necessary for the creation of the dataset is formed by five virtual machines, a MQTT broker and four client nodes, each of them with four sensors of the refrigeration units connected to the internal IoT network. DAD presents a seven day network activity with three types of anomalies: duplication, interception and modification on the MQTT message, spread over 5 days. Finally, a feature description is performed, so it can be used for the application of the various techniques of prediction or automatic classification. |
format | Online Article Text |
id | pubmed-7374380 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-73743802020-08-06 Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors Vigoya, Laura Fernandez, Diego Carneiro, Victor Cacheda, Fidel Sensors (Basel) Article The relative simplicity of IoT networks extends service vulnerabilities and possibilities to different network failures exhibiting system weaknesses. Therefore, having a dataset with a sufficient number of samples, labeled and with a systematic analysis, is essential in order to understand how these networks behave and detect traffic anomalies. This work presents DAD: a complete and labeled IoT dataset containing a reproduction of certain real-world behaviors as seen from the network. To approximate the dataset to a real environment, the data were obtained from a physical data center, with temperature sensors based on NFC smart passive sensor technology. Having carried out different approaches, performing mathematical modeling using time series was finally chosen. The virtual infrastructure necessary for the creation of the dataset is formed by five virtual machines, a MQTT broker and four client nodes, each of them with four sensors of the refrigeration units connected to the internal IoT network. DAD presents a seven day network activity with three types of anomalies: duplication, interception and modification on the MQTT message, spread over 5 days. Finally, a feature description is performed, so it can be used for the application of the various techniques of prediction or automatic classification. MDPI 2020-07-04 /pmc/articles/PMC7374380/ /pubmed/32635443 http://dx.doi.org/10.3390/s20133745 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Vigoya, Laura Fernandez, Diego Carneiro, Victor Cacheda, Fidel Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title | Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title_full | Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title_fullStr | Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title_full_unstemmed | Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title_short | Annotated Dataset for Anomaly Detection in a Data Center with IoT Sensors |
title_sort | annotated dataset for anomaly detection in a data center with iot sensors |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374380/ https://www.ncbi.nlm.nih.gov/pubmed/32635443 http://dx.doi.org/10.3390/s20133745 |
work_keys_str_mv | AT vigoyalaura annotateddatasetforanomalydetectioninadatacenterwithiotsensors AT fernandezdiego annotateddatasetforanomalydetectioninadatacenterwithiotsensors AT carneirovictor annotateddatasetforanomalydetectioninadatacenterwithiotsensors AT cachedafidel annotateddatasetforanomalydetectioninadatacenterwithiotsensors |