Cargando…

Unsupervised Outlier Detection in IOT Using Deep VAE

The Internet of Things (IoT) refers to a system of interconnected, internet-connected devices and sensors that allows the collection and dissemination of data. The data provided by these sensors may include outliers or exhibit anomalous behavior as a result of attack activities or device failure, fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Gouda, Walaa, Tahir, Sidra, Alanazi, Saad, Almufareh, Maram, Alwakid, Ghadah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460757/
https://www.ncbi.nlm.nih.gov/pubmed/36081083
http://dx.doi.org/10.3390/s22176617
_version_ 1784786825689169920
author Gouda, Walaa
Tahir, Sidra
Alanazi, Saad
Almufareh, Maram
Alwakid, Ghadah
author_facet Gouda, Walaa
Tahir, Sidra
Alanazi, Saad
Almufareh, Maram
Alwakid, Ghadah
author_sort Gouda, Walaa
collection PubMed
description The Internet of Things (IoT) refers to a system of interconnected, internet-connected devices and sensors that allows the collection and dissemination of data. The data provided by these sensors may include outliers or exhibit anomalous behavior as a result of attack activities or device failure, for example. However, the majority of existing outlier detection algorithms rely on labeled data, which is frequently hard to obtain in the IoT domain. More crucially, the IoT’s data volume is continually increasing, necessitating the requirement for predicting and identifying the classes of future data. In this study, we propose an unsupervised technique based on a deep Variational Auto-Encoder (VAE) to detect outliers in IoT data by leveraging the characteristic of the reconstruction ability and the low-dimensional representation of the input data’s latent variables of the VAE. First, the input data are standardized. Then, we employ the VAE to find a reconstructed output representation from the low-dimensional representation of the latent variables of the input data. Finally, the reconstruction error between the original observation and the reconstructed one is used as an outlier score. Our model was trained only using normal data with no labels in an unsupervised manner and evaluated using Statlog (Landsat Satellite) dataset. The unsupervised model achieved promising and comparable results with the state-of-the-art outlier detection schemes with a precision of ≈90% and an F1 score of 79%.
format Online
Article
Text
id pubmed-9460757
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94607572022-09-10 Unsupervised Outlier Detection in IOT Using Deep VAE Gouda, Walaa Tahir, Sidra Alanazi, Saad Almufareh, Maram Alwakid, Ghadah Sensors (Basel) Article The Internet of Things (IoT) refers to a system of interconnected, internet-connected devices and sensors that allows the collection and dissemination of data. The data provided by these sensors may include outliers or exhibit anomalous behavior as a result of attack activities or device failure, for example. However, the majority of existing outlier detection algorithms rely on labeled data, which is frequently hard to obtain in the IoT domain. More crucially, the IoT’s data volume is continually increasing, necessitating the requirement for predicting and identifying the classes of future data. In this study, we propose an unsupervised technique based on a deep Variational Auto-Encoder (VAE) to detect outliers in IoT data by leveraging the characteristic of the reconstruction ability and the low-dimensional representation of the input data’s latent variables of the VAE. First, the input data are standardized. Then, we employ the VAE to find a reconstructed output representation from the low-dimensional representation of the latent variables of the input data. Finally, the reconstruction error between the original observation and the reconstructed one is used as an outlier score. Our model was trained only using normal data with no labels in an unsupervised manner and evaluated using Statlog (Landsat Satellite) dataset. The unsupervised model achieved promising and comparable results with the state-of-the-art outlier detection schemes with a precision of ≈90% and an F1 score of 79%. MDPI 2022-09-01 /pmc/articles/PMC9460757/ /pubmed/36081083 http://dx.doi.org/10.3390/s22176617 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gouda, Walaa
Tahir, Sidra
Alanazi, Saad
Almufareh, Maram
Alwakid, Ghadah
Unsupervised Outlier Detection in IOT Using Deep VAE
title Unsupervised Outlier Detection in IOT Using Deep VAE
title_full Unsupervised Outlier Detection in IOT Using Deep VAE
title_fullStr Unsupervised Outlier Detection in IOT Using Deep VAE
title_full_unstemmed Unsupervised Outlier Detection in IOT Using Deep VAE
title_short Unsupervised Outlier Detection in IOT Using Deep VAE
title_sort unsupervised outlier detection in iot using deep vae
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460757/
https://www.ncbi.nlm.nih.gov/pubmed/36081083
http://dx.doi.org/10.3390/s22176617
work_keys_str_mv AT goudawalaa unsupervisedoutlierdetectioniniotusingdeepvae
AT tahirsidra unsupervisedoutlierdetectioniniotusingdeepvae
AT alanazisaad unsupervisedoutlierdetectioniniotusingdeepvae
AT almufarehmaram unsupervisedoutlierdetectioniniotusingdeepvae
AT alwakidghadah unsupervisedoutlierdetectioniniotusingdeepvae