Cargando…

Data-Driven Anomaly Detection Approach for Time-Series Streaming Data

Recently, wireless sensor networks (WSNs) have been extensively deployed to monitor environments. Sensor nodes are susceptible to fault generation due to hardware and software failures in harsh environments. Anomaly detection for the time-series streaming data of sensor nodes is a challenging but cr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Minghu, Guo, Jianwen, Li, Xin, Jin, Rui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582627/
https://www.ncbi.nlm.nih.gov/pubmed/33023175
http://dx.doi.org/10.3390/s20195646
_version_ 1783599235013279744
author Zhang, Minghu
Guo, Jianwen
Li, Xin
Jin, Rui
author_facet Zhang, Minghu
Guo, Jianwen
Li, Xin
Jin, Rui
author_sort Zhang, Minghu
collection PubMed
description Recently, wireless sensor networks (WSNs) have been extensively deployed to monitor environments. Sensor nodes are susceptible to fault generation due to hardware and software failures in harsh environments. Anomaly detection for the time-series streaming data of sensor nodes is a challenging but critical fault diagnosis task, particularly in large-scale WSNs. The data-driven approach is becoming essential for the goal of improving the reliability and stability of WSNs. We propose a data-driven anomaly detection approach in this paper, named median filter (MF)-stacked long short-term memory-exponentially weighted moving average (LSTM-EWMA), for time-series status data, including the operating voltage and panel temperature recorded by a sensor node deployed in the field. These status data can be used to diagnose device anomalies. First, a median filter (MF) is introduced as a preprocessor to preprocess obvious anomalies in input data. Then, stacked long short-term memory (LSTM) is employed for prediction. Finally, the exponentially weighted moving average (EWMA) control chart is employed as a detector for recognizing anomalies. We evaluate the proposed approach for the panel temperature and operating voltage of time-series streaming data recorded by wireless node devices deployed in harsh field conditions for environmental monitoring. Extensive experiments were conducted on real time-series status data. The results demonstrate that compared to other approaches, the MF-stacked LSTM-EWMA approach can significantly improve the detection rate (DR) and false rate (FR). The average DR and FR values with the proposed approach are 95.46% and 4.42%, respectively. MF-stacked LSTM-EWMA anomaly detection also achieves a better F(2) score than that achieved by other methods. The proposed approach provides valuable insights for anomaly detection in WSNs by detecting anomalies in the time-series status data recorded by wireless sensor nodes.
format Online
Article
Text
id pubmed-7582627
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75826272020-10-28 Data-Driven Anomaly Detection Approach for Time-Series Streaming Data Zhang, Minghu Guo, Jianwen Li, Xin Jin, Rui Sensors (Basel) Article Recently, wireless sensor networks (WSNs) have been extensively deployed to monitor environments. Sensor nodes are susceptible to fault generation due to hardware and software failures in harsh environments. Anomaly detection for the time-series streaming data of sensor nodes is a challenging but critical fault diagnosis task, particularly in large-scale WSNs. The data-driven approach is becoming essential for the goal of improving the reliability and stability of WSNs. We propose a data-driven anomaly detection approach in this paper, named median filter (MF)-stacked long short-term memory-exponentially weighted moving average (LSTM-EWMA), for time-series status data, including the operating voltage and panel temperature recorded by a sensor node deployed in the field. These status data can be used to diagnose device anomalies. First, a median filter (MF) is introduced as a preprocessor to preprocess obvious anomalies in input data. Then, stacked long short-term memory (LSTM) is employed for prediction. Finally, the exponentially weighted moving average (EWMA) control chart is employed as a detector for recognizing anomalies. We evaluate the proposed approach for the panel temperature and operating voltage of time-series streaming data recorded by wireless node devices deployed in harsh field conditions for environmental monitoring. Extensive experiments were conducted on real time-series status data. The results demonstrate that compared to other approaches, the MF-stacked LSTM-EWMA approach can significantly improve the detection rate (DR) and false rate (FR). The average DR and FR values with the proposed approach are 95.46% and 4.42%, respectively. MF-stacked LSTM-EWMA anomaly detection also achieves a better F(2) score than that achieved by other methods. The proposed approach provides valuable insights for anomaly detection in WSNs by detecting anomalies in the time-series status data recorded by wireless sensor nodes. MDPI 2020-10-02 /pmc/articles/PMC7582627/ /pubmed/33023175 http://dx.doi.org/10.3390/s20195646 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Minghu
Guo, Jianwen
Li, Xin
Jin, Rui
Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title_full Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title_fullStr Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title_full_unstemmed Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title_short Data-Driven Anomaly Detection Approach for Time-Series Streaming Data
title_sort data-driven anomaly detection approach for time-series streaming data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7582627/
https://www.ncbi.nlm.nih.gov/pubmed/33023175
http://dx.doi.org/10.3390/s20195646
work_keys_str_mv AT zhangminghu datadrivenanomalydetectionapproachfortimeseriesstreamingdata
AT guojianwen datadrivenanomalydetectionapproachfortimeseriesstreamingdata
AT lixin datadrivenanomalydetectionapproachfortimeseriesstreamingdata
AT jinrui datadrivenanomalydetectionapproachfortimeseriesstreamingdata