Cargando…

An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies

Detecting railroad station anomalies is a critical task prior to segmentation and making optimization decisions for each cluster. Three types of anomalies (local clustered, axis paralleled, and surrounded by normal instances) caused by the specialty of railroad operations bring the existing methods...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Yuan, Li, Xiaopeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143646/
https://www.ncbi.nlm.nih.gov/pubmed/35632324
http://dx.doi.org/10.3390/s22103915
_version_ 1784715857205657600
author Wang, Yuan
Li, Xiaopeng
author_facet Wang, Yuan
Li, Xiaopeng
author_sort Wang, Yuan
collection PubMed
description Detecting railroad station anomalies is a critical task prior to segmentation and making optimization decisions for each cluster. Three types of anomalies (local clustered, axis paralleled, and surrounded by normal instances) caused by the specialty of railroad operations bring the existing methods non-trivial challenges in detecting them accurately and efficiently. To tackle this limitation of existing methods, this paper proposes a novel anomaly detection method named Huffman Anomaly Detection Forest (HuffForest) to detect station anomalies, which leverages Huffman encoding to measure abnormalities in certain railroad scenarios with high accuracy. The proposed method establishes a Huffman forest by constructing trees from the perspective of data points and subsequently computes anomaly scores of instances considering both local and global information. A sampling-based version is also developed to improve scalability for large datasets. Taking advantage of the encoding mechanism, the proposed method can effectively recognize the underlying patterns of railroad stations and detect outliers in various complicated scenarios where the conventional methods are not reliable. Experiment results on both synthesized and public benchmarks are demonstrated to show the advances of the proposed method compared to the state-of-the-art isolation forest (iForest) and local outlier factor (LOF) methods on detection accuracy with an acceptable computational complexity.
format Online
Article
Text
id pubmed-9143646
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91436462022-05-29 An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies Wang, Yuan Li, Xiaopeng Sensors (Basel) Article Detecting railroad station anomalies is a critical task prior to segmentation and making optimization decisions for each cluster. Three types of anomalies (local clustered, axis paralleled, and surrounded by normal instances) caused by the specialty of railroad operations bring the existing methods non-trivial challenges in detecting them accurately and efficiently. To tackle this limitation of existing methods, this paper proposes a novel anomaly detection method named Huffman Anomaly Detection Forest (HuffForest) to detect station anomalies, which leverages Huffman encoding to measure abnormalities in certain railroad scenarios with high accuracy. The proposed method establishes a Huffman forest by constructing trees from the perspective of data points and subsequently computes anomaly scores of instances considering both local and global information. A sampling-based version is also developed to improve scalability for large datasets. Taking advantage of the encoding mechanism, the proposed method can effectively recognize the underlying patterns of railroad stations and detect outliers in various complicated scenarios where the conventional methods are not reliable. Experiment results on both synthesized and public benchmarks are demonstrated to show the advances of the proposed method compared to the state-of-the-art isolation forest (iForest) and local outlier factor (LOF) methods on detection accuracy with an acceptable computational complexity. MDPI 2022-05-22 /pmc/articles/PMC9143646/ /pubmed/35632324 http://dx.doi.org/10.3390/s22103915 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Yuan
Li, Xiaopeng
An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title_full An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title_fullStr An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title_full_unstemmed An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title_short An Innovative Huffman Forest-Based Method to Detected Railroad Station Anomalies
title_sort innovative huffman forest-based method to detected railroad station anomalies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9143646/
https://www.ncbi.nlm.nih.gov/pubmed/35632324
http://dx.doi.org/10.3390/s22103915
work_keys_str_mv AT wangyuan aninnovativehuffmanforestbasedmethodtodetectedrailroadstationanomalies
AT lixiaopeng aninnovativehuffmanforestbasedmethodtodetectedrailroadstationanomalies
AT wangyuan innovativehuffmanforestbasedmethodtodetectedrailroadstationanomalies
AT lixiaopeng innovativehuffmanforestbasedmethodtodetectedrailroadstationanomalies