Cargando…

Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods

In the field of Cyber-Physical Systems (CPS), there is a large number of machine learning methods, and their intrinsic hyper-parameters are hugely varied. Since no agreed-on datasets for CPS exist, developers of new algorithms are forced to define their own benchmarks. This leads to a large number o...

Descripción completa

Detalles Bibliográficos
Autores principales: Zimmering, Bernd, Niggemann, Oliver, Hasterok, Constanze, Pfannstiel, Erik, Ramming, Dario, Pfrommer, Julius
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037210/
https://www.ncbi.nlm.nih.gov/pubmed/33808459
http://dx.doi.org/10.3390/s21072397
_version_ 1783677090140258304
author Zimmering, Bernd
Niggemann, Oliver
Hasterok, Constanze
Pfannstiel, Erik
Ramming, Dario
Pfrommer, Julius
author_facet Zimmering, Bernd
Niggemann, Oliver
Hasterok, Constanze
Pfannstiel, Erik
Ramming, Dario
Pfrommer, Julius
author_sort Zimmering, Bernd
collection PubMed
description In the field of Cyber-Physical Systems (CPS), there is a large number of machine learning methods, and their intrinsic hyper-parameters are hugely varied. Since no agreed-on datasets for CPS exist, developers of new algorithms are forced to define their own benchmarks. This leads to a large number of algorithms each claiming benefits over other approaches but lacking a fair comparison. To tackle this problem, this paper defines a novel model for a generation process of data, similar to that found in CPS. The model is based on well-understood system theory and allows many datasets with different characteristics in terms of complexity to be generated. The data will pave the way for a comparison of selected machine learning methods in the exemplary field of unsupervised learning. Based on the synthetic CPS data, the data generation process is evaluated by analyzing the performance of the methods of the Self-Organizing Map, One-Class Support Vector Machine and Long Short-Term Memory Neural Net in anomaly detection.
format Online
Article
Text
id pubmed-8037210
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80372102021-04-12 Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods Zimmering, Bernd Niggemann, Oliver Hasterok, Constanze Pfannstiel, Erik Ramming, Dario Pfrommer, Julius Sensors (Basel) Article In the field of Cyber-Physical Systems (CPS), there is a large number of machine learning methods, and their intrinsic hyper-parameters are hugely varied. Since no agreed-on datasets for CPS exist, developers of new algorithms are forced to define their own benchmarks. This leads to a large number of algorithms each claiming benefits over other approaches but lacking a fair comparison. To tackle this problem, this paper defines a novel model for a generation process of data, similar to that found in CPS. The model is based on well-understood system theory and allows many datasets with different characteristics in terms of complexity to be generated. The data will pave the way for a comparison of selected machine learning methods in the exemplary field of unsupervised learning. Based on the synthetic CPS data, the data generation process is evaluated by analyzing the performance of the methods of the Self-Organizing Map, One-Class Support Vector Machine and Long Short-Term Memory Neural Net in anomaly detection. MDPI 2021-03-30 /pmc/articles/PMC8037210/ /pubmed/33808459 http://dx.doi.org/10.3390/s21072397 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zimmering, Bernd
Niggemann, Oliver
Hasterok, Constanze
Pfannstiel, Erik
Ramming, Dario
Pfrommer, Julius
Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title_full Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title_fullStr Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title_full_unstemmed Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title_short Generating Artificial Sensor Data for the Comparison of Unsupervised Machine Learning Methods
title_sort generating artificial sensor data for the comparison of unsupervised machine learning methods
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8037210/
https://www.ncbi.nlm.nih.gov/pubmed/33808459
http://dx.doi.org/10.3390/s21072397
work_keys_str_mv AT zimmeringbernd generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods
AT niggemannoliver generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods
AT hasterokconstanze generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods
AT pfannstielerik generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods
AT rammingdario generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods
AT pfrommerjulius generatingartificialsensordataforthecomparisonofunsupervisedmachinelearningmethods