Cargando…
Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434006/ https://www.ncbi.nlm.nih.gov/pubmed/34502644 http://dx.doi.org/10.3390/s21175753 |
_version_ | 1783751496214511616 |
---|---|
author | Topolski, Mariusz |
author_facet | Topolski, Mariusz |
author_sort | Topolski, Mariusz |
collection | PubMed |
description | The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to reduce them. The author’s motivation was to develop a method of assessing the dimensionality of features based on correlation measures and the discriminant power of features allowing for a more accurate reduction of their dimensions compared to the classical Kaiser criterion and assessment of scree plot. The method proved to be promising. The results obtained in the experiments demonstrate that the quality of classification after extraction is better than using classical criteria for estimating the number of components and features. Experiments were carried out for various extraction methods, demonstrating that the rotation of factors according to centroids of a class in this classification task gives the best risk assessment of chemical threats. The classification quality increased by about 7% compared to a model where feature extraction was not used and resulted in an improvement of 4% compared to the classical PCA method with the Kaiser criterion, with an evaluation of the scree plot. Furthermore, it has been shown that there is a certain subspace of cybersecurity features, which complemented with the features of the concentration of volatile substances, affects the risk assessment of chemical hazards. The identified cybersecurity factors are the number of packets lost, incorrect Logins, incorrect sensor responses, increased email spam, and excessive traffic in the computer network. To visualize the speed of classification in real-time, simulations were carried out for various systems used in Industry 4.0. |
format | Online Article Text |
id | pubmed-8434006 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84340062021-09-12 Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry Topolski, Mariusz Sensors (Basel) Article The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to reduce them. The author’s motivation was to develop a method of assessing the dimensionality of features based on correlation measures and the discriminant power of features allowing for a more accurate reduction of their dimensions compared to the classical Kaiser criterion and assessment of scree plot. The method proved to be promising. The results obtained in the experiments demonstrate that the quality of classification after extraction is better than using classical criteria for estimating the number of components and features. Experiments were carried out for various extraction methods, demonstrating that the rotation of factors according to centroids of a class in this classification task gives the best risk assessment of chemical threats. The classification quality increased by about 7% compared to a model where feature extraction was not used and resulted in an improvement of 4% compared to the classical PCA method with the Kaiser criterion, with an evaluation of the scree plot. Furthermore, it has been shown that there is a certain subspace of cybersecurity features, which complemented with the features of the concentration of volatile substances, affects the risk assessment of chemical hazards. The identified cybersecurity factors are the number of packets lost, incorrect Logins, incorrect sensor responses, increased email spam, and excessive traffic in the computer network. To visualize the speed of classification in real-time, simulations were carried out for various systems used in Industry 4.0. MDPI 2021-08-26 /pmc/articles/PMC8434006/ /pubmed/34502644 http://dx.doi.org/10.3390/s21175753 Text en © 2021 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Topolski, Mariusz Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title | Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title_full | Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title_fullStr | Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title_full_unstemmed | Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title_short | Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry |
title_sort | application of feature extraction methods for chemical risk classification in the pharmaceutical industry |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434006/ https://www.ncbi.nlm.nih.gov/pubmed/34502644 http://dx.doi.org/10.3390/s21175753 |
work_keys_str_mv | AT topolskimariusz applicationoffeatureextractionmethodsforchemicalriskclassificationinthepharmaceuticalindustry |