Cargando…

Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry

The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to...

Descripción completa

Detalles Bibliográficos
Autor principal: Topolski, Mariusz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434006/
https://www.ncbi.nlm.nih.gov/pubmed/34502644
http://dx.doi.org/10.3390/s21175753
_version_ 1783751496214511616
author Topolski, Mariusz
author_facet Topolski, Mariusz
author_sort Topolski, Mariusz
collection PubMed
description The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to reduce them. The author’s motivation was to develop a method of assessing the dimensionality of features based on correlation measures and the discriminant power of features allowing for a more accurate reduction of their dimensions compared to the classical Kaiser criterion and assessment of scree plot. The method proved to be promising. The results obtained in the experiments demonstrate that the quality of classification after extraction is better than using classical criteria for estimating the number of components and features. Experiments were carried out for various extraction methods, demonstrating that the rotation of factors according to centroids of a class in this classification task gives the best risk assessment of chemical threats. The classification quality increased by about 7% compared to a model where feature extraction was not used and resulted in an improvement of 4% compared to the classical PCA method with the Kaiser criterion, with an evaluation of the scree plot. Furthermore, it has been shown that there is a certain subspace of cybersecurity features, which complemented with the features of the concentration of volatile substances, affects the risk assessment of chemical hazards. The identified cybersecurity factors are the number of packets lost, incorrect Logins, incorrect sensor responses, increased email spam, and excessive traffic in the computer network. To visualize the speed of classification in real-time, simulations were carried out for various systems used in Industry 4.0.
format Online
Article
Text
id pubmed-8434006
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84340062021-09-12 Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry Topolski, Mariusz Sensors (Basel) Article The features that are used in the classification process are acquired from sensor data on the production site (associated with toxic, physicochemical properties) and also a dataset associated with cybersecurity that may affect the above-mentioned risk. These are large datasets, so it is important to reduce them. The author’s motivation was to develop a method of assessing the dimensionality of features based on correlation measures and the discriminant power of features allowing for a more accurate reduction of their dimensions compared to the classical Kaiser criterion and assessment of scree plot. The method proved to be promising. The results obtained in the experiments demonstrate that the quality of classification after extraction is better than using classical criteria for estimating the number of components and features. Experiments were carried out for various extraction methods, demonstrating that the rotation of factors according to centroids of a class in this classification task gives the best risk assessment of chemical threats. The classification quality increased by about 7% compared to a model where feature extraction was not used and resulted in an improvement of 4% compared to the classical PCA method with the Kaiser criterion, with an evaluation of the scree plot. Furthermore, it has been shown that there is a certain subspace of cybersecurity features, which complemented with the features of the concentration of volatile substances, affects the risk assessment of chemical hazards. The identified cybersecurity factors are the number of packets lost, incorrect Logins, incorrect sensor responses, increased email spam, and excessive traffic in the computer network. To visualize the speed of classification in real-time, simulations were carried out for various systems used in Industry 4.0. MDPI 2021-08-26 /pmc/articles/PMC8434006/ /pubmed/34502644 http://dx.doi.org/10.3390/s21175753 Text en © 2021 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Topolski, Mariusz
Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title_full Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title_fullStr Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title_full_unstemmed Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title_short Application of Feature Extraction Methods for Chemical Risk Classification in the Pharmaceutical Industry
title_sort application of feature extraction methods for chemical risk classification in the pharmaceutical industry
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434006/
https://www.ncbi.nlm.nih.gov/pubmed/34502644
http://dx.doi.org/10.3390/s21175753
work_keys_str_mv AT topolskimariusz applicationoffeatureextractionmethodsforchemicalriskclassificationinthepharmaceuticalindustry