Cargando…

Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters

The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false po...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xue-Jun, Kong, Xiang-Min, Zhang, Xiao-Ni, Luan, Hai-Ying, Yan, Yong, Sha, Yun, Li, Kai-Li, Cao, Xue-Ying, Chen, Jian-Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071983/
https://www.ncbi.nlm.nih.gov/pubmed/35528350
http://dx.doi.org/10.1155/2022/9248267
_version_ 1784700952092082176
author Liu, Xue-Jun
Kong, Xiang-Min
Zhang, Xiao-Ni
Luan, Hai-Ying
Yan, Yong
Sha, Yun
Li, Kai-Li
Cao, Xue-Ying
Chen, Jian-Ping
author_facet Liu, Xue-Jun
Kong, Xiang-Min
Zhang, Xiao-Ni
Luan, Hai-Ying
Yan, Yong
Sha, Yun
Li, Kai-Li
Cao, Xue-Ying
Chen, Jian-Ping
author_sort Liu, Xue-Jun
collection PubMed
description The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false positive rate and false negative rate. The feature sequencing algorithm can reduce this effect. In order to select the appropriate feature sequencing algorithm for different data sets, this paper proposes an adaptive feature sequencing method based on data set evaluation index parameters. Firstly, the evaluation index system is constructed by the basic information of the data set, the mathematical characteristics of the data set, and the association degree of the data set. Then, the selection model is obtained by the decision tree training with the data label and the evaluation index, and the suitable feature sequencing algorithm is selected. Experiments were conducted on 11 data sets, including Batadal data set, CICIDS 2017, and Mississippi data set. The sequenced data sets are classified by ResNet. The accuracy of the sequenced data sets increases by 2.568% on average in 30 generations, and the average time reduction per epoch is 24.143%. Experiments show that this method can effectively select the feature sequencing algorithm with the best comprehensive performance.
format Online
Article
Text
id pubmed-9071983
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-90719832022-05-06 Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters Liu, Xue-Jun Kong, Xiang-Min Zhang, Xiao-Ni Luan, Hai-Ying Yan, Yong Sha, Yun Li, Kai-Li Cao, Xue-Ying Chen, Jian-Ping Comput Intell Neurosci Research Article The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false positive rate and false negative rate. The feature sequencing algorithm can reduce this effect. In order to select the appropriate feature sequencing algorithm for different data sets, this paper proposes an adaptive feature sequencing method based on data set evaluation index parameters. Firstly, the evaluation index system is constructed by the basic information of the data set, the mathematical characteristics of the data set, and the association degree of the data set. Then, the selection model is obtained by the decision tree training with the data label and the evaluation index, and the suitable feature sequencing algorithm is selected. Experiments were conducted on 11 data sets, including Batadal data set, CICIDS 2017, and Mississippi data set. The sequenced data sets are classified by ResNet. The accuracy of the sequenced data sets increases by 2.568% on average in 30 generations, and the average time reduction per epoch is 24.143%. Experiments show that this method can effectively select the feature sequencing algorithm with the best comprehensive performance. Hindawi 2022-04-28 /pmc/articles/PMC9071983/ /pubmed/35528350 http://dx.doi.org/10.1155/2022/9248267 Text en Copyright © 2022 Xue-Jun Liu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Xue-Jun
Kong, Xiang-Min
Zhang, Xiao-Ni
Luan, Hai-Ying
Yan, Yong
Sha, Yun
Li, Kai-Li
Cao, Xue-Ying
Chen, Jian-Ping
Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title_full Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title_fullStr Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title_full_unstemmed Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title_short Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
title_sort feature sequencing method of industrial control data set based on multidimensional evaluation parameters
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071983/
https://www.ncbi.nlm.nih.gov/pubmed/35528350
http://dx.doi.org/10.1155/2022/9248267
work_keys_str_mv AT liuxuejun featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT kongxiangmin featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT zhangxiaoni featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT luanhaiying featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT yanyong featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT shayun featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT likaili featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT caoxueying featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters
AT chenjianping featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters