Cargando…
Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters
The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false po...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071983/ https://www.ncbi.nlm.nih.gov/pubmed/35528350 http://dx.doi.org/10.1155/2022/9248267 |
_version_ | 1784700952092082176 |
---|---|
author | Liu, Xue-Jun Kong, Xiang-Min Zhang, Xiao-Ni Luan, Hai-Ying Yan, Yong Sha, Yun Li, Kai-Li Cao, Xue-Ying Chen, Jian-Ping |
author_facet | Liu, Xue-Jun Kong, Xiang-Min Zhang, Xiao-Ni Luan, Hai-Ying Yan, Yong Sha, Yun Li, Kai-Li Cao, Xue-Ying Chen, Jian-Ping |
author_sort | Liu, Xue-Jun |
collection | PubMed |
description | The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false positive rate and false negative rate. The feature sequencing algorithm can reduce this effect. In order to select the appropriate feature sequencing algorithm for different data sets, this paper proposes an adaptive feature sequencing method based on data set evaluation index parameters. Firstly, the evaluation index system is constructed by the basic information of the data set, the mathematical characteristics of the data set, and the association degree of the data set. Then, the selection model is obtained by the decision tree training with the data label and the evaluation index, and the suitable feature sequencing algorithm is selected. Experiments were conducted on 11 data sets, including Batadal data set, CICIDS 2017, and Mississippi data set. The sequenced data sets are classified by ResNet. The accuracy of the sequenced data sets increases by 2.568% on average in 30 generations, and the average time reduction per epoch is 24.143%. Experiments show that this method can effectively select the feature sequencing algorithm with the best comprehensive performance. |
format | Online Article Text |
id | pubmed-9071983 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-90719832022-05-06 Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters Liu, Xue-Jun Kong, Xiang-Min Zhang, Xiao-Ni Luan, Hai-Ying Yan, Yong Sha, Yun Li, Kai-Li Cao, Xue-Ying Chen, Jian-Ping Comput Intell Neurosci Research Article The industrial control data set has many features and large redundancy, which has a certain impact on the training speed and classification results of the neural network anomaly detection algorithm. However, features are independent of each other, and dimension reduction often increases the false positive rate and false negative rate. The feature sequencing algorithm can reduce this effect. In order to select the appropriate feature sequencing algorithm for different data sets, this paper proposes an adaptive feature sequencing method based on data set evaluation index parameters. Firstly, the evaluation index system is constructed by the basic information of the data set, the mathematical characteristics of the data set, and the association degree of the data set. Then, the selection model is obtained by the decision tree training with the data label and the evaluation index, and the suitable feature sequencing algorithm is selected. Experiments were conducted on 11 data sets, including Batadal data set, CICIDS 2017, and Mississippi data set. The sequenced data sets are classified by ResNet. The accuracy of the sequenced data sets increases by 2.568% on average in 30 generations, and the average time reduction per epoch is 24.143%. Experiments show that this method can effectively select the feature sequencing algorithm with the best comprehensive performance. Hindawi 2022-04-28 /pmc/articles/PMC9071983/ /pubmed/35528350 http://dx.doi.org/10.1155/2022/9248267 Text en Copyright © 2022 Xue-Jun Liu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Liu, Xue-Jun Kong, Xiang-Min Zhang, Xiao-Ni Luan, Hai-Ying Yan, Yong Sha, Yun Li, Kai-Li Cao, Xue-Ying Chen, Jian-Ping Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title | Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title_full | Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title_fullStr | Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title_full_unstemmed | Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title_short | Feature Sequencing Method of Industrial Control Data Set Based on Multidimensional Evaluation Parameters |
title_sort | feature sequencing method of industrial control data set based on multidimensional evaluation parameters |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9071983/ https://www.ncbi.nlm.nih.gov/pubmed/35528350 http://dx.doi.org/10.1155/2022/9248267 |
work_keys_str_mv | AT liuxuejun featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT kongxiangmin featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT zhangxiaoni featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT luanhaiying featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT yanyong featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT shayun featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT likaili featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT caoxueying featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters AT chenjianping featuresequencingmethodofindustrialcontroldatasetbasedonmultidimensionalevaluationparameters |