Cargando…

CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method

With the rapid increase in encrypted traffic in the network environment and the increasing proportion of encrypted traffic, the study of encrypted traffic classification has become increasingly important as a part of traffic analysis. At present, in a closed environment, the classification of encryp...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Xinyi, Gu, Chunxiang, Chen, Yihang, Wei, Fushan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8705865/
https://www.ncbi.nlm.nih.gov/pubmed/34960324
http://dx.doi.org/10.3390/s21248231
_version_ 1784622052434509824
author Hu, Xinyi
Gu, Chunxiang
Chen, Yihang
Wei, Fushan
author_facet Hu, Xinyi
Gu, Chunxiang
Chen, Yihang
Wei, Fushan
author_sort Hu, Xinyi
collection PubMed
description With the rapid increase in encrypted traffic in the network environment and the increasing proportion of encrypted traffic, the study of encrypted traffic classification has become increasingly important as a part of traffic analysis. At present, in a closed environment, the classification of encrypted traffic has been fully studied, but these classification models are often only for labeled data and difficult to apply in real environments. To solve these problems, we propose a transferable model called CBD with generalization abilities for encrypted traffic classification in real environments. The overall structure of CBD can be generally described as a of one-dimension CNN and the encoder of Transformer. The model can be pre-trained with unlabeled data to understand the basic characteristics of encrypted traffic data, and be transferred to other datasets to complete the classification of encrypted traffic from the packet level and the flow level. The performance of the proposed model was evaluated on a public dataset. The results showed that the performance of the CBD model was better than the baseline methods, and the pre-training method can improve the classification ability of the model.
format Online
Article
Text
id pubmed-8705865
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87058652021-12-25 CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method Hu, Xinyi Gu, Chunxiang Chen, Yihang Wei, Fushan Sensors (Basel) Article With the rapid increase in encrypted traffic in the network environment and the increasing proportion of encrypted traffic, the study of encrypted traffic classification has become increasingly important as a part of traffic analysis. At present, in a closed environment, the classification of encrypted traffic has been fully studied, but these classification models are often only for labeled data and difficult to apply in real environments. To solve these problems, we propose a transferable model called CBD with generalization abilities for encrypted traffic classification in real environments. The overall structure of CBD can be generally described as a of one-dimension CNN and the encoder of Transformer. The model can be pre-trained with unlabeled data to understand the basic characteristics of encrypted traffic data, and be transferred to other datasets to complete the classification of encrypted traffic from the packet level and the flow level. The performance of the proposed model was evaluated on a public dataset. The results showed that the performance of the CBD model was better than the baseline methods, and the pre-training method can improve the classification ability of the model. MDPI 2021-12-09 /pmc/articles/PMC8705865/ /pubmed/34960324 http://dx.doi.org/10.3390/s21248231 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Hu, Xinyi
Gu, Chunxiang
Chen, Yihang
Wei, Fushan
CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title_full CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title_fullStr CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title_full_unstemmed CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title_short CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method
title_sort cbd: a deep-learning-based scheme for encrypted traffic classification with a general pre-training method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8705865/
https://www.ncbi.nlm.nih.gov/pubmed/34960324
http://dx.doi.org/10.3390/s21248231
work_keys_str_mv AT huxinyi cbdadeeplearningbasedschemeforencryptedtrafficclassificationwithageneralpretrainingmethod
AT guchunxiang cbdadeeplearningbasedschemeforencryptedtrafficclassificationwithageneralpretrainingmethod
AT chenyihang cbdadeeplearningbasedschemeforencryptedtrafficclassificationwithageneralpretrainingmethod
AT weifushan cbdadeeplearningbasedschemeforencryptedtrafficclassificationwithageneralpretrainingmethod