Full text

Turn on search term navigation

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Accurate classification and identification of Internet traffic are crucial for maintaining network security. However, unknown network traffic in the real world can affect the accuracy of current machine learning models, reducing the efficiency of traffic classification. Existing unknown traffic classification algorithms are unable to optimize traffic features and require the entire system to be retrained each time new traffic data are collected. This results in low recognition efficiency, making the algoritms unsuitable for real-time application detection. To solve the above issues, we suggest a multi-feature fusion-based incremental technique for detecting unknown traffic in this paper. The approach employs a multiple-channel parallel architecture to extract temporal and spatial traffic features. It then uses the mRMR algorithm to rank and fuse the features extracted from each channel to overcome the issue of redundant encrypted traffic features. In addition, we combine the density-ratio-based clustering algorithm to identify the unknown traffic features and update the model via incremental learning. The cassifier enables real-time classification of known and unknown traffic by learning newly acquired class knowledge. Our model can identify encrypted unknown Internet traffic with at least 86% accuracy in various scenarios, using the public ISCX-VPN-Tor datasets. Furthermore, it achieves 90% accuracy on the intrusion detection dataset NSL-KDD. In our self-collected dataset from a real-world environment, the accuracy of our model exceeds 96%. This work offers a novel method for identifying unknown network traffic, contributing to the security preservation of network environments.

Details

Title
Unknown Traffic Recognition Based on Multi-Feature Fusion and Incremental Learning
Author
Liu, Junyi 1   VIAFID ORCID Logo  ; Wang, Jiarong 2   VIAFID ORCID Logo  ; Tian, Yan 2 ; Fazhi Qi 3 ; Chen, Gang 2 

 Computing Center, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China; [email protected] (J.L.); [email protected] (T.Y.); [email protected] (F.Q.); [email protected] (G.C.); School of Nuclear Science and Technology, University of Chinese Academy of Sciences, Beijing 100049, China 
 Computing Center, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China; [email protected] (J.L.); [email protected] (T.Y.); [email protected] (F.Q.); [email protected] (G.C.) 
 Computing Center, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China; [email protected] (J.L.); [email protected] (T.Y.); [email protected] (F.Q.); [email protected] (G.C.); China Spallation Neutron Source Science Center, Dongguan 523803, China 
First page
7649
Publication year
2023
Publication date
2023
Publisher
MDPI AG
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2836328172
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.