Abstract

Cloud environment is a virtual, online, and distributed computing environment that provides users with large-scale services. And cloud monitoring plays an integral role in protecting infrastructures in the cloud environment. Cloud monitoring systems need to closely monitor various KPIs of cloud resources, to accurately detect anomalies. However, due to the complexity and highly dynamic nature of the cloud environment, anomaly detection for these KPIs with various patterns and data quality is a huge challenge, especially those massive unlabeled data. Besides, it’s also difficult to improve the accuracy of the existing anomaly detection methods. To solve these problems, we propose a novel Dynamic Graph Transformer based Parallel Framework (DGT-PF) for efficiently detect system anomalies in cloud infrastructures, which utilizes Transformer with anomaly attention mechanism and Graph Neural Network (GNN) to learn the spatio-temporal features of KPIs to improve the accuracy and timeliness of model anomaly detection. Specifically, we propose an effective dynamic relationship embedding strategy to dynamically learn spatio-temporal features and adaptively generate adjacency matrices, and soft cluster each GNN layer through Diffpooling module. In addition, we also use nonlinear neural network model and AR-MLP model in parallel to obtain better detection accuracy and improve detection performance. The experiment shows that the DGT-PF framework have achieved the highest F1-Score on 5 public datasets, with an average improvement of 21.6% compared to 11 anomaly detection models.

Details

Title
Efficiently localizing system anomalies for cloud infrastructures: a novel Dynamic Graph Transformer based Parallel Framework
Author
He, Hongxia 1 ; Li, Xi 1 ; Chen, Peng 1 ; Chen, Juan 1 ; Liu, Ming 2 ; Wu, Lei 2 

 School of Computer and Software Engineering, Xihua University, Chengdu, China (GRID:grid.412983.5) (ISNI:0000 0000 9427 7895) 
 School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, China (GRID:grid.54549.39) (ISNI:0000 0004 0369 4060); Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China (GRID:grid.54549.39) (ISNI:0000 0004 0369 4060) 
Pages
115
Publication year
2024
Publication date
Dec 2024
Publisher
Springer Nature B.V.
e-ISSN
2192113X
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3064404635
Copyright
© The Author(s) 2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.