Content area
With the increasing number of network security threats and the frequent occurrence of software vulnerability attacks, the effective management and large-scale retrieval of vulnerability data have become urgent needs. Existing vulnerability information is scattered across heterogeneous sources and is difficult to integrate, which in turn makes it hard for security analysts to quickly retrieve and analyze relevant security knowledge. To address this problem, this paper proposes a method to construct a vulnerability knowledge graph by integrating multi-source vulnerability data, combining graph embedding technology with large language model reasoning to aggregate, infer, and enrich vulnerability knowledge. Experiments demonstrated that our domain-tuned Bidirectional Long Short-Term Memory–Conditional Random Field (BiLSTM-CRF) named entity recognition (NER), enhanced with a cybersecurity dictionary, achieved a 90.1% F1-score for entity extraction. For link prediction, a hybrid Graph Attention Network fused with GPT-3 reasoning boosted Hits1 by 0.137, Hits3 by 0.116, and Hits10 by 0.101 over the baseline. These results confirm that our approach markedly enhanced entity identification and relationship inference, yielding a more complete and dynamically updatable cybersecurity knowledge graph.
Details
1 School of Cyber Engineering, Xidian University, Xi’an 710126, China; [email protected] (R.L.); [email protected] (Z.D.); [email protected] (J.H.); [email protected] (C.P.), Key Laboratory of Cyberspace Security, Zhengzhou 450001, China
2 School of Cyber Engineering, Xidian University, Xi’an 710126, China; [email protected] (R.L.); [email protected] (Z.D.); [email protected] (J.H.); [email protected] (C.P.), State Key Laboratory of Integrated Services Networks (ISN), Xi’an 710126, China
3 School of Cyber Engineering, Xidian University, Xi’an 710126, China; [email protected] (R.L.); [email protected] (Z.D.); [email protected] (J.H.); [email protected] (C.P.)
4 Yuanjiang Shengbang Safety Technology Group Co., Beijing 100085, China; [email protected]
5 State Grid Jiangxi Electric Power Research Institute, Nanchang 330052, China; [email protected]