Content area

Abstract

With the continuous expansion of global Internet infrastructure, wide area networks play a crucial role in transmitting traffic between multiple data centers and users worldwide. However, efficient traffic management has become a core challenge due to the high costs of building and maintaining these networks. Traditional traffic engineering methods based on linear programming achieve optimal solutions but suffer from exponential computational complexity growth with network size, making them impractical for real-time applications in large-scale networks. Recent machine learning approaches show promise but still face fundamental limitations in handling complex network constraints and maintaining performance across different network scales. This paper proposes GRL-TE (Graph-based Reinforcement Learning for Traffic Engineering), a novel framework that achieves near-optimal performance while maintaining computational efficiency across diverse network scales. GRL-TE introduces three key innovations: (1) TopoFlowNet, a graph neural network architecture that models WANs as bipartite graphs with edge nodes representing physical links and path nodes representing candidate paths, enabling efficient bidirectional information propagation through GINConv layers while MLP modules handle collaborative relationships among paths serving the same demand; (2) A one-step A2C mechanism specifically designed for TE with immediate reward structure, eliminating the need for future state estimation and significantly simplifying training; (3) Integration of ADMM as a post-processing step to iteratively reduce constraint violations while improving solution quality. Extensive experiments on six real-world WAN topologies ranging from 12 to 1,739 nodes demonstrate that GRL-TE achieves an overall average demand satisfaction rate of 89.36%, outperforming state-of-the-art learning-based methods (Teal: 82.04%, Figret: 82.20%) and the clustering-based NCFlow (76.48%), while providing 3-4 orders of magnitude speedup compared to LP solvers on large-scale networks. The framework maintains robust performance under link failures and meets real-time scheduling requirements for production deployment.

Details

1009240
Title
Graph-based reinforcement learning for software-defined networking traffic engineering
Author
Lu, Jingwen 1 ; Tang, Chaowei 1 ; Ma, Wenyu 1 ; Xing, Wenjuan 2 

 Chongqing University, School of Microelectronics and Communication Engineering, Chongqing, China (GRID:grid.190737.b) (ISNI:0000 0001 0154 0904) 
 Chongqing University, School of Microelectronics and Communication Engineering, Chongqing, China (GRID:grid.190737.b) (ISNI:0000 0001 0154 0904); Beijing Research Institute, China Telecom Corporation Limited, Beijing, China (GRID:grid.506877.b) 
Volume
37
Issue
6
Pages
119
Publication year
2025
Publication date
Aug 2025
Publisher
Springer Nature B.V.
Place of publication
Amsterdam
Country of publication
Netherlands
Publication subject
e-ISSN
13191578
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-21
Milestone dates
2025-06-23 (Registration); 2025-04-10 (Received); 2025-06-21 (Accepted)
Publication history
 
 
   First posting date
21 Jul 2025
ProQuest document ID
3257123389
Document URL
https://www.proquest.com/scholarly-journals/graph-based-reinforcement-learning-software/docview/3257123389/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-10-05
Database
ProQuest One Academic