Content area

Abstract

In the pursuit of graph processing performance, graph partitioning, as a crucial preprocessing step, has been widely concerned. Based on an in-depth analysis of Neighbor Expansion (NE) graph partitioning algorithm, we propose Parallel Expansion based on Clustering Coefficient (PECC). Firstly, to address the partition disturbance caused by internal structural changes during the process of vertex neighborhood expansion in the traditional NE algorithm, we perform a formal redefinition of the vertex state during the partitioning process and introduce the concept of clustering coefficient. Then, PECC uses the clustering coefficient as a metric to measure the closeness between vertices and potential partitions. Based on this metric, a novel parallel partitioning strategy in the distributed environment is proposed. This strategy consists of two core steps: the expansion process and the allocation process. Through two steps, PECC can effectively improve the operating efficiency of programs and significantly reduce the partitioning time. In addition, to ensure data consistency during parallel expansion, we adopt a distributed locking engine to solve concurrency management problems. Our evaluations on large real-world graphs show that in many cases, PECC achieves a balance between partitioning quality and computational efficiency. Finally, we show that PECC integrated on GraphX outperforms the built-in native algorithms.

Details

Business indexing term
Title
PECC: parallel expansion based on clustering coefficient for efficient graph partitioning
Author
Shi, Chengcheng 1 ; Xie, Zhenping 1 

 Jiangnan University, School of Artificial Intelligence and Computer Science, Wuxi, China (GRID:grid.258151.a) (ISNI:0000 0001 0708 1323); Jiangnan University, Jiangsu Key University Laboratory of Software and Media Technology Under Human-Computer Cooperation, Wuxi, China (GRID:grid.258151.a) (ISNI:0000 0001 0708 1323) 
Publication title
Volume
42
Issue
4
Pages
447-467
Publication year
2024
Publication date
Dec 2024
Publisher
Springer Nature B.V.
Place of publication
New York
Country of publication
Netherlands
ISSN
09268782
e-ISSN
15737578
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2024-06-10
Milestone dates
2024-05-26 (Registration); 2024-05-26 (Accepted)
Publication history
 
 
   First posting date
10 Jun 2024
ProQuest document ID
3255419858
Document URL
https://www.proquest.com/scholarly-journals/pecc-parallel-expansion-based-on-clustering/docview/3255419858/se-2?accountid=208611
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
Last updated
2025-09-29
Database
ProQuest One Academic