Content area

Abstract

Triangle enumeration is a foundation brick for solving harder graph problems related to social networks, the Internet and transportation, to name a few applications. This problem is well studied in the theory literature, but remains an open problem with big data. In this paper, we defend the idea of solving triangle enumeration with SQL queries evaluating the steps of a new adaptive algorithm with linear speedup. Such SQL approach provides scalability beyond RAM limits, automatic parallel processing and more importantly: linear speedup as more machines are added. We present theory results and experimental validation showing our solution works well with large graphs analyzed on a parallel cluster with many machines, producing a balanced workload even with highly skewed degree vertices. We consider two types of distributed systems: (1) a parallel DBMS that evaluates SQL queries, and (2) a parallel HPC cluster calling the MPI library (called via Python). Extensive benchmark experiments with large graphs show our SQL solution offers many advantages over MPI and competing graph analytic systems.

Details

Business indexing term
Title
Balanced parallel triangle enumeration with an adaptive algorithm
Author
Farouzi, Abir 1 ; Zhou, Xiantian 2 ; Bellatreche, Ladjel 3 ; Malki, Mimoun 4 ; Ordonez, Carlos 2 

 ISAE-ENSMA, Poitiers, France (GRID:grid.434217.7) (ISNI:0000 0001 2178 9782); Ecole Supérieure en Informatique, Sidi Bel Abbès, Algeria (GRID:grid.434217.7) 
 University of Houston, Houston, USA (GRID:grid.266436.3) (ISNI:0000 0004 1569 9707) 
 ISAE-ENSMA, Poitiers, France (GRID:grid.434217.7) (ISNI:0000 0001 2178 9782) 
 Ecole Supérieure en Informatique, Sidi Bel Abbès, Algeria (GRID:grid.434217.7) 
Publication title
Volume
42
Issue
1
Pages
103-141
Publication year
2024
Publication date
Mar 2024
Publisher
Springer Nature B.V.
Place of publication
New York
Country of publication
Netherlands
ISSN
09268782
e-ISSN
15737578
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2023-07-13
Milestone dates
2023-06-25 (Registration); 2023-06-25 (Accepted)
Publication history
 
 
   First posting date
13 Jul 2023
ProQuest document ID
3255419737
Document URL
https://www.proquest.com/scholarly-journals/balanced-parallel-triangle-enumeration-with/docview/3255419737/se-2?accountid=208611
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
Last updated
2025-09-29
Database
ProQuest One Academic