Abstract

Many times the nodes of a complex network, whether deliberately or not, are aggregated for technical, ethical, legal limitations or privacy reasons. A common example is the geographic position: one may uncover communities in a network of places, or of individuals identified with their typical geographical position, and then aggregate these places into larger entities, such as municipalities, thus obtaining another network. The communities found in the networks obtained at various levels of aggregation may exhibit various degrees of similarity, from full alignment to perfect independence. This is akin to the problem of ecological and atomic fallacies in statistics, or to the Modified Areal Unit Problem in geography.

We identify the class of community detection algorithms most suitable to cope with node aggregation, and develop an index for aggregability, capturing to which extent the aggregation preserves the community structure. We illustrate its relevance on real-world examples (mobile phone and Twitter reply-to networks). Our main message is that any node-partitioning analysis performed on aggregated networks should be interpreted with caution, as the outcome may be strongly influenced by the level of the aggregation.

Details

Title
Measuring the effect of node aggregation on community detection
Author
Gandica, Yérali 1 ; Decuyper, Adeline 2 ; Cloquet, Christophe 3 ; Thomas, Isabelle 2 ; Delvenne, Jean-Charles 1 

 Université catholique de Louvain, Center for Operations Research and Econometrics, Louvain-la-Neuve, Belgium (GRID:grid.7942.8) (ISNI:0000 0001 2294 713X); Université catholique de Louvain, Institute of Information and Communication Technologies, Electronics and Applied Mathematics, Louvain-la-Neuve, Belgium (GRID:grid.7942.8) (ISNI:0000 0001 2294 713X) 
 Université catholique de Louvain, Center for Operations Research and Econometrics, Louvain-la-Neuve, Belgium (GRID:grid.7942.8) (ISNI:0000 0001 2294 713X) 
 Université catholique de Louvain, Center for Operations Research and Econometrics, Louvain-la-Neuve, Belgium (GRID:grid.7942.8) (ISNI:0000 0001 2294 713X); Université catholique de Louvain, Institute of Information and Communication Technologies, Electronics and Applied Mathematics, Louvain-la-Neuve, Belgium (GRID:grid.7942.8) (ISNI:0000 0001 2294 713X); Poppy, Jette, Belgium (GRID:grid.7942.8) 
Pages
6
Publication year
2020
Publication date
2020
Publisher
Springer Nature B.V.
e-ISSN
21931127
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3059109525
Copyright
© The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.