This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
Social networks display significant applications in people’s social lives, such as the online social platforms and the communications [1, 2]. The key or the sensitive nodes play important roles in information diffusion. Detecting the vital nodes is a fundamental problem due to its wide application in many areas. For example, the propagation process of news and ideas can be accelerated by the key individuals in networks [3].
A variety of methods to measure the key nodes had been proposed in recent years. The topological location based methods were presented by many researches, such as degree centrality (DC) [4], betweenness centrality (BC) [5], closeness centrality [6], eigenvector centrality [7], and structure hole [8]. Degree centrality is one of the most basic topology attributes with low computational complexity. It is thought that a node with higher degree has much more influence than the node with lower degree. However, the weak connection of nodes proved that the high degree centrality of the node does not always mean the important of it. For example, Granovetter [9] had researched the topic of how people find jobs and discovered that information about jobs that led to employment was more likely to come from the weak connection with acquaintances than from closer friends. Structure hole method, which is somewhat close to the strength of weak ties, searches the bridge joints which are channels spanned by a group of indirectly connected nodes. Betweenness and closeness centralities identify the node’s influences in the global scope. Unfortunately, both of them lost efficiency in large scale networks because of the huge computing complexity in detecting shortest paths between each pair of nodes.
Therefore, some other centrality indexes have been proposed to make up the defects of those measurements. Kitsak and his cooperators [10] found that the most influential nodes are not the ones with the largest degree but the ones located at the core of the networks with the largest
In order to find the more effective measure to evaluate the importance of the node, it is needed to capture the natures of the node as much as possible. We take together both the local triangle structures and the degrees of nodes to evaluate the importance of them. The experiments of the presented method and the other seven indexes are achieved on real-world data and synthetic networks, and the results show that the proposed method in this paper is effective and accurate. It is proved that the connections among the node’s neighbors and its degree centrality are reasonable factors to evaluate the influence of nodes.
The arrangement of this paper is as follows: Introduction of literatures review and the research problem are given in Section 1. The motivation, the proposed method, and the example analysis are presented in Section 2. The datasets including real networks and synthetic networks are presented in Section 3. In experiments on real data in Section 4 the superior performances are displayed which are verified by the discriminability, the correctness of ranks, and the most influenced nodes. The experimental results of synthetic networks are arranged in Section 5. And, finally, the conclusions and discussions are given in Section 6.
2. The Local Triangle Structure Centrality Method
In this paper, an undirected and simple network without weight on edges or nodes is denoted by
2.1. The Motivation
In order to find a more effective measure to evaluate the node’s importance, it is needed to capture as many as possible natures of the node. The topology fundamentally affects the dynamics of the networks [21]; therefore, it is important to measure the node’s influence abilities by the local or global information. For example, the clustering coefficient of a node is usually employed as the local structures to evaluate the importance of the node. The topological connections among the node’s neighbors indicate the tendency of them to form triangle structures. Once a triangle is formed, the three nodes of this triad would exchange information with each other. Obviously, the ratio of the triangle structures of the node’s neighbors plays an important role in the information propagation. During the process of information dissemination, two adjacent nodes
In reality, during the epidemic or information spreading, the degree and the triangle structures of neighbors of the infected node play important roles. The bigger degree of a node means more neighbors. The higher the percentage of triangle structures formed between a node and its neighbors is, the more likely the neighbors infect each other during the spreading. Therefore, the centrality of a node should combine the node’s degree and the triangles of the node with its neighbors.
We are motivated by the semilocal centrality methods, the local centrality, and the local structure centrality [14, 20] and combine both the local triangle structures and the degree of the node to evaluate the importance of it.
The inner-outer spreading ability of a node is measured by a linear combination of the local triangle structures and the degrees information of its neighbors, and the sum of the neighbors’ inner and outer spreading ability of the node is defined as the local triangle structure centrality.
2.2. The Method
In networks, the information usually is spread through a node to its neighbors and to the neighbors’ neighbors and so on. So the spreading ability of the node heavily depends on the degree and its neighbors’ structures. The more the triangle structures formed among nodes themselves and their neighbors are, the more likely the node locates in the dense part of the network. The number of triangles and clustering coefficients are not trivially related [22] in networks, where larger density of triangles does not imply the high clustering coefficients. Hence, to avoid the same flaw as the definition of the clustering coefficient and some indexes, the proportion of triangles (TP) instead of the number of triangles is considered as an index to measure how close the nodes neighbors connected to each other.
Therefore, we partition the spreading ability into two parts: one is the inner spreading and the other is the outer spreading. The inner spreading ability is contributed by the degree and the TP which measures how close the nodes neighbors are and reflects the location of a node in a network. The outer spreading ability is measured by the degrees of its neighbors and the degree of itself.
Based on above analysis together with the inner and the outer spreading ability, an inner-outer spreading ability of node
By definition of the inner-outer spreading ability, a new index to indicate the importance of nodes in a network is defined as the local triangle structures centrality, shortly
The algorithm pseudocodes of computing the nodes’ LTSC values are presented in Table 1.
Table 1
Computing the node’s LTSC value.
Algorithm |
Input: |
Output: |
|
Step 1: Calculate |
Step 2: |
Step 3: Calculate |
Step 4: Calculate |
Step 5: Calculate |
Step 6: Compute |
The computational complexity for calculating the percentage triangle structures of a node is
2.3. Example for the LTSC Method
As an example, Figure 1 shows that nodes 7 and 13 are two neighbors of node 6. There are 2 triangles between node 7 and its neighbors more than node 13 and its neighbors. And the spreading abilities, calculated by the SIR model [10], of the two nodes 7 and 13 are 2.28 and 1.61, respectively, which coincide with the number of triangles of nodes. Meanwhile the clustering coefficients of nodes 7 and 13 are 0.333 and 1, respectively. At this point, the clustering coefficients of nodes do not imply the dense edge of their neighbors. On the other hand, nodes 1 and 6 have the same degrees and local centralities, and their spreading abilities are 2.87 and 3.0, respectively. Seen in this way, the number of the node’s neighbors, even, and the number of next-nearest neighbors may not be enough to measure the spreading ability of it. It is reasonable to consider the global topological connections. So, we compare the different values of eight indexes on the example of Figure 1.
[figure omitted; refer to PDF]The spreading ability and the centrality or importance is calculated by DC, BC, H-index, K-shell, S-shell, LC, LSC, and the method we proposed. The centrality values are shown in Table 2. Table 2 shows the values of different measures in Figure 1. DC, H-index, and K-shell encounter the same problem that many nodes are assigned the same centrality value. Some nodes get larger BC value while the spreading influences of them are relatively small. The values of LC show better performance comparing with DC and others. The index’s values of the node are positively correlated with the node’s spreading ability. While node 1 and node 6 have the same LC value, the spreading values are quite different. LSC and LTSC take the topology connections into consideration, so they present positive linear correlation trends with the real spreading ability. LSC and LTSC show similar values, and the latter shows better performance because of the percentage of triangle considered.
Table 2
The centralities with 8 different measures on the network of Figure 1.
Nodes | Sprd | | | | | | | | |
---|---|---|---|---|---|---|---|---|---|
1 | 3.874 | 6 | 218 | 2 | 2 | 3 | 118 | 35.62 | 52.5 |
2 | 3.127 | 4 | 74 | 2 | 2 | 3 | 79 | 20.81 | 31.417 |
3 | 3.063 | 4 | 74 | 2 | 2 | 3 | 79 | 20.81 | 31.417 |
4 | 2.128 | 1 | 0 | 1 | 1 | 2 | 44 | 11 | 15.25 |
5 | 2.541 | 2 | 180 | 2 | 2 | 3 | 56 | 13.82 | 19.25 |
6 | 3.999 | 6 | 218 | 2 | 2 | 5 | 118 | 36.52 | 56.833 |
7 | 3.278 | 4 | 72.5 | 2 | 2 | 5 | 87 | 21.86 | 37.667 |
8 | 3.276 | 4 | 72.5 | 2 | 2 | 5 | 87 | 21.86 | 37.667 |
9 | 2.598 | 2 | 180 | 2 | 2 | 3 | 56 | 14.32 | 19.25 |
10 | 2.175 | 1 | 0 | 1 | 1 | 2 | 44 | 11.5 | 15.25 |
11 | 2.574 | 2 | 0 | 2 | 2 | 3 | 63 | 15.62 | 23.333 |
12 | 2.585 | 2 | 0 | 2 | 2 | 3 | 63 | 15.62 | 23.333 |
13 | 2.613 | 2 | 0 | 2 | 2 | 4 | 63 | 16.22 | 23.333 |
14 | 2.650 | 2 | 0 | 2 | 2 | 4 | 63 | 16.22 | 23.333 |
15 | 2.194 | 2 | 182 | 2 | 2 | 3 | 29 | 9.14 | 12 |
16 | 2.191 | 2 | 182 | 2 | 2 | 3 | 29 | 9.24 | 12 |
17 | 1.886 | 1 | 0 | 1 | 1 | 1 | 25 | 7.57 | 11.167 |
18 | 2.262 | 2 | 0 | 2 | 2 | 3 | 38 | 10.74 | 17.25 |
19 | 2.298 | 2 | 0 | 2 | 2 | 3 | 38 | 10.74 | 17.25 |
20 | 2.305 | 2 | 0 | 2 | 2 | 3 | 38 | 10.74 | 17.25 |
21 | 2.286 | 2 | 0 | 2 | 2 | 3 | 38 | 10.74 | 17.25 |
22 | 1.874 | 1 | 0 | 1 | 1 | 1 | 25 | 7.57 | 11.167 |
23 | 2.452 | 2 | 0 | 2 | 2 | 5 | 42 | 11.34 | 21.583 |
24 | 2.711 | 3 | 0.5 | 2 | 2 | 5 | 51 | 14.66 | 27.5 |
25 | 2.440 | 2 | 0 | 2 | 2 | 5 | 42 | 11.34 | 21.583 |
26 | 2.443 | 2 | 0 | 2 | 2 | 5 | 42 | 11.34 | 21.583 |
27 | 2.709 | 3 | 0.5 | 2 | 2 | 5 | 51 | 14.66 | 27.5 |
28 | 2.435 | 2 | 0 | 2 | 2 | 5 | 42 | 11.34 | 21.583 |
Sprd is the node’s spreading ability got by the SIR model.
3. Dataset for Experiments
The performance of LTSC will be evaluated on both real-world data and synthetic networks in the following text. The real-world data contains six real-world networks presented by some researches, and the synthetic networks are generated by the famous BA model and LFR model.
3.1. The Real-World Data
Six real-world networks are chosen in the following discussion, including Karate club network (Karate) [23], Polbook network (Polbook) [24], Coauthorship network of scientists (Netscience) [25], Email network of the University of Rovira Virgili (Email) [26], Football [27], Western States Power Grid (PowerGrid) [28]. The basic topological features of these networks are shown in Table 3, where
Table 3
Some statistical properties of real networks.
Networks | | | | | |
---|---|---|---|---|---|
Karate | 34 | 78 | 17 | 4.5882 | 0.5706 |
Polbook | 105 | 441 | 25 | 8.4 | 0.4875 |
Football | 115 | 613 | 12 | 10.66 | 0.4032 |
Netscience | 379 | 914 | 34 | 4.8232 | 0.7412 |
1133 | 5451 | 71 | 9.623 | 0.2219 | |
PowerGrid | 4941 | 6594 | 19 | 2.6691 | 0.0801 |
Zachary’s Karate club is a well-known social network of a university Karate club described by Zachary. The network became a popular example of community structure in networks after its use by Michelle Girvan and Mark Newman in 2002 with a paper “Community Structure in Social and Biological Networks”. The political books dataset was compiled by Valid Krebs. The nodes represent 105 books about US politics sold by online seller Amazon.com. The edges present the frequent copurchasing of books by the same buyers. Books can be divided according to the attitude into 3 categories, which are conservative, liberal, and neutral. The Football network represents football games in Division IA colleges. Each node represents a football team and the edge connects the two teams representing games between the two teams during the season. There are 115 teams and 613 games. The teams are divided into 12 “conferences” containing teams around 8 to 12. The Netscience network is a coauthorship network of scientists working on network theory and experiment, compiled by Newman [25]. Here, in this paper, the biggest component with obvious community structures is chosen as the Netscience network with 379 nodes and 914 edges. The Email network was established by professor Alexandre Arenas. It describes the email interchanges between members of the Univerisity of Rovira i Virgili (Tarragona). The PowerGrid network is a well-known network showing the scale-free phenomenon, representing the topology of the Western States Power Grid of the United States. The 4941 nodes and 6594 edges represent the substations and transmission lines between the substations, respectively. The national grid is concerned with simple geography dictating that it is always just a few transmission lines from collapse.
3.2. The Synthetic Data
The synthetic networks include networks generated by the Barabasi-Albert (BA) network model [29] and Lancichinetti-Fortunato-Radicchi (LFR) network model [30]. We also use SIR model for examining the performance of our proposed method.
BA model is an algorithm for generating random scale-free networks using a preferential attachment mechanism. Several natural and human-made systems, including the Internet, WWW, citation networks, and some social networks, are thought to be approximately scale-free and certainly contain few nodes with unusually high degree as compared to the other nodes of the network. The BA model tries to explain the existence of such nodes in real networks. The algorithm is named for its inventors Albert Barab
The other kind of synthetic networks is the widely used Lancichinetti-Fortunato-Radicchi (LFR) network model [30]. LFR is an algorithm that generates benchmark networks that resemble real-world networks. They have a priori known communities and are used to compare different community detection methods. LFR assumes that both the degree and the community size have power-law distributions with different exponents,
4. The Method of Experiments on Real-World Networks
The feasibility and effectiveness using LTSC to rank the importance of nodes in networks are empirically evaluated through a series of experiments. The experiments on LTSC are compared with other seven well-known measures from the aspects of discriminability, correctness, and so on. These measures include degree centrality (DC), betweenness centrality (BC), H-index method (H-index), K-shell (Ks), S-shell (Ss), local centrality (LC), and local structure centrality (LSC) [20]. DC, H-index, Ks, and Ss of a node heavily depended on the nearest neighbors of it. BC evaluates the node’s influence in a global scope. LC of a node is defined by computing the number of neighbors and the neighbors’ neighbors of the node. LSC investigates the impact of the topological connections among neighbors. Meanwhile LTSC is to calculate the number of neighbors together with the ratio of triangles of the node. Comparing to LSC and LC, LTSC takes the percentage of triangles (TP) into consideration instead of clustering coefficient to measure the topological connections; LTSC might display better performance.
The comparisons of LTSC with the other seven centrality indexes are analyzed by SIR model in discriminability, correctness of the most influential nodes, and the performances of measures which are evaluated by SI model at the end of this section.
4.1. Discriminability
In general, if nodes
The CCDF curves by the LTSC and the other seven indexes on Karate, Polbook, Netscience, and Email are plotted in Figure 2. Two reasons of the four networks are chosen to verify the discriminability: One is that the size of nodes and edges of the four networks are from tens to thousands of nodes. And the other reason is that there are obvious communities and central nodes in them. LTSC achieves a better rank distribution performance compared with the other indexes. It shows a slowdown trend in the four panels and has similar variant behaviors with LC, LSC, and BC. DC, H-index, and K-shell show poor performances because they only depend on the nearest neighbors without the information of the connections and the neighbors’ of the neighbors. BC, LC, and LSC evaluate nodes with more information, so the three indexes together with LTSC show better performance in distinguishing the influence capability in Karate network. The four indexes agree with each other from the first to the 15th nodes in the rank. BC lost its efficiency from the 21st to the 34th rank. LTSC is better than LC from the 15th to the 29th rank, though some evaluations of LSC are in the overhead LTSC and LC. And all indexes do not distinguish the 30th to the 34th rank, because the 30th to the 34th rank have much similar structures nodes [23].
[figures omitted; refer to PDF]
It might be the reason that LC, LSC, and LTSC are defined by the local topological connections among the node’s neighbors. In Polbook and Email networks, the CCDF curves of LTSC, LSC, LC, and BC almost coincide with each other except a few nodes on BC. In Netscience network, LTSC and LSC almost overlap from the first to the 268th rank, and LC is a little inferior from the first to 250th rank. LTSC, LSC, and LC show slower slope and more distinct ranks. BC lost efficacy about the 100th to the last rank. The clustering coefficient of Netscience network is 0.7412 more than the other three networks. That is to say, most of nodes’ neighbors form triangles with the node which enlarge the clustering coefficient. LTSC and LSC are defined by the triangle structures of a node and its neighbors. Thus, LTSC and LSC achieve better ranks.
The interval
In fact, there is a nature of the CCDF curves in Figure 2: a good discriminability curve of CCDF is almost the curve of
4.2. Correctness of Two Ranks
The rank of nodes in networks generated by an effective method should be as same as the real spreading process. The susceptible-infectious-recovered (SIR) model [10] is employed to simulate the real spreading process. Then, the spreading ability measured by LTSC index is appropriated to be verified by SIR model.
In the SIR model, each node belongs to one of the three states: susceptible (S), infected (I), and recovered (R). In the SIR process, the node to be investigated would be in state I and all other nodes are set to be in state S at the initial time. Then, at each time step, the infected nodes will infect their susceptible neighbors with a probability of
The numerical simulations on SIR model are repeated 10000 times if the number of nodes in networks is no more than 100. Otherwise, the simulations are repeated 1000 times. The spreading probability
Kendall’s rank correlation coefficient
The value of
Table 4
Kendall’s
Networks | | | | | | | | | | |
---|---|---|---|---|---|---|---|---|---|---|
Karata | 0.129 | 0.13 | 0.7540 | 0.6522 | 0.6902 | 0.6904 | 0.8278 | 0.8585 | 0.8245 | 0.9001 |
Polbook | 0.0838 | 0.09 | 0.7814 | 0.3669 | 0.7588 | 0.7652 | 0.7787 | 0.9017 | 0.8743 | 0.9222 |
Football | 0.0932 | 0.10 | 0.7514 | 0.2659 | 0.5413 | 0.1319 | 0.1319 | 0.7803 | 0.7053 | 0.7937 |
Netscience | 0.1247 | 0.13 | 0.6256 | 0.3956 | 0.6061 | 0.5683 | 0.7547 | 0.8127 | 0.8288 | 0.9013 |
0.0535 | 0.06 | 0.7913 | 0.6308 | 0.7711 | 0.8201 | 0.8628 | 0.9249 | 0.9119 | 0.9341 | |
PowerGrid | 0.2583 | 0.26 | 0.5899 | 0.4183 | 0.4982 | 0.5044 | 0.6452 | 0.7814 | 0.7764 | 0.8077 |
In Table 4,
[figures omitted; refer to PDF]
In Figure 3(a), when
For the Polbook (see Figure 3(b)), the two curves of LTSC and LC are growing close to each other with
In Netscience, the clustering coefficients of nodes are larger than the other networks; that means the connections among the node’s neighbors are dense. It is proper that the topological structure is considered to evaluate the spreading ability. Even though both of the average degree and clustering coefficients of Power network are the smallest, LTSC takes the percentage of triangle structure into consideration which confirms its effectiveness in ranking the spreading ability of nodes. That is the reason of LSC and LTSC achieving better performances in a wide range of
The most weird thing in the six panels is the Email network, shown in Figure 3(e). The curves of DC, BC,H-index, Ks, and Ss decrease at first and then gradually increase. However, the trends of the three indexes, LTSC, LSC, and LC, show opposite behaviors. LTSC shows the best performance near the threshold
4.3. Correctness of the Most Influential Nodes
The rank of the most influential nodes is more significant in many real applications. People are more interested in the most influential spreader [32]. So the rank of the most influential nodes deserves to be discussed in detail, besides Kendall’s
Another measurement we introduce to explore the correlation of two ranks is the rank similarity function [33]. The rank similarity function
If we study the most influential nodes,
We calculate the rank similarity function values of LTSC in six real networks and compare them with the other seven indexes, DC, BC, LSC, H-index, Ks, Ss, and LC.
[figures omitted; refer to PDF]
[figures omitted; refer to PDF]
The values of
In the Football network, shown as Figures 5(a)–5(c), LTSC displays the best performance of the top 5 nodes, same as the case in the top
LTSC shows almost the best performance in the top 10 except three nodes of Email and PowerGrid networks, as shown in Figures 5(d) and 5(g). In Email network, LTSC exhibits nearly the best of all the nodes in interval of the rank from the 10th to the
Figures 4 and 5 show the good performance of LTSC, especially in evaluating the most influence nodes of networks. LTSC has absolutely superiority in the accuracy and the efficiency of the most top nodes compared with DC, H-index, and BC on entire range of
By comparing
4.4. Evaluating LTSC by the SI Model
Besides the SIR model, we also use the standard SI model [34] to check the performance of our proposed LTSC method. All the nodes belong to two states in the SI model: susceptible(S) and infected (I). At each time step, a node in state I infects its susceptible neighbors with probability
[figures omitted; refer to PDF]
LTSC shows better performance than the other indexes in the above experiments, because LTSC takes the percentage of triangles and the number of neighbors of the node into consideration. The spreading ability heavily depends on the number of the neighbors when the spreading probability is smaller. Meanwhile the evaluation of spreading ability concerned not only the network structures, but also the local neighbor’s structures when the spreading probability is increasing. Comparing with LC and LSC, LTSC employs the percentage of triangle structures rather than the cluster coefficient to denote the topological structure relations among the nodes. LTSC achieves its best performance when the spreading probability is around the threshold
5. Experimental Results of Synthetic Networks
Besides the real networks, we also investigate the performance of LTSC method on synthetic networks. The classical synthetic networks are Barabasi-Albert (BA) model [29] and the Lancichinetti-Fortunato-Radicchi (LFR) model [30].
In this section, the BA model is generated with the preferential attachment mechanism: Initially, there are
In the BA model, the preferential attachment mechanism makes the node’s degrees in the network nearly equal; K-shell method may fail to rank nodes. Hence, we compare LTSC with the other six indexes except Ks method with the SIR model in Figure 7. For the fixed
[figures omitted; refer to PDF]
By Figures 7(a)–7(c), for the given size of nodes, it is easy to find that the performance of LC, LSC, and LTSC is better than DC, H-index, and BC methods on a wide range of
In Figures 7(d)–7(f), for the fixed
Figure 7 shows that the proposed LTSC method performs better than the other five indexes when
Another kind of synthetic networks used to check the performance of the LTSC method is Lancichinetti-Fortunato-Radicchi (LFR) network model [30].
We fix some parameters, such as the maximum degree and the maximum and minimum community size which are set 50, 50, and 10, respectively. The other parameters in the LFR network model are set as follows: The number of nodes of networks
The LFR networks generated by the above parameters are denoted by
[figures omitted; refer to PDF]
Overall, in the nine panels of Figure 8, the performance of LC, LSC, and LTSC is better than the other three in a wide range of
Comparing the three panels of left column, Figures 8(a), 8(d), and 8(g), the trends of the three tau curves are almost steady. It is showed that the mixing parameter
All in all, the experiments on LFR networks show that LTSC method does perform the best when the influence probabilities are around the spreading threshold. Comparing the performance between LSC and our proposed LTSC method, we are convinced that the percentage of triangle structures is effective in measuring the local structural information among the neighbors of a node. It confirms that the local triangle structures together with the degrees among neighbors play an important role in measuring the spreading ability of a node.
6. Conclusion
Identifying the most influential nodes is a fundamental problem due to its practical application in many areas, such as information dissemination and epidemic spread control. Thus, constructing efficient methods to detect the influential nodes is a valuable thing.
In this paper, we focus on the problem of finding influential nodes based on the local triangle structures and the degree of the node and present a centrality measure by considering both the neighbors connections and the degree of the node. This centrality definition leverages the proportion of triangle structures instead of local clustering coefficient to quantify the structural characteristics among the neighbors of the node. The property of triangle structures can measure how close its neighbors are connected to each other. The higher the proportion is, the more the connections among the neighbors are. And this can also reflect the location of a node in a network: the higher the proportion is, the more likely it is to be in a dense part of the node.
To evaluate the performance of the proposed LTSC method, we apply LTSC method on both synthetic and real networks. We compared LTSC with other seven centrality measures in terms of CCDF and find that LTSC is effective in assignment of distinct ranks to nodes with different spreading capabilities. Further, the SIR model is employed to simulate the real spreading process. By Kendall’s tau rank correlation coefficient, we compute the rank correlation between the two ranks generated by the SIR model and the one of centrality measures. The results demonstrate that LTSC method is better correlated with the real spreading process and outperforms the other local and semilocal methods in evaluating the node’s influence in most cases. The SI model also shows the effectiveness of LTSC. Furthermore, other comprehensive experiments also demonstrate that LTSC is more accurate in the identifying the most influential nodes.
Finally, we conduct the experiments on synthetic networks, the BA mode, and the LFR network model in scale-free networks with different sizes and different community size. The results show that LTSC method performs better than the other centrality measures in evaluating the influence of the node. As further work, it will be necessary to find which measures work better in which type of networks.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
We show our great appreciation to all the authors who collected and shared the data, such as Karate, Dolphins, Football, Polbook, Email, Netscience, and Facebook, and the methods, such as degree, betweenness, H-index, local centrality, local structure centrality, K-shell, and S-shell, to be used as benchmarks. Finally, we would like to acknowledge the National Natural Science Foundation of China (No. 71471106) that supports this research.
[1] G. Caldarelli, A. Vespignani, Large scale structure and dynamics of complex networks, vol. 2,DOI: 10.1142/6455, 2007.
[2] O. Diekmann, J. A. P. Heesterbeek, Mathematical Epidemiology of Infectious Diseases, Model Building, Analysis and Interpretation, 2000.
[3] L. Lü, D. Chen, T. Zhou, "The small world yields the most effective information spreading," New Journal of Physics, vol. 13,DOI: 10.1088/1367-2630/13/12/123005, 2011.
[4] L. C. Freeman, "Centrality in social networks conceptual clarification," Social Networks, vol. 1 no. 3, pp. 215-239, DOI: 10.1016/0378-8733(78)90021-7, 1978.
[5] L. C. Freeman, "A set of measures of centrality based on betweenness," Sociometry, vol. 40 no. 1, pp. 35-41, DOI: 10.2307/3033543, 1977.
[6] G. Sabidussi, "The centrality index of a graph," Psychometrika, vol. 31, pp. 581-603, DOI: 10.1007/BF02289527, 1966.
[7] P. Bonacich, P. Lloyd, "Eigenvector-like measures of centrality for asymmetric relations," Social Networks, vol. 23 no. 3, pp. 191-201, DOI: 10.1016/S0378-8733(01)00038-7, 2001.
[8] Z. M. Han, W. Yang, X. S. Tan, D. G. Duan, W. J. Yang, "Ranking key nodes in complex networks by considering structural holes," Acta Physica Sinica, vol. 64, 2015.
[9] M. S. Granovetter, "The strength of weak ties," American Journal of Sociology, vol. 78 no. 6, pp. 1360-1380, DOI: 10.1086/225469, 1973.
[10] M. Kitsak, L. K. Gallos, S. Havlin, F. Liljeros, L. Muchnik, H. E. Stanley, H. A. Makse, "Identification of influential spreaders in complex networks," Nature Physics, vol. 6 no. 11, pp. 888-893, DOI: 10.1038/nphys1746, 2010.
[11] Y. Liu, M. Tang, Y. Do, P. Hui, "Identify influential spreaders in complex networks, the role of neighborhood," Physica A, vol. 452, 2016.
[12] A. Zeng, C.-J. Zhang, "Ranking spreaders by decomposing complex networks," Physics Letters A, vol. 377, 2013.
[13] L. l. Ma, C. Ma, H. Zhang, B. Wang, "Identifying influential spreaders in complex networks based on gravity formula," Physica A, vol. 451, 2016.
[14] D. Chen, L. Lü, M. Shang, Y. Zhang, T. Zhou, "Identifying influential nodes in complex networks," Physica A: Statistical Mechanics and its Applications, vol. 391 no. 4, pp. 1777-1787, DOI: 10.1016/j.physa.2011.09.017, 2012.
[15] J. Bae, S. Kim, "Identifying and ranking influential spreaders in complex networks by neighborhood coreness," Physica A: Statistical Mechanics and its Applications, vol. 395, pp. 549-559, DOI: 10.1016/j.physa.2013.10.047, 2014.
[16] D. B. Chen, H. Gao, L. Y. Lü, T. Zhou, "Identifying Influential Nodes in Large-Scale Directed Networks: The Role of Clustering," Plos One, vol. 8, 2013.
[17] Y. Liu, M. Tang, Y. Do, P. M. Hui, "Accurate ranking of influential spreaders in networks based on dynamically asymmetric link weights," Physical Review E, vol. 96, 2017.
[18] D. Chen, R. Xiao, A. Zeng, Y. Zhang, "Path diversity improves the identification of influential spreaders," EPL (Europhysics Letters), vol. 104 no. 6,DOI: 10.1209/0295-5075/104/68006, 2013.
[19] J. E. Hirsch, "An index to quantify an individual's scientific research output," PNAS, vol. 102 no. 46, pp. 16569-16572, DOI: 10.1073/pnas.0507655102, 2005.
[20] S. Gao, J. Ma, Z. Chen, G. Wang, C. Xing, "Ranking the spreading ability of nodes in complex networks based on local structure," Physica A, vol. 403, 2014.
[21] H. Shen, X. Cheng, K. Cai, M.-B. Hu, "Detect overlapping and hierarchical community structure in networks," Physica A: Statistical Mechanics and its Applications, vol. 388 no. 8, pp. 1706-1712, DOI: 10.1016/j.physa.2008.12.021, 2009.
[22] C. Qi, R. Xin, Z. Shi, H. Bin, "Triangular clustering in document networks," New Journal of Physics, vol. 11 no. 3, 2009.
[23] W. W. Zachary, "An information flow model for conflict and fission in small groups," Journal of Anthropological Research, vol. 33 no. 4, pp. 452-473, DOI: 10.1086/jar.33.4.3629752, 1977.
[24] V. Krebs, Uspolbooks, 2015. http://www.orgnet.com
[25] M. E. J. Newman, "Finding community structure in networks using the eigenvectors of matrices," Physical Review E: Statistical, Nonlinear, and Soft Matter Physics, vol. 74 no. 3,DOI: 10.1103/PhysRevE.74.036104, 2006.
[26] R. Guimerà, L. Danon, A. Díaz-Guilera, F. Giralt, A. Arenas, "Self-similar community structure in a network of human interactions," Physical Review E: Statistical, Nonlinear, and Soft Matter Physics, vol. 68 no. 6,DOI: 10.1103/PhysRevE.68.065103, 2003.
[27] M. Girvan, M. E. J. Newman, "Community structure in social and biological networks," Proceedings of the National Academy of Sciences, USA, vol. 99, pp. 7281-7826, 2002.
[28] D. J. Watts, S. H. Strogatz, "Collective dynamics of “small-world” networks," Nature, vol. 393 no. 6684, pp. 440-442, DOI: 10.1038/30918, 1998.
[29] A. Barabasi, R. Albert, "Emergence of scaling in random networks," Science, vol. 286 no. 5439, pp. 509-512, DOI: 10.1126/science.286.5439.509, 1999.
[30] A. Lancichinetti, S. Fortunato, F. Radicchi, "Benchmark graphs for testing community detection algorithms," Physical Review E: Statistical, Nonlinear, and Soft Matter Physics, vol. 78 no. 4,DOI: 10.1103/PhysRevE.78.046110, 2008.
[31] M. G. Kendall, "A new measure of rank correlation," Biometrika, vol. 30 no. 1-2, pp. 81-93, DOI: 10.1093/biomet/30.1-2.81, 1938.
[32] Q. Li, T. Zhou, L. Lü, D. Chen, "Identifying influential spreaders by weighted LeaderRank," Physica A: Statistical Mechanics and its Applications, vol. 404, pp. 47-55, DOI: 10.1016/j.physa.2014.02.041, 2014.
[33] M. Kimura, K. Saito, "Tractable models for information diffusion in social networks," Knowledge Discovery in Databases: PKDD 2006, vol. 4213, pp. 259-271, DOI: 10.1007/11871637_27, 2006.
[34] R. Pastor-Satorras, A. Vespignani, "Epidemic dynamics and endemic states in complex networks," Physical Review E: Statistical, Nonlinear, and Soft Matter Physics, vol. 63 no. 6,DOI: 10.1103/PhysRevE.63.066117, 2001.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2019 Xiaojian Ma and Yinghong Ma. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
Detecting influential spreaders had become a challenging and crucial topic so far due to its practical application in many areas, such as information propagation inhibition and disease dissemination control. Some traditional local based evaluation methods had given many discussions on ranking important nodes. In this paper, ranking nodes of networks continues to be discussed. A semilocal structures method for ranking nodes based on the degree and the neighbors’ connections of the node is presented. The semilocal structures are regarded as the number of neighbors of the nodes and the connections between the node and its neighbors. We combined the triangle structure and the degree information of the neighbors to define the inner-outer spreading ability of the nodes and then summed the node neighbors’ inner-outer spreading ability to be used as the local triangle structure centrality (LTSC). The LTSC avoids the defect “pseudo denser connections” in measuring the structure of neighbors. The performance of the proposed LTSC method is evaluated by comparing the spreading ability on both real-world and synthetic networks with the SIR model. The simulation results of the discriminability and the correctness compared with pairs of ranks (one is generated by SIR model and the others are generated by central nodes measures) show that LTSC outperforms some other local or semilocal methods in evaluating the node’s influence in most cases, such as degree, betweenness, H-index, local centrality, local structure centrality, K-shell, and S-shell. The experiments prove that the LTSC is an efficient and accurate ranking method which provides a more reasonable evaluating index to rank nodes than some previous approaches.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer