7315 ijcnc01

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF COMPLEX NETWORKS

Natarajan Meghanathan

Jackson State University, 1400 Lynch St, Jackson, MS, USA

ABSTRACT

In this paper, we show that the spectral radius ratio for node degree could be used to analyze the variation of node degree during the evolution of complex networks. We focus on three commonly studied models of complex networks: random networks, scale-free networks and small-world networks. The spectral radius ratio for node degree is defined as the ratio of the principal (largest) eigenvalue of the adjacency matrix of a network graph to that of the average node degree. During the evolution of each of the above three categories of networks (using the appropriate evolution model for each category), we observe the spectral radius ratio for node degree to exhibit high-very high positive correlation (0.75 or above) to that of the coefficient of variation of node degree (ratio of the standard deviation of node degree and average node degree). We show that the spectral radius ratio for node degree could be used as the basis to tune the operating parameters of the evolution models for each of the three categories of complex networks as well as analyze the impact of specific operating parameters for each model.

KEYWORDS

Eigenvalue, Spectral radius, Random network, Scale-free network, Small-world network, Node degree

1.INTRODUCTION

Network analysis and visualization of large complex real-world networks, ranging anywhere from social networks [1][2], co-authorship networks [3], Internet [4], World wide web [4] to biological networks [5] and etc is an actively researched area in recent years. The strength of network analysis is to abstract the complex relationships between the members of the system in the form of a graph with nodes (comprising of the constituent members) and edges (weighted or unit- weight as well as directed or undirected, depending on the nature of the interactions) and study the characteristics of the graph with respect to one or more metrics (like node degree, diameter, clustering coefficient and etc). The adjacency matrix A(G) of the network graph G essentially captures the presence of edges between any two vertices. For any two vertices i and j in graph G, the entry in the ith row and jth column of A(G) is 1 if there is an edge from vertex i to vertex j and 0 otherwise. This paper focuses on undirected graphs (edge i–j exists in both the directions: i–>j as well as from j–>i) and node degree as the metric under study.

Spectral decomposition is a method of projecting the characteristics of a network graph in n- dimensions or directions (that are mutually perpendicular) where n is the number of vertices in the graph. The projection in each direction is represented in the form of a scalar value (called the eigenvalue) and its corresponding vector with entries for each vertex (called the eigenvector). Though the number of dimensions in the spectrum is the number of vertices in the graph, most of the variations could be captured in the first few dimensions of the coordinate system represented

DOI : 10.5121/ijcnc.2015.7301

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

by the eigenvalues and the eigenvectors. The largest eigenvalue of the projection is called the principal eigenvalue and the corresponding eigenvector is called the principal eigenvector (could be determined by executing the power iteration algorithm [6] on the adjacency matrix of a network graph). The principal eigenvalue and its corresponding eigenvector capture maximum amount of variability in the data (in the case of a network graph, the data are the edges connecting the vertices). In this paper, we make use of the principal eigenvalue (also called the spectral radius) of the adjacency matrix of complex network graphs to analyze the variations in node degree and correlate with the coefficient of variation of node degree. We specifically use the spectral radius ratio for node degree as the basis to evaluate the evolution of three commonly studied complex network models, such as the random networks, scale-free networks and small- world networks. To the best of our knowledge, we could not come across any related work that uses spectral radius as the basis to comprehensively analyze the evolution of all the above three categories of complex networks.

The rest of the paper is organized as follows: In Section 2, we introduce the power iteration method that was used in this research to calculate the spectral radius of a network graph. Sections 3, 4 and 5 validate our hypothesis on random networks, scale-free networks and small-world networks respectively. Section 6 concludes the paper.

2. POWER ITERATION METHOD TO CALCULATE SPECTRAL RADIUS

The power iteration method can be used to calculate the principal eigenvalue (i.e., spectral radius) and the corresponding principal eigenvector of a graph based on its adjacency matrix. The eigenvector Xi+1 of a network graph at the end of the (i+1)th iteration is given by: Xi+1 = AXi / ||AXi||, where ||AXi|| is the normalized value of the product of the adjacency matrix A of a given graph and the tentative eigenvector Xi at the end of iteration i. The initial value of Xi is [1, 1, …, 1], a column vector of all 1s, where the number of 1s correspond to the number of vertices in the graph. We continue the iterations until the normalized value ||AXi+1|| converges to that of the normalized value ||AXi||. The value of the column vector Xi at this juncture is declared the principal eigenvector of the network graph and the normalized value to which the iterations converge is the principal eigenvalue (i.e., the spectral radius). Figure 1 illustrates an example for computation of the spectral radius on a network graph. The converged normalized value of 2.21 is the spectral radius of the graph (denoted λsp(G)). If kmin, kavg and kmax are the minimum, average and maximum node degrees, then, kmin ≤ kavg ≤ λsp(G) ≤ kmax [7].

Figure 1: Example to illustrate the execution of the power iteration method to compute the spectral radius of a network graph

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

3. SIMULATION AND ANALYSIS OF RANDOM NETWORKS

A random network is a network wherein all the nodes have comparative degree and no single node has a degree that is significantly different from that of the other nodes. Ideally, for a random network, the standard deviation of the node degree should be 0; however, this is possible only if all nodes have the same degree. In real-world random networks and the random networks generated by evolutionary models (using simulations), the degrees of all the nodes lie close to the average degree of the nodes and the variation in the node degree is appreciably low. In this section, we discuss the well-known Erdos-Renoyi model [11] and its simulation to study the variation of node degrees during the evolution of random networks.

Initially, all the nodes in the network are uniform-randomly distributed; there could be links between any two nodes. Thus, if there are N nodes in the network, the maximum number of links in the network could be N(N-1)/2 and we store the set of all possible links in a set called the CandidateLinks. The Erdos-Renoyi model constructs a random network as follows: We randomly pick a node (say node u) among the nodes in the network and if there exists at least one link involving node u among the links in CandidateLinks, then we randomly pick a node (say node v) among these nodes with which u has link in the set CandidateLinks (if no such u-v link exists in the set CandidateLinks, then node u is not considered any more for link addition). We generate a random number from 0 to 1; if this random number is less than pER (pER is the probability that there can be a link between any two nodes in the network), then the link u-v is added to the network; otherwise, not. Either way, the link u-v is removed from the set CandidateLinks after being considered for possible inclusion in the network. We repeat the above procedure until the CandidateLinks set becomes empty. During this process of evolution, the network is not connected (i.e., not all nodes exist as a single component such that the nodes are reachable to each other either directly or through one or more intermediate nodes) for a while and we refer to this phase of the network evolution as the growth phase. During the growth phase, the network comprises of several components (subsets of the vertices) that are not reachable from each other. An interesting fact is that not all the components are of the same size; there exists a particular component called the giant component whose size increases with the addition of links in the network and the rest of the components are of relatively much smaller size. As the network evolves (with the addition of new links), the smaller components merge with the giant component

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

and eventually, at the end of the growth phase, there exists only one connected component in the network. We refer to the phase of network evolution subsequent to the growth phase as the balancing phase – because, the further addition of links (i.e., after the emergence of a single connected component that encompasses all the vertices) reduces the variation in node degree and facilitates all nodes to have comparable degree. We observe the same in Tables 2 and 3 that list the values for the spectral radius ratio for node degree and the coefficient of variation of node degree at the time of emergence of the giant component as well as at the end of the network evolution.

We simulate the evolution of random networks according to the Erdos-Renoyi model by starting with a network of total of N nodes (we conduct simulations with N = 100 nodes and 200 nodes) and attempting to add one link at a time with a probability pER whose values are 0.05, 0.10, 0.20 and 0.30. For each value of pER, we run 200 instances of these simulations and average the results for the spectral radius ratio for node degree and the coefficient of variation of node degree observed for each possible value of the size of the giant component (the size of the giant component ranges from 2 nodes to the total number of nodes in the network). Figures 2 and 3 capture the distribution of the spectral radius ratio for node degree and coefficient of variation of node degree as a function of the size of the giant component during the growth phase for different values of pER for 100 and 200 nodes respectively. The interesting observations are that the distribution of the variation in node degrees in the giant component during the growth phase remains almost the same, irrespective of the probability of link between any two nodes (as long as pER is above the threshold value of 1/number of nodes). We observe a similar pattern of increase in the variation of node degree with respect to both the metrics as the giant component evolves; after the giant component reaches a size that is more than 85% of the nodes in the network, we start seeing a marginal decrease in the variation of node degree. We could thus say that the spectral radius ratio for node degree as well as the coefficient of variation in node degree exhibit a concave down pattern of increase with increase in the size of the giant component during the growth phase.

Figure 2: Distribution of the spectral radius ratio for node degree and the coefficient of variation of node degree during evolution of the giant component of a random network: Growth phase (Total Number of Nodes: 100)

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

Figure 3: Distribution of the spectral radius ratio for node degree and the coefficient of variation of node degree during evolution of the giant component of a random network: Growth phase(Total Number of Nodes: 200)

Table 1 illustrates the correlation coefficient between the values for the above two metrics during the evolution of the giant component and we observe the correlation coefficient to be at least above 0.75 (we refer to such a correlation as high correlation). For a given number of nodes, we observe the correlation between the two metrics during the growth phase evolution of the giant component not to change much with increase in the probability of link between an two nodes. We observe similar trend of relationship between the spectral radius ratio for node degree and coefficient of variation in node degree for both 100 nodes and 200 nodes networks. As the number of nodes increases from 100 to 200, we observe the correlation coefficient between the two metrics to remain about the same, with some minor differences. This shows that the correlation between the spectral radius ratio for node degree and the coefficient of variation for node degree is scalable with the number of nodes.

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

When we continue to add links (with probability pER) after the emergence of a single connected giant component that encompasses all the vertices, we observe the further addition of links causes the variation in node degree to rapidly decrease (the rate of decrease increases with increase in the pER value). As the probability of link between any two nodes increases, there is an increase in the probability of any two nodes to have the same degree. From Tables 2 and 3, we could observe the spectral radius ratio for node degree and the coefficient of variation of node degree to be higher at lower values of pER (say, 0.05) and decrease rapidly to their minimum possible values (1.0 for spectral radius ratio for node degree and 0.0 for the coefficient of variation of node degree) as the probability of link between any two nodes increases (to values of 0.20 and 0.30). Thus, even though the degree distribution of the nodes in the giant component (during the growth phase) does not change much with the probability of link between any two nodes, once a single connected giant component emerges (encompassing all the vertices), the impact of the probability of link between any two nodes could be observed on the degree distribution of the nodes: the larger the probability of link between any two nodes, the larger the chances of the degrees of the nodes to converge towards the average value (thereby lowering the spectral radius ratio for node degree and the coefficient of variation in node degree to their lowest possible values of 1.0 and 0.0 respectively).

4. SIMULATION AND ANALYSIS OF SCALE-FREE NETWORKS

A scale-free network is a network wherein a majority of the nodes have a very small degree and very few nodes (but appreciable number of nodes) have a very high degree [4]. An airline network is typically a scale-free network with certain airports being used as high-degree hubs with connections to other hubs as well as connections to nearby low-degree nodes/airports. The simulations for scale-free networks were conducted as follows: We use the Barabasi-Albert (BA) model [8] for generating scale-free networks. According to this model, to start with, the network is assigned an initial set of nodes (3, in our simulations): the links between which are chosen arbitrarily, as long as there is one link per node (accordingly, we consider each node for link inclusion and connect the node to a randomly chosen node, other than itself, from the initial set of nodes). We then introduce nodes, one at a time. At each time step, a new node is introduced to the network with m number of new links (varied from 2, 3, 4, 5, 10, 20, 30, 40, 50 and 90) connecting to the nodes that are already exist in the network (at most one new link per node). If the number of new links that could be added to the network is less than the number of nodes in the network, then the number of new links is set to the number of nodes in the network (in such cases, the newly introduced is basically connected to every node that already exist in the network). Once the number of new links added per node becomes less than the number of nodes that already exist in

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

the network, then the node to pair with is chosen according to the probability formula that denotes the probability at which an already existing node i with degree ki at time instant t (corresponds to the introduction of node with ID t) acquires a link. We compute such probabilities for the nodes that were introduced prior to time instant t (i.e., node IDs 1,…, t-1). Note that the denominator in the probability formula sums only the degrees of the nodes that already exist in the network before the introduction of the new node and its links (i.e., the new node and its impact on the node degrees is not considered in the denominator portion of the formula). The values reported in Figures 2-5 and Tables 4-5 for the Spectral Radius Ratio for Node Degree (ratio of the spectral radius of the adjacency matrix and the average node degree) and the coefficient of variation of node degree (ratio of the standard deviation to the average node degree) are average values obtained from 100 runs of the simulation code for a particular condition (initial number nodes, total number of nodes and number of new links added per node introduction).

The spectral radius ratio for node degree could be used as a measure to determine the appropriate number of new links per node (that could be added to the network upon introduction of a node into the network) to achieve a desired threshold level of variation in node degree. As we divide the spectral radius by the average node degree, the spectral radius ratio for node degree can be used as a measure to judge the variation of node degree across networks of any size. Tables 4 and 5 display the spectral radius ratio for node degree and coefficient of variation of node degree after the introduction of the last node in the network. As we can see, for a fixed value of the number of new links added per node and the initial number of nodes, the smaller the value of the total number of nodes to be introduced to the network, the lower the variation in node degree. As we increase the number of new links added per node introduction, the distribution of links is relatively more even across all the nodes, especially when the total number of nodes in the network is lower.

Figure 4: Distribution of the spectral radius ratio for node degree under the scale-free model for network evolution (initial # nodes: 3; total # nodes: 1000)

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

Figures 4-5 and 6-7 respectively capture the distribution of the spectral radius ratio for node degree and the coefficient of variation of node degree as a function of time (upon the introduction of each new node into the network). After the initial set of nodes are assigned at least one link (per node), we add new nodes to the network, one at a time, as per the probability formula for an existing node to get a new link as a function of its degree. With passage of time, nodes that have been around in the network for a longer time incur a larger degree compared to nodes that were recently added; such a variation in node degree increases with time. Thus, we could see both the spectral radius ratio for node degree and the coefficient of variation of node degree to exhibit a very high correlation and similar pattern of distribution with time.

For a fixed value of the number of new links added per node and the initial number of nodes, the larger the value for the total number of nodes to be introduced to the network, the larger the space of distribution (range of values) for the two measures as a function of time as well as larger the correlation coefficient between the two measures. In other words, larger the scale-free nature of the network, the larger the correlation coefficient between the spectral radius ratio for node degree and the coefficient of variation of node degree. Table 6 displays the values for the correlation coefficient computed using the distribution of the spectral radius ratio for node degree and the coefficient of variation [9] of node degree as a function of time, with the introduction of new nodes into the network (for fixed values of the number of new links added per node introduction and the initial number of nodes). Apparently, when then the spectral radius ratio for node degree approaches close to 1 (scenario of total of 100 nodes with 90 new links per node introduction), the correlation coefficient between the two measures (that was henceforth decreasing with increase in the number of new links added per node) bounces back and approaches 1.

Figure 7: Distribution of the coefficient of variation of node degree under the scale-free model for network evolution (initial # nodes: 3; total # nodes: 100)

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

4. SIMULATION AND ANALYSIS OF SMALL-WORLD NETWORKS

A small-world network is a type of graph in which most nodes are not neighbors of one another, but most nodes can be reached from every other by a small number of hops [4]. An example for a small-world network is the road network of a country, wherein most of the cities are not neighbors of each other, but can be reached from one another by going through a fewer number of intermediate cities. The simulations for small-world networks are conducted according to the Watts and Strogatz model [10], explained as follows.

We start with a regular 1-dimensional ring network comprising of two concentric rings, each with the same number of nodes (say, N; hence, the total number of nodes in the network is referred to as 2N). The nodes are initially organized in a ring network as shown in the left side of Figure 8. The number of neighbors per node is 4. For a node i in the outer ring, its neighbors in the outer ring are chosen as follows: (i + 1) % N and (i – 1 + N) % N. For a node i in the inner ring, its

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

neighbors in the inner ring are chosen as follows: N + ((i + 1) % N) and N + ((i – 1 + N) % N). In the initial regular ring network, as each node has 4 neighbors, the average number of neighbors per node is 4; with a total of 2N nodes, the total number of links in the initial regular ring network is (average node degree * total number of nodes / 2) = (4 * 2N / 2) = 4N = 2*total number of nodes in the network.

Regular Network (before rewiring)

Small-World Network (after rewiring)

Figure 8: Transformation of a regular network to small-world network

We consider each link in the initial set of links for a possible rewiring with a probability β. For each link (u–v), where u < v (in the order of the numerical IDs), we generate a random number (uniform-randomly distributed from 0 to 1) and if the random number is less than or equal to β, the node with the lower ID u is paired with a randomly chosen node (other than itself, node v and the current neighbors) and the newly created link is not considered for any rewiring. This way, the average number of neighbors per node at any time is still the initial value for the average number of neighbor per node (which is 4 in our simulations); but, variation in the distribution of the node degree starts creeping in with rewiring. The network at the right side of Figure 8 displays a sample network at the end of rewiring.

We calculated the spectral radius of the network during the course of the rewiring of the links in the network (i.e., whenever a link is rewired). For a fixed total number of nodes and rewiring probability, as more links get rewired, the spectral radius of the network increased and the coefficient of variation in the node degree also increased (the correlation coefficient always remained above 0.95 during the process of rewiring). Figure 9 illustrates the distribution of the spectral radius for node degree and the coefficient of variation of node degree during the process of rewiring for a network with a total of 100 nodes (i.e., # nodes per ring N is 50) and a rewiring probability of 0.5 – about 50% of the total number of links (0.5 * 2 * 100 = 100) got rewired and the figure displays the increase in the two metrics with rewiring.

For a fixed probability of rewiring, the variation in the node degree is about the same when the value for the total number of nodes in the network is varied (from 100 to 1000 in the simulations). In other words, for any fixed value of the rewiring probability, the spectral radius ratio for node degree observed at the end of rewiring a network of 100 nodes is about the same as the spectral radius ratio for node degree observed at the end of rewiring a network of 1000 nodes. This is a significant observation as the total number of nodes we start with does not impact the variation in the node degree during the course of rewiring (for a fixed probability of rewiring).

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

Figure 9: Distribution of variation in node degree during the process of rewiring (total # nodes: 100; rewiring probability: 0.5)

Figure 10: Distribution of variation in node degree as a function of the rewiring probability (averaged across the total # nodes from 100 to 1000)

For a fixed value of the total number of nodes, smaller the value for the rewiring probability, smaller the variation in the node degree, as reflected in Figure 10 (averaged for each rewiring probability, based on the observations for the total number of nodes from 100 to 1000, as we do not see significant variation in node degree with increase in the number of nodes for a fixed probability of rewiring). We observe the spectral radius ratio for node degree to increase from 1 to 1.15 as we increase the rewiring probability from 0 to 0.5; but, as we increase the rewiring probability from 0.6 to 1 (the network being completely random), the spectral radius ratio for node degree increased only from 1.15 to 1.25.

5. CONCLUSIONS

The high-level contribution of this research is the identification of a positive correlation between the ratio of the spectral radius to the average node degree to that of the coefficient of variation of node degree for networks simulated from theoretical models (for random networks, scale-free networks and small-world networks). The positive correlation has been observed for networks of different sizes, ranging from networks of 5 nodes, to a few hundred nodes, all the way to networks of thousands of nodes. With this observation, we can now confidently say that the closer the value of the spectral radius ratio for node degree to 1.0, the smaller the variation among the degrees of the nodes in the network. The variation in the node degrees for any two networks can also be simply compared based on the spectral radius ratio for node degrees observed for the two networks, rather than requiring to compute the standard deviation of the node degrees for the two networks. As explained in Sections 3, 4 and 5, the operating parameters of the theoretical models

International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.3, May 2015

can be tuned to obtain a desired variation in the degree of the nodes in the network, measured in the form of the spectral radius ratio for node degree.

For each category of complex networks, we have a significant observation on the variation of node degree: For the random networks, we observe that the variation in node degree is relatively more pronounced in the single connected giant component at the time of its emergence, compared to the variation in node degree observed at the end of the network evolution. Such an observation has been so far not reported in the literature. For a given probability of link between any two nodes, the variation in node degree decreases with increase in the number of nodes. On the other hand, for a scale-free network, given a certain number of links added per time, the variation in node degree increases with increase in the number of nodes. In the case of small-world networks, for a given probability of rewiring of the links, we observe the number of nodes in the network does not impact the variation in node degree during the course of rewiring.

REFERENCES

[1]D. Lusseau, K. Schneider, O. J. Boisseau, P. Haase, E. Slooten, and S. M. Dawson, “The Bottlenose Dolphin Community of Doubtful Sound Features a Large Proportion of Long-lasting Associations,” Behavioral Ecology and Sociobiology, vol. 54, pp. 396-405, 2003.

[2]W. W. Zachary, “An Information Flow Model for Conflict and Fission in Small Groups,” Journal of Anthropological Research, vol. 33, no. 4, pp. 452-473, 1977.

[3]M. E. J. Newman, “The Structure of Scientific Collaboration Networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 98, no. 2, pp. 404-409, January 2001.

[4]M. Newman, Networks: An Introduction, 1st ed., Oxford University Press, May 2010.

[5]M. Girvan and M. E. J. Newman, “Community Structure in Social and Biological Networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 19, no. 12,

pp.7821-7826, April 2002.

[6]G. Strang, Linear Algebra and its Applications, Cengage Learning, 4th edition, July 2005.

[7]B. G. Horne, “Lower Bounds for the Spectral Radius of a Matrix,” Linear Algebra and its Applications, vol. 263, pp. 261-273, September 1997.

[8]A-L. Barabasi and R. Albert, “Emergence of Scaling in Random Networks,” Science, vol. 286, pp. 509-512, October 1999.

[9]J. Benesty, J. Chen, Y. Huang, I. Cohen, “Pearson Correlation Coefficient,” Springer Topics in Signal Processing: Noise Reduction in Speech Processing, vol. 2, pp. 1-4, March 2009.

[10]D. J. Watts and S. H. Strogatz, “Collective Dynamics of ‘Small-World’ Networks,” Nature, vol. 393,

pp.440-442, June 1998.

[11]P. Erdos and A. Renyi, “On Random Graphs,” Publications Mathematicae, vol. 6, pp. 290-297, 1959.

International Journal of Computer Networks & Communications (IJCNC)

7315 ijcnc01

Share this:

Leave a comment Cancel reply