Scale-free network

Source: Wikipedia, the free encyclopedia.
Degree distribution for a network with 150000 vertices and mean degree = 6 created using the Barabási–Albert model (blue dots). The distribution follows an analytical form given by the ratio of two gamma functions (black line) which approximates as a power-law.

A scale-free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction P(k) of nodes in the network having k connections to other nodes goes for large values of k as

$P(k) \sim k^{-\gamma}$

where $\gamma$ is a parameter whose value is typically in the range $2 < \gamma < 3$ (wherein the second moment (scale parameter) of $k^{-\gamma}$ is infinite but the first moment is finite), although occasionally it may lie outside these bounds.[1][2] The name "scale-free" could be explained by the fact that some moments of the degree distribution are not defined, so that the network does not have a characteristic scale or "size".
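The statement about moments can be checked with a short calculation. The sketch below treats the degree as a continuous variable with a lower cutoff $k_{\min}$, an assumption made here only for simplicity; $m$ denotes the order of the moment.

```latex
\langle k^{m} \rangle \;\propto\; \int_{k_{\min}}^{\infty} k^{m}\, k^{-\gamma}\, dk
  \;=\; \int_{k_{\min}}^{\infty} k^{\,m-\gamma}\, dk ,
\qquad \text{which converges if and only if } \gamma > m + 1 .
```

Under this approximation the first moment ($m = 1$) is finite for $\gamma > 2$, while the second moment ($m = 2$) diverges for $\gamma \le 3$, which is the range quoted above.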

Many networks have been reported to be scale-free, although statistical analysis has refuted many of these claims and seriously questioned others.[3][4] Additionally, some have argued that simply knowing that a degree-distribution is fat-tailed is more important than knowing whether a network is scale-free according to statistically rigorous definitions.[5][6] Preferential attachment and the fitness model have been proposed as mechanisms to explain conjectured power law degree distributions in real networks. Alternative models such as super-linear preferential attachment and second-neighbour preferential attachment may appear to generate transient scale-free networks, but the degree distribution deviates from a power law as networks become very large.[7][8]

History

In studies of the networks of citations between scientific papers, Derek de Solla Price showed in 1965 that the number of links to papers—i.e., the number of citations they receive—had a heavy-tailed distribution following a Pareto distribution or power law, and thus that the citation network is scale-free. He did not however use the term "scale-free network", which was not coined until some decades later. In a later paper in 1976, Price also proposed a mechanism to explain the occurrence of power laws in citation networks, which he called "cumulative advantage" but which is today more commonly known under the name preferential attachment.

Recent interest in scale-free networks started in 1999 with work by Albert-László Barabási and Réka Albert at the University of Notre Dame who mapped the topology of a portion of the World Wide Web,[9] finding that some nodes, which they called "hubs", had many more connections than others and that the network as a whole had a power-law distribution of the number of links connecting to a node. After finding that a few other networks, including some social and biological networks, also had heavy-tailed degree distributions, Barabási and Réka Albert coined the term "scale-free network" to describe the class of networks that exhibit a power-law degree distribution. However, studying seven examples of networks in social, economic, technological, biological, and physical systems, Amaral et al. were not able to find a scale-free network among these seven examples. Only one of these examples, the movie-actor network, had degree distribution P(k) following a power law regime for moderate k, though eventually this power law regime was followed by a sharp cutoff showing exponential decay for large k.[10]

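Barabási and Albert proposed a generative mechanism to explain the appearance of power-law distributions, which they called "preferential attachment" and which is essentially the same as that proposed by Price. Analytic solutions for this mechanism were presented in 2000 by Dorogovtsev, Mendes and Samukhin[11] and independently by Krapivsky, Redner, and Leyvraz, and later rigorously proved by mathematician Béla Bollobás.[12] Notably, however, this mechanism only produces a specific subset of networks in the scale-free class, and many alternative mechanisms have been discovered since.[13]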

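The history of scale-free networks also includes some disagreement. On an empirical level, the scale-free nature of several networks has been called into question. For instance, the three brothers Faloutsos believed that the Internet had a power-law degree distribution on the basis of traceroute data; however, it has been suggested that this is a layer 3 illusion created by routers, which appear as high-degree nodes while concealing the internal layer 2 structure of the ASes they interconnect.[14]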

On a theoretical level, refinements to the abstract definition of scale-free have been proposed. For example, Li et al. (2005) offered a potentially more precise "scale-free metric". Briefly, let G be a graph with edge set E, and denote the degree of a vertex $v$ (that is, the number of edges incident to $v$) by $\deg(v)$. Define

$s(G) = \sum_{(u,v) \in E} \deg(u) \cdot \deg(v).$

This is maximized when high-degree nodes are connected to other high-degree nodes. Now define

$S(G) = \frac{s(G)}{s_{\max}},$

where $s_{\max}$ is the maximum value of s(H) for H in the set of all graphs with degree distribution identical to that of G. This gives a metric between 0 and 1, where a graph G with small S(G) is "scale-rich", and a graph G with S(G) close to 1 is "scale-free". This definition captures the notion of self-similarity implied in the name "scale-free".
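As a rough illustration of the unnormalized quantity s(G), taken here to be the sum of deg(u)·deg(v) over all edges (u, v), the following sketch compares two small graphs that share a degree sequence. Both edge lists are hypothetical examples, and the normalization by the maximum over all graphs with the same degree sequence is not attempted because that search is much more involved.

```python
from collections import Counter


def s_metric(edges):
    """Return s(G): the sum over edges (u, v) of deg(u) * deg(v)."""
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return sum(degree[u] * degree[v] for u, v in edges)


# Two hypothetical graphs with the same degree sequence (3, 3, 2, 1, 1, 1, 1).
# In the first, the two degree-3 hubs (nodes 0 and 1) share an edge.
hubs_adjacent = [(0, 1), (0, 3), (0, 4), (1, 5), (1, 2), (2, 6)]
# In the second, the hubs are only connected through node 2.
hubs_apart = [(0, 2), (2, 1), (0, 3), (0, 4), (1, 5), (1, 6)]

print(s_metric(hubs_adjacent))  # 26
print(s_metric(hubs_apart))     # 24
```

The graph in which the hubs are adjacent has the larger value, in line with the remark that s is maximized when high-degree nodes connect to each other.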

Overview

When the concept of "scale-free" was initially introduced in the context of networks,[9] it primarily referred to a specific trait: a power-law distribution for a given variable $k$, expressed as $f(k) \propto k^{-\gamma}$. This property maintains its form when subjected to a continuous scale transformation $k \to k + \epsilon k$, evoking parallels with the renormalization group techniques in statistical field theory.[15][16]

However, there's a key difference. In statistical field theory, the term "scale" often pertains to system size. In the realm of networks, "scale" is a measure of connectivity, generally quantified by a node's degree—that is, the number of links attached to it. Networks featuring a higher number of high-degree nodes are deemed to have greater connectivity.

The power-law degree distribution enables us to make "scale-free" assertions about the prevalence of high-degree nodes.[17] For instance, we can say that "nodes with triple the average connectivity occur half as frequently as nodes with average connectivity." The specific numerical value of what constitutes "average connectivity" becomes irrelevant, whether it's a hundred or a million.[18]
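Why such assertions do not depend on the absolute degree follows in one line from the power-law form $P(k) \propto k^{-\gamma}$. In the sketch below, $c$ is an arbitrary rescaling factor introduced for illustration.

```latex
\frac{P(ck)}{P(k)} \;=\; \frac{(ck)^{-\gamma}}{k^{-\gamma}} \;=\; c^{-\gamma}
```

The ratio depends only on the factor $c$ and the exponent $\gamma$, not on $k$ itself, so the relative frequency of nodes with $c$ times a given connectivity is the same at every scale.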

Characteristics

Random network (a) and scale-free network (b)
Complex network degree distribution of random and scale-free

The most notable characteristic in a scale-free network is the relative commonness of vertices with a degree that greatly exceeds the average. The highest-degree nodes are often called "hubs", and are thought to serve specific purposes in their networks, although this depends greatly on the domain.

Clustering

Another important characteristic of scale-free networks is the small-world phenomenon.

At present, the more specific characteristics of scale-free networks vary with the generative mechanism used to create them. For instance, networks generated by preferential attachment typically place the high-degree vertices in the middle of the network, connecting them together to form a core, with progressively lower-degree nodes making up the regions between the core and the periphery. The random removal of even a large fraction of vertices impacts the overall connectedness of the network very little, suggesting that such topologies could be useful for security, while targeted attacks destroy the connectedness very quickly. Other scale-free networks, which place the high-degree vertices at the periphery, do not exhibit these properties. Similarly, the clustering coefficient of scale-free networks can vary significantly depending on other topological details.
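The contrast between random failures and targeted attacks can be explored with a small simulation. The following is a minimal sketch, not a reference implementation: it grows a preferential-attachment graph, removes 20% of the nodes either uniformly at random or in order of decreasing degree, and measures the largest connected component that remains. The network size, the removal fraction, and the helper names `ba_graph` and `largest_component` are all assumptions chosen for illustration.

```python
import random


def ba_graph(n, m, rng):
    """Grow a preferential-attachment graph; return an adjacency dict of sets."""
    adj = {i: set() for i in range(n)}
    endpoints = []  # each node appears once per incident edge
    for i in range(m + 1):  # seed: complete graph on m + 1 nodes
        for j in range(i + 1, m + 1):
            adj[i].add(j)
            adj[j].add(i)
            endpoints += [i, j]
    for new in range(m + 1, n):
        targets = set()
        while len(targets) < m:
            # A uniform pick from the endpoint list is degree-proportional.
            targets.add(rng.choice(endpoints))
        for t in targets:
            adj[new].add(t)
            adj[t].add(new)
            endpoints += [new, t]
    return adj


def largest_component(adj, removed):
    """Size of the largest connected component after deleting `removed`."""
    seen = set(removed)
    best = 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        stack, size = [start], 0
        while stack:
            node = stack.pop()
            size += 1
            for nb in adj[node]:
                if nb not in seen:
                    seen.add(nb)
                    stack.append(nb)
        best = max(best, size)
    return best


rng = random.Random(0)
adj = ba_graph(2000, 2, rng)
k = 400  # remove 20% of the nodes
random_removed = set(rng.sample(sorted(adj), k))
hubs_removed = set(sorted(adj, key=lambda v: len(adj[v]), reverse=True)[:k])

random_lcc = largest_component(adj, random_removed)
attack_lcc = largest_component(adj, hubs_removed)
print("largest component after random failures:", random_lcc)
print("largest component after targeted attack:", attack_lcc)
```

In runs of this kind the targeted removal typically leaves a much smaller connected core than random removal of the same number of nodes, although the exact figures depend on the parameters and the random seed.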

Immunization

The question of how to efficiently immunize scale-free networks that represent realistic networks, such as the Internet and social networks, has been studied extensively. One such strategy is to immunize the largest-degree nodes, i.e., targeted (intentional) attacks, since in this case p is relatively high and fewer nodes need to be immunized. However, in many realistic cases the global structure is not available and the largest-degree nodes are not known.

Properties of random graphs may change or remain invariant under graph transformations. Mashaghi et al., for example, demonstrated that a transformation which converts random graphs to their edge-dual graphs (or line graphs) produces an ensemble of graphs with nearly the same degree distribution, but with degree correlations and a significantly higher clustering coefficient. Scale-free graphs, as such, remain scale-free under such transformations.[19]

Examples

Although many real-world networks are thought to be scale-free, the evidence often remains inconclusive, primarily due to the developing awareness of more rigorous data analysis techniques.[3] As such, the scale-free nature of many networks is still being debated by the scientific community. A few examples of networks claimed to be scale-free include:

A snapshot of the weighted planar stochastic lattice (WPSL)

Scale-free topology has also been found in high-temperature superconductors.[23] The qualities of a high-temperature superconductor (a compound in which electrons obey the laws of quantum physics and flow in perfect synchrony, without friction) appear linked to the fractal arrangements of seemingly random oxygen atoms and lattice distortion.[24]

A space-filling cellular structure, the weighted planar stochastic lattice (WPSL), has recently been proposed whose coordination number distribution follows a power law. This implies that the lattice has a few blocks with an astonishingly large number of neighbors with which they share common borders. Its construction starts with an initiator, say a square of unit area, and a generator that divides it randomly into four blocks. The generator is thereafter applied sequentially, over and over again, to only one of the available blocks, picked preferentially with respect to their areas. This results in the partitioning of the square into ever smaller mutually exclusive rectangular blocks. The dual of the WPSL (DWPSL), which is obtained by replacing each block with a node at its center and each common border between blocks with an edge joining the two corresponding vertices, emerges as a network whose degree distribution follows a power law.[25][26] The reason is that it grows following the mediation-driven attachment model rule, which also embodies the preferential attachment rule, but in disguise.
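The partitioning step described above can be sketched in a few lines. This is an illustrative sketch under stated assumptions: blocks are stored as `(x, y, width, height)` tuples, the generator is assumed to cut the chosen block with one horizontal and one vertical line through a uniformly random interior point, and the function name `wpsl_blocks` is hypothetical. Building the dual network from shared borders is not attempted here.

```python
import random


def wpsl_blocks(steps, seed=0):
    """Partition the unit square by repeatedly splitting an area-weighted block."""
    rng = random.Random(seed)
    blocks = [(0.0, 0.0, 1.0, 1.0)]  # initiator: (x, y, width, height)
    for _ in range(steps):
        areas = [w * h for _, _, w, h in blocks]
        # Pick one block with probability proportional to its area.
        i = rng.choices(range(len(blocks)), weights=areas)[0]
        x, y, w, h = blocks.pop(i)
        # Assumed generator: split at a random point into four rectangles.
        dx = rng.uniform(0.0, w)
        dy = rng.uniform(0.0, h)
        blocks += [
            (x, y, dx, dy),
            (x + dx, y, w - dx, dy),
            (x, y + dy, dx, h - dy),
            (x + dx, y + dy, w - dx, h - dy),
        ]
    return blocks


blocks = wpsl_blocks(200)
print(len(blocks))                          # each step adds three blocks
print(sum(w * h for _, _, w, h in blocks))  # total area stays close to 1
```

Each step replaces one block by four, so after t steps there are 1 + 3t blocks, and the areas continue to sum to the area of the initiator.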

Generative models

Scale-free networks do not arise by chance alone. Erdős and Rényi (1960) studied a model of growth for graphs in which, at each step, two nodes are chosen uniformly at random and a link is inserted between them. The properties of these random graphs are different from the properties found in scale-free networks, and therefore a model for this growth process is needed.

The most widely known generative model for a subset of scale-free networks is Barabási and Albert's (1999) "rich get richer" generative model, in which each new Web page creates links to existing Web pages with a probability distribution which is not uniform, but proportional to the current in-degree of Web pages. This model was originally invented by Derek de Solla Price under the name "cumulative advantage". For a detailed review, see the book by Dorogovtsev and Mendes.[citation needed] Some mechanisms such as super-linear preferential attachment and second neighbour attachment generate networks which are transiently scale-free, but deviate from a power law as networks grow large.[7][8]

A somewhat different generative model for Web links has been suggested by Pennock et al. (2002). They examined communities with interests in a specific topic such as the home pages of universities, public companies, newspapers or scientists, and discarded the major hubs of the Web. In this case, the distribution of links was no longer a power law but resembled a normal distribution. Based on these observations, the authors proposed a generative model that mixes preferential attachment with a baseline probability of gaining a link.

Another generative model is the copy model studied by Kumar et al.[28] (2000), in which new nodes choose an existent node at random and copy a fraction of the links of the existent node. This also generates a power law.

There are two major components that explain the emergence of the power-law distribution in the Barabási–Albert model: growth and preferential attachment.[29] "Growth" means a process in which, over an extended period of time, new nodes join an already existing system, a network (like the World Wide Web, which has grown by billions of web pages over 10 years). "Preferential attachment" means that new nodes prefer to connect to nodes that already have a high number of links with others. Thus, there is a higher probability that more and more nodes will link themselves to a node that already has many links, eventually turning this node into a hub.[9] Depending on the network, the hubs might either be assortative or disassortative. Assortativity would be found in social networks, in which well-connected or famous people tend to know each other better. Disassortativity would be found in technological (Internet, World Wide Web) and biological (protein interaction, metabolism) networks.[29]
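The two ingredients, growth and preferential attachment, can be combined in a short simulation. The sketch below is one common way to implement the idea, not necessarily the procedure used in the original work: it keeps a list in which every node appears once per incident edge, so that a uniform draw from the list selects a node with probability proportional to its degree. The function name `barabasi_albert_edges`, the seed graph (a complete graph on m + 1 nodes), and the parameter values are assumptions.

```python
import random
from collections import Counter


def barabasi_albert_edges(n, m, seed=0):
    """Return the edge list of a graph grown by linear preferential attachment."""
    rng = random.Random(seed)
    # Seed graph: complete graph on m + 1 nodes (an assumption of this sketch).
    edges = [(i, j) for i in range(m + 1) for j in range(i + 1, m + 1)]
    endpoints = [v for edge in edges for v in edge]
    for new in range(m + 1, n):  # growth: one new node per time step
        targets = set()
        while len(targets) < m:
            # Preferential attachment: degree-proportional choice of target.
            targets.add(rng.choice(endpoints))
        for t in targets:
            edges.append((new, t))
            endpoints += [new, t]
    return edges


edges = barabasi_albert_edges(n=5000, m=3)
degree = Counter(v for edge in edges for v in edge)
mean_degree = sum(degree.values()) / len(degree)
print("mean degree:", mean_degree)
print("largest degree:", max(degree.values()))
```

With these settings the mean degree is close to 2m, while the largest degree is typically many times larger, which is the hub formation described above.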

However, the growth of the networks (adding new nodes) is not a necessary condition for creating a scale-free network (see Dangalchev[30]). One possibility (Caldarelli et al. 2002) is to consider the structure as static and draw a link between vertices according to a particular property of the two vertices involved. Once the statistical distribution for these vertex properties (fitnesses) is specified, it turns out that in some circumstances static networks also develop scale-free properties.

Generalized scale-free model

There has been a burst of activity in the modeling of scale-free complex networks. The recipe of Barabási and Albert[31] has been followed by several variations and generalizations[32][33][34][35][27] and the revamping of previous mathematical works.[36]

In today's terms, if a complex network has a power-law distribution of any of its metrics, it's generally considered a scale-free network. Similarly, any model with this feature is called a scale-free model.[17]

Features

Many real networks are (approximately) scale-free and hence require scale-free models to describe them. In Price's scheme, there are two ingredients needed to build up a scale-free model:

1. Adding or removing nodes. Usually we concentrate on growing the network, i.e. adding nodes.

2. Preferential attachment: The probability $\Pi$ that new nodes will be connected to the "old" node.

Note that some models (see Dangalchev[30] and Fitness model below) can work also statically, without changing the number of nodes. It should also be kept in mind that the fact that "preferential attachment" models give rise to scale-free networks does not prove that this is the mechanism underlying the evolution of real-world scale-free networks, as there might exist different mechanisms at work in real-world systems that nevertheless give rise to scaling.

Examples

There have been several attempts to generate scale-free network properties. Here are some examples:

The Barabási–Albert model

The Barabási–Albert model, an undirected version of Price's model, has a linear preferential attachment $\Pi(k_i) = \frac{k_i}{\sum_j k_j}$ and adds one new node at every time step.

(Note, another general feature of $\Pi(k)$ in real networks is that $\Pi(0) \neq 0$, i.e. there is a nonzero probability that a new node attaches to an isolated node. Thus in general $\Pi(k)$ has the form $\Pi(k) = A + k^{\alpha}$, where $A$ is the initial attractiveness of the node.)

Two-level network model

Dangalchev (see [30]) builds a 2-L model by considering the importance of each of the neighbours of a target node in preferential attachment. The attractiveness of a node in the 2-L model depends not only on the number of nodes linked to it but also on the number of links in each of these nodes:

$\Pi(k_i) = \frac{k_i + C \sum_{(i,j)} k_j}{\sum_j k_j + C \sum_j k_j^2},$

where the sum over $(i,j)$ runs over the neighbours $j$ of node $i$ and C is a coefficient between 0 and 1.
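A small numerical check helps to see how the second-level term changes attractiveness. The sketch below assumes the attachment probability has the form (k_i + C · sum of neighbour degrees) / (sum of all degrees + C · sum of squared degrees); the 5-node graph and the function name `two_level_attachment` are hypothetical.

```python
def two_level_attachment(adj, c):
    """Attachment probability of each node under the assumed 2-L rule."""
    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    # Normalization: each node j contributes k_j once per neighbour, i.e. k_j^2.
    norm = sum(degree.values()) + c * sum(k * k for k in degree.values())
    return {
        v: (degree[v] + c * sum(degree[u] for u in adj[v])) / norm
        for v in adj
    }


# Hypothetical graph: a star centred on node 0, plus an extra edge 1-2.
adj = {0: {1, 2, 3, 4}, 1: {0, 2}, 2: {0, 1}, 3: {0}, 4: {0}}

probs = two_level_attachment(adj, c=0.5)
plain = two_level_attachment(adj, c=0.0)  # reduces to k_i / sum_j k_j
print(probs[0], probs[3])  # 7/23 and 3/23
print(plain[0], plain[3])  # 4/10 and 1/10
```

In this example a leaf attached to the hub (node 3) becomes more attractive once neighbour degrees are counted (3/23 instead of 1/10), while the probabilities still sum to one.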

A variant of the 2-L model, the k2 model, where first and second neighbour nodes contribute equally to a target node's attractiveness, demonstrates the emergence of transient scale-free networks.[8] In the k2 model, the degree distribution appears approximately scale-free as long as the network is relatively small, but significant deviations from the scale-free regime emerge as the network grows larger. This results in the relative attractiveness of nodes with different degrees changing over time, a feature also observed in real networks.

Mediation-driven attachment (MDA) model

In the mediation-driven attachment (MDA) model, a new node coming with $m$ edges picks an existing connected node at random and then connects itself, not with that one, but with $m$ of its neighbors, also chosen at random. The probability $\Pi(i)$ that the node $i$ of the existing network is picked is

$\Pi(i) = \frac{k_i}{N} \frac{\sum_{j=1}^{k_i} \frac{1}{k_j}}{k_i}.$

The factor $\frac{\sum_{j=1}^{k_i} \frac{1}{k_j}}{k_i}$ is the inverse of the harmonic mean (IHM) of the degrees of the $k_i$ neighbors of node $i$. Extensive numerical investigation suggests that for approximately $m > 14$ the mean IHM value in the large-$N$ limit becomes a constant, which means $\Pi(i) \propto k_i$. It implies that the more links (the higher the degree) a node has, the higher its chance of gaining more links, since it can be reached in a larger number of ways through mediators. This essentially embodies the intuitive idea of the rich-get-richer mechanism (or the preferential attachment rule of the Barabási–Albert model). Therefore, the MDA network can be seen to follow the PA rule but in disguise.[37]

However, for $m = 1$ it describes the winner-takes-all mechanism, as almost 99% of the total nodes have degree one and one node is super-rich in degree. As $m$ increases, the disparity between the super-rich and the poor decreases, and for $m > 14$ there is a transition from a rich-get-super-richer to a rich-get-richer mechanism.
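The attachment rule can be simulated directly from its verbal description: pick a mediator uniformly at random, then connect to randomly chosen neighbors of that mediator. The following is a minimal sketch; the seed graph (a complete graph on m + 1 nodes, so that every mediator has at least m neighbors), the function name `mda_network`, and the parameter values are assumptions.

```python
import random


def mda_network(n, m, seed=0):
    """Grow a network by mediation-driven attachment; return adjacency lists."""
    rng = random.Random(seed)
    # Assumed seed: complete graph on m + 1 nodes.
    adj = {i: [j for j in range(m + 1) if j != i] for i in range(m + 1)}
    for new in range(m + 1, n):
        mediator = rng.randrange(new)           # uniform over existing nodes
        targets = rng.sample(adj[mediator], m)  # m distinct neighbours of it
        adj[new] = []
        for t in targets:
            adj[new].append(t)
            adj[t].append(new)
    return adj


adj = mda_network(n=2000, m=1)
degrees = [len(nbrs) for nbrs in adj.values()]
leaf_fraction = sum(1 for k in degrees if k == 1) / len(degrees)
print("fraction of degree-one nodes:", leaf_fraction)
print("largest degree:", max(degrees))
```

For m = 1 a run of this kind typically shows a large majority of degree-one nodes and one node holding a large share of the links; the exact proportions depend on the seed and the network size.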

Non-linear preferential attachment

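The Barabási–Albert model assumes that the probability $\Pi(k)$ that a node attaches to node $i$ is proportional to the degree $k$ of node $i$. This assumption involves two hypotheses: first, that $\Pi(k)$ depends on $k$, in contrast to random graphs in which $\Pi(k) = p$, and second, that the functional form of $\Pi(k)$ is linear in $k$.

In non-linear preferential attachment, the form of $\Pi(k)$ is not linear, and recent studies have demonstrated that the degree distribution depends strongly on the shape of the function $\Pi(k)$.

Krapivsky, Redner, and Leyvraz demonstrate that the scale-free nature of the network is destroyed for nonlinear preferential attachment. The only case in which the topology of the network is scale-free is that in which the preferential attachment is asymptotically linear, i.e. $\Pi(k_i) \sim a_\infty k_i$ as $k_i \to \infty$. In this case the rate equation leads to

$P(k) \sim k^{-\gamma}$ with $\gamma = 1 + \frac{\mu}{a_\infty}.$

This way the exponent of the degree distribution can be tuned to any value between 2 and $\infty$.[clarification needed]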

Hierarchical network model

Hierarchical network models are, by design, scale free and have high clustering of nodes.[38]

The iterative construction leads to a hierarchical network. Starting from a fully connected cluster of five nodes, we create four identical replicas, connecting the peripheral nodes of each cluster to the central node of the original cluster. From this, we get a network of 25 nodes (N = 25). Repeating the same process, we can create four more replicas of the original cluster – the four peripheral nodes of each one connect to the central node of the nodes created in the first step. This gives N = 125, and the process can continue indefinitely.
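The replication step can be written as a short loop. The sketch below is one reading of the construction described above, under these assumptions: node 0 is the central node of the original cluster, nodes 1 to 4 are its peripheral nodes, and at every iteration the peripheral nodes of the four new replicas are wired to node 0. The function name `hierarchical_network` is hypothetical.

```python
def hierarchical_network(iterations):
    """Return (number of nodes, edge set) after the given number of replications."""
    # Fully connected cluster of five nodes; node 0 is taken as the centre.
    edges = {(i, j) for i in range(5) for j in range(i + 1, 5)}
    n = 5
    peripheral = [1, 2, 3, 4]
    for _ in range(iterations):
        new_edges = set(edges)
        new_peripheral = []
        for replica in range(1, 5):  # four identical replicas
            offset = replica * n
            new_edges.update((u + offset, v + offset) for u, v in edges)
            for p in peripheral:
                # Peripheral nodes of each replica link to the original centre.
                new_edges.add((0, p + offset))
                new_peripheral.append(p + offset)
        edges = new_edges
        peripheral = new_peripheral
        n *= 5
    return n, edges


n1, edges1 = hierarchical_network(1)
n2, edges2 = hierarchical_network(2)
print(n1, len(edges1))  # 25 66
print(n2, len(edges2))  # 125 394
```

The node counts 25 and 125 match the values of N quoted above; under the stated assumptions the central node's degree grows from 4 to 20 to 84 over the first two iterations.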

Fitness model

The idea is that the link between two vertices is not assigned randomly with a probability p equal for all pairs of vertices. Rather, for every vertex j there is an intrinsic fitness $x_j$, and a link between vertices i and j is created with a probability $p(x_i, x_j)$.[39] In the case of the World Trade Web it is possible to reconstruct all the properties by using the GDP of each country as its fitness, and taking

$p(x_i, x_j) = \frac{\delta x_i x_j}{1 + \delta x_i x_j}.$[40]
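A static fitness network of this kind is straightforward to sample. The sketch below assumes the link probability δ·x_i·x_j / (1 + δ·x_i·x_j); the Pareto-distributed fitness values stand in for real GDP figures and, like the value of δ and the function names, are purely illustrative.

```python
import random


def link_probability(x_i, x_j, delta):
    """Assumed link probability: delta*x_i*x_j / (1 + delta*x_i*x_j)."""
    z = delta * x_i * x_j
    return z / (1.0 + z)


def fitness_network(fitness, delta, seed=0):
    """Sample a static network: each pair is linked independently."""
    rng = random.Random(seed)
    n = len(fitness)
    return [
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if rng.random() < link_probability(fitness[i], fitness[j], delta)
    ]


fitness_rng = random.Random(1)
# Hypothetical heavy-tailed fitness values (placeholders for GDP data).
fitness = [fitness_rng.paretovariate(1.5) for _ in range(300)]
edges = fitness_network(fitness, delta=0.05)
print(len(edges), "links among", len(fitness), "vertices")
```

No node is ever added: the heterogeneity of the resulting degrees comes entirely from the distribution of the fitness values.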

Hyperbolic geometric graphs

Assuming that a network has an underlying hyperbolic geometry, one can use the framework of spatial networks to generate scale-free degree distributions. This heterogeneous degree distribution then simply reflects the negative curvature and metric properties of the underlying hyperbolic geometry.[41]

Edge dual transformation to generate scale free graphs with desired properties

Starting with scale free graphs with low degree correlation and clustering coefficient, one can generate new graphs with much higher degree correlations and clustering coefficients by applying edge-dual transformation.[19]

Uniform-preferential-attachment model (UPA model)

The UPA model is a variant of the preferential attachment model (proposed by Pachon et al.) which takes into account two different attachment rules: a preferential attachment mechanism (with probability 1−p) that stresses the rich-get-richer system, and a uniform choice (with probability p) for the most recent nodes. This modification is useful for studying the robustness of the scale-free behavior of the degree distribution. It is proved analytically that the asymptotically power-law degree distribution is preserved.[27]

Scale-free ideal networks

In the context of network theory, a scale-free ideal network is a random network with a degree distribution following the scale-free ideal gas density distribution. These networks are able to reproduce city-size distributions and electoral results by unraveling the size distribution of social groups with information theory on complex networks when a competitive cluster growth process is applied to the network.[42][43] In models of scale-free ideal networks it is possible to demonstrate that Dunbar's number is the cause of the phenomenon known as the "six degrees of separation".

Novel characteristics

For a scale-free network with $n$ nodes and power-law exponent $\gamma > 3$, the induced subgraph constructed by vertices with degrees larger than $\log n \times \log^{*} n$ is a scale-free network with $\gamma' = 2$, almost surely.[44]

Estimating the power law exponent

Estimating the power-law exponent of a scale-free network is typically done by using maximum likelihood estimation with the degrees of a few uniformly sampled nodes.[3] However, since uniform sampling does not obtain enough samples from the important heavy tail of the power-law degree distribution, this method can yield a large bias and a large variance. It has recently been proposed to sample random friends (i.e., random ends of random links), who are more likely to come from the tail of the degree distribution as a result of the friendship paradox.[45][46] Theoretically, maximum likelihood estimation with random friends leads to a smaller bias and a smaller variance compared to the classical approach based on uniform sampling.[46]
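For orientation, the classical estimator based on uniformly sampled degrees can be sketched as follows. This sketch uses the continuous approximation, in which the maximum likelihood estimate is 1 + n / Σ ln(k_i / k_min); the synthetic samples, the cutoff `k_min`, and the function names are assumptions, and the friend-sampling variant discussed above is not implemented.

```python
import math
import random


def sample_power_law(n, gamma, k_min, rng):
    """Inverse-transform samples from a continuous density ~ k^(-gamma), k >= k_min."""
    return [
        k_min * (1.0 - rng.random()) ** (-1.0 / (gamma - 1.0))
        for _ in range(n)
    ]


def mle_exponent(samples, k_min):
    """Maximum likelihood estimate of gamma (continuous approximation)."""
    tail = [k for k in samples if k >= k_min]
    return 1.0 + len(tail) / sum(math.log(k / k_min) for k in tail)


rng = random.Random(42)
degrees = sample_power_law(20000, gamma=2.5, k_min=1.0, rng=rng)
gamma_hat = mle_exponent(degrees, k_min=1.0)
print("estimated exponent:", gamma_hat)
```

With many samples the estimate lands close to the exponent used to generate the data; with only a few uniformly sampled nodes the spread is much larger, which is the weakness that friend sampling aims to address.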


References

  1. PMID 17456605.
  2. .
  3. .
  4. .
  5. .
  6. .
  7. .
  8. .
  9. .
  10. .
  11. .
  12. .
  13. .
  14. Willinger, Walter; David Alderson; John C. Doyle (May 2009). "Mathematics and the Internet: A Source of Enormous Confusion and Great Potential" (PDF). Notices of the AMS. 56 (5). American Mathematical Society: 586–599. Archived (PDF) from the original on 2011-05-15. Retrieved 2011-02-03.
  15. .
  16. .
  17. .
  18. .
  19. .
  20. .
  21. .
  22. .
  23. .
  24. .
  25. .
  26. Hassan, M. K.; Hassan, M. Z.; Pavel, N. I. (2010). "Scale-free coordination number disorder and multifractal size disorder in weighted planar stochastic lattice". J. Phys.: Conf. Ser. 297: 01.
  27. S2CID 119320331.
  28. (PDF) from the original on 2016-03-03. Retrieved 2016-02-10.
  29. .
  30. .
  31. Barabási, A.-L. and R. Albert, Science 286, 509 (1999).
  32. R. Albert and A.-L. Barabási, Phys. Rev. Lett. 85, 5234 (2000).
  33. S. N. Dorogovtsev, J. F. F. Mendes, and A. N. Samukhin, cond-mat/0011115.
  34. P. L. Krapivsky, S. Redner, and F. Leyvraz, Phys. Rev. Lett. 85, 4629 (2000).
  35. B. Tadic, Physica A 293, 273 (2001).
  36. S. Bornholdt and H. Ebel, cond-mat/0008465; H. A. Simon, Biometrika 42, 425 (1955).
  37. S2CID 51976352.
  38. .
  39. .
  40. .
  41. .
  42. Submitted to European Physical Journal B.
  43. André A. Moreira; Demétrius R. Paula; Raimundo N. Costa Filho; José S. Andrade, Jr. (2006). "Competitive cluster growth in complex networks". Physical Review E. 73 (6): 065101. S2CID 45651735.
  44. .
  45. .
  46. .
