Characterization of Complex Networks
Characterization of Complex Networks
Advances in Physics,
Vol. 56, No. 1, February 2007, 167242
Contents
1. Introduction
2. Basic concepts
3. Complex network models
3.1. The random graph of Erdo00 s and Renyi
3.2. The small-world model of Watts and Strogatz
3.3. Generalized random graphs
3.4. Scale-free networks of Barabasi and Albert
3.5. Networks with community structure
3.6. Geographical models
4. Measurements related to distance
4.1. Average distance
page
169
173
175
176
176
178
179
180
181
182
182
168
L. da F. Costa et al.
4.2. Vulnerability
5. Clustering and cycles
5.1. Clustering coefficients
5.2. Cyclic coefficient
5.3. Structure of loops
5.4. Rich-Club coefficient
6. Degree distribution and correlations
7. Networks with different vertex types
7.1. Assortativity
7.2. Bipartivity degree
8. Entropy and energy
8.1. Entropy of the degree distribution
8.2. Search information, target entropy and road entropy
8.3. Energy of complex networks
9. Centrality measurements
10. Spectral measurements
11. Community identification and measurements
11.1. Spectral methods
11.2. Divisive methods
11.2.1. Betweenness centrality
11.2.2. Edge clustering coefficient
11.3. Agglomerative methods
11.3.1. Similarity measurements
11.4. Maximization of the modularity
11.4.1. Extremal optimization
11.5. Local methods
11.6. Method selection
11.7. Roles of vertices
12. Subgraphs
12.1. Network motifs
12.2. Subgraphs and motifs in weighted networks
12.3. Subgraph centrality
13. Hierarchical measurements
14. Fractal dimension
15. Other measurements
15.1. Network complexity
15.2. Edge reciprocity
15.3. Matching index
16. Measurements of network dynamics and perturbation
16.1. Trajectories
16.1.1. Average clustering coefficient and average shortest path length
16.1.2. Average clustering coefficient and average hierarchical clustering
coefficient of second level
16.1.3. Pearson correlation coefficient and central point dominance
16.1.4. Average hierarchical degree of second level and average hierarchical
divergence ratio of third level
16.1.5. Discussion
16.2. Perturbation analysis
17. Correlation analysis
18. Multivariate statistical methods for dimensionality reduction and
measurement selection
183
184
184
186
186
187
188
190
190
190
191
191
192
193
194
195
195
197
198
198
198
199
200
200
201
201
202
202
203
204
205
206
207
209
210
210
211
211
212
212
212
214
214
215
215
216
217
218
169
221
222
224
227
235
236
236
1. Introduction
Complex networks research can be conceptualized as lying at the intersection
between graph theory and statistical mechanics, which endows it with a truly multidisciplinary nature. While its origin can be traced back to the pioneering works on
00
percolation and random graphs by Flory [1], Rapoport [24], and Erdos and Renyi
[57], research in complex networks became a focus of attention only recently. The
main reason for this was the discovery that real networks have characteristics which
are not explained by uniformly random connectivity. Instead, networks derived from
real data may involve community structure, power law degree distributions and
hubs, among other structural features. Three particular developments have
contributed to the ongoing related advances: Watts and Strogatzs investigation of
small-world networks [8], Barabasi and Alberts characterization of scale-free models
[9], and Girvan and Newmans identication of the community structures present
in many networks (e.g. [10]).
Although graph theory is a well-established and developed area in mathematics
and theoretical computer science (e.g., [11, 12]), many of the recent developments in
complex networks have taken place in areas such as sociology (e.g., [13, 14]), biology
(e.g., [15]) and physics (e.g., [16, 17]). Current interest has focused not only on
applying the developed concepts to many real data and situations, but also on studying the dynamic evolution of network topology. Supported by the availability of high
performance computers and large data collections, results like the discovery of the
scale-free structure of the Internet [18] and of the WWW [19, 20] were of major
importance for the increased interest in the new area of complex networks, whose
growing relevance has been substantiated by the large number of recent related
publications. Reviews of such developments have been presented in four excellent
surveys [2124]; introductory papers [17, 2527]; several proceedings [2833]; edited
books [16, 34]; and books related to random graphs [11, 35, 36], complex networks
theory [37], scientic dissemination [3844], social networks [4554], economic systems and political networks [5558], and WWW and Internet [5962]. For additional
information about the related areas of percolation, disordered systems and fractals
see [6365]; for complex systems see [66, 67].
One of the main reasons behind complex networks popularity is their exibility
and generality for representing virtually any natural structure, including those undergoing dynamic changes of topology. As a matter of fact, every discrete structure such
as lists, trees, or even lattices, can be suitably represented as special cases of graphs.
It is thus of little surprise that several investigations into complex networks involve
the representation of the structure of interest as a network, followed by an analysis of
170
L. da F. Costa et al.
1
2
Characterization
=
Representation
M
Figure 1. The mapping from a complex network into a feature vector. Generic mappings can
be used in order to obtain the characterization of the network in terms of a suitable set of
measurements. In case the mapping is invertible, we have a complete representation of the
original structure.
171
1
2
3
M
T
T1
T2
T= T3
Characterization
Representation
TM
172
L. da F. Costa et al.
t=1
t=2
2
t=4
t=3
t=3
t=4
t=2
t=1
1
(a)
(b)
Figure 3. Given a network undergoing some dynamic evolution (a) and a set of measurements (e.g., 1 and 2), trajectories can be dened in the features space (b).
Threshold
Weighted
digraph
Unweighted
digraph
Symmetry
173
Symmetry
Weighted
graph
Unweighted
graph
Threshold
Figure 4. The four main types of complex networks and their transformations. All network
types can be derived from the weighted digraph through appropriate transformations.
2. Basic concepts
Figure 4 shows the four main types of complex networks, which include weighted
digraphs (directed graphs), unweighted digraphs, weighted graphs and unweighted
graphs. The operation of symmetry can be used to transform a digraph into a graph,
and the operation of thresholding can be applied to transform a weighted graph into
its unweighted counterpart. These types of graphs and operations are dened more
formally in the following, starting from the concept of weighted digraph, from which
all the other three types can be derived.
A weighted directed graph, G, is dened by a set N G of N vertices (or nodes), a set
EG of M edges (or links), and a mapping !: EG 7 ! R. Each vertex can be identied
by an integer value i  1, 2, . . . , N; and each edge can be identied by a pair (i, j)
that represents a connection going from vertex i to vertex j to which a weight !(i, j)
is associated. In the complex network literature, it is often assumed that no
self-connections or multiple connections exist; i.e. there are no edges of the form
(i, i) and for each pair of edges (i1, j1) and (i2, j2) it holds that i1 6 i2 or j1 6 j2 Graphs
with self- or duplicate connections are sometimes called multigraphs, or degenerate
graphs. Only non-degenerate graphs are considered henceforth. In an unweighted
digraph, the edges have no weight, and the mapping ! is not needed. For undirected
graphs (weighted or unweighted), the edges have no directions; the presence of a edge
(i, j) in EG thus means that a connection exist from i to j and from j to i.
A weighted digraph can be completely represented in terms of its weight matrix W,
so that each element wij  !(i, j) expresses the weight of the connection from vertex i
to vertex j. The operation of thresholding can be applied to a weighted digraph to
produce an unweighted counterpart. This operation, henceforth represented as
T(W), is applied to each element of the matrix W, yielding the matrix A  T (W).
The elements of the matrix A are computed comparing the corresponding elements
of W with a specied threshold T; in case |wij| > T we have aij  1, otherwise aij  0.
The resulting matrix A can be understood as the adjacency matrix of the unweighted
digraph obtained as a result of the thresholding operation. Any weighted digraph can
be transformed into a graph by using the symmetry operation (W)  W  WT,
where WT is the transpose of W.
For undirected graphs, two vertices i and j are said to be adjacent or neighbors if
aij 6 0. For directed graphs, the corresponding concepts are those of predecessor and
174
L. da F. Costa et al.
The average degree of a network is the average of ki for all vertices in the network,
1X
1X
ki 
a :
2
hki 
N i
N ij ij
In the case of directed networks, there are two kinds of degrees: the out-degree,
,
equal to the number of outgoing edges (i.e. the cardinality of the set of succeskout
i
sors), and the in-degree, kin
i , corresponding to the number of incoming edges (i.e. the
cardinality of the set of predecessors),
X
aij ,
3
kout
i
kin
i 
j
X
aji :
j
out
Note that in this case the total degree is dened as ki  kin
i  ki . The average
in- and out-degrees are the same (the network is supposed isolated)
1X
a :
6
hkout i  hkin i 
N ij ij
For weighted networks, the denitions of degree given above can be used, but a
quantity called strength of i, si, dened as the sum of the weights of the corresponding edges, is more generally used [71]:
X
wij ,
7
sout
i
sin
i 
j
X
wji :
In the general case, two vertices of a complex network are not adjacent. In fact,
most of the networks of interest are sparse, in the sense that only a small fraction of
all possible edges are present. Nevertheless, two non-adjacent vertices i and j can be
connected through a sequence of m edges (i, k1), (k1, k2), . . . , (km1, j); such set of
edges is called a walk between i and j, and m is the length of the walk. We say that
two vertices are connected if there is at least one walk connecting them. A loop or
cycle is dened as a walk starting and terminating in the same vertex i and passing
only once through each vertex kn. In case all the vertices and edges along a walk are
distinct, the walk is a path. Many measurements are based on the length of such
connecting paths (see section 4).
Symbol
N G
EG
jX j
N
M
W
wij
A
aij
ki
kout
i
kin
i
si
sout
i
sin
i
i
kXk
175
In undirected graphs, if vertices i and j are connected and vertices j and k are
connected, then i and k are also connected. This property can be used to partition the
vertices of a graph in non-overlapping subsets of connected vertices. These subsets
are called connected components or clusters.
If a network has too few edges, i.e. the average connectivity of its vertices hki is too
small, there will be many isolated vertices and clusters with a small number of
vertices. As more edges are added to the network, the small clusters are connected
to larger clusters; after some critical value of the connectivity, most of the vertices are
connected into a giant cluster, characterizing the percolation [63] of the network. For
00
the Erdos-Renyi graph (see section 3.1) in the limit N ! 1 this happens at hki  1
[35]. Of special interest is the distribution of sizes of the clusters in the percolation
point and the fraction of vertices in the giant cluster. The critical density of edges (as
well as average and standard deviation) needed to achieve percolation can be used to
characterize network models or experimental phenomena. Table 1 lists the basic
symbols used in the paper.
176
L. da F. Costa et al.
0.10
0.08
P(k)
0.06
0.04
0.02
0.00
1850
1900
1950
2000
2050
2100
2150
k
(a)
(b)
00
Figure 5. The random graph of Erdos and Renyi: (a) an example and (b) average degree
distribution over 10 random networks formed by 10,000 vertices using a probability p  0.2.
various models is not intended here. Instead, the next subsections present some
models used in the discussion on network measurements (sections 16, 17, 18 and 19).
3.1. The random graph of Erdo00 s and Renyi
00
The random graph developed by Rapoport [24] and independently by Erdos and
Renyi [57] can be considered the most basic model of complex networks. In their
00
1959 paper [5], Erdos and Renyi introduced a model to generate random graphs
consisting of N vertices and M edges. Starting with N disconnected vertices, the
network is constructed by the addition of M edges at random, avoiding multiple
and self connections. Another similar model denes N vertices and a probability p of
00
connecting each pair of vertices. The latter model is widely known as Erdos-Renyi
(ER) model. Figure 5(a) shows an example of this type of network.
For the ER model, in the large network size limit N ! 1, the average number of
connections of each vertex hki, given by
hki  pN  1,
00
ln N
lnhki
N, p  pT fNpT 
3  1
1  p3
22  1
log N
loglog N
C N0:75
hki 2m
hki  2?
Cp 
P(k) k3
Barabasi-Albert
Watts-Strogatz
mink,
X   
pki p
Pk 
e
1  pi pi
i
k    i!
i1
In WS networks, the value  represents the number of neighbors of each vertex in the initial regular network (in gure 6,   4).
*The function f(u)  constant if u  1 or f(u)  ln(u)/u if u  1.
Cp
Clustering coecient
ehki hkik
k!
hki pN 1
Pk
Erdos-Renyi
Degree distribution
00
Analytical result of some basic measurements for the Erdos-Renyi, Watts-Strogatz and Barabasi-Albert network models.
Measurement
Table 2.
178
L. da F. Costa et al.
found that two US citizens chosen at random were connected by an average of six
acquaintances.
Another property of many networks is the presence of a large number of loops of
size three, i.e. if vertex i is connected to vertices j and k, there is a high probability
of vertices j and k being connected (the clustering coecient, section 5, is high);
for example, in a friendship network, if B and C are friends of A, there is a high
probability that B and C are also friends. ER networks have the small world property but a small average clustering coecient; on the other hand, regular networks
with the second property are easy to construct, but they have large average distances.
The most popular model of random networks with small world characteristics and
an abundance of short loops was developed by Watts and Strogatz [8] and is called
the Watts-Strogatz (WS) small-world model. They showed that small-world networks
are common in a variety of realms ranging from the C. elegans neuronal system to
power grids. This model is situated between an ordered nite lattice and a random
graph presenting the small world property and high clustering coecient.
To construct a small-word network, one starts with a regular lattice of N vertices
(gure 6) in which each vertex is connected to  nearest neighbors in each direction,
totalizing 2 connections, where N    logN  1. Next, each edge is randomly
rewired with probability p. When p  0 we have an ordered lattice with high number
of loops but large distances and when p ! 1, the network becomes a random graph
with short distances but few loops. Watts and Strogatz have shown that, in an
intermediate regime, both short distances and a large number of loops are present.
Figure 7(a) shows an example of a Watts-Strogatz network. Alternative procedures
to generate small-world networks based on addition of edges instead of rewiring
have been proposed [73, 74], but are not discussed here.
The degree distribution for small-world networks is similar to that of random
networks, with a peak at hki  2 (see also table 2 and gure 7(b)).
3.3. Generalized random graphs
A common way to study real networks is to compare their characteristics with the
values expected for similar random networks. As the degrees of the vertices are
179
P(k)
0.08
0.06
0.04
0.02
0.00
35
40
45
50
55
60
65
(a)
(b)
Figure 7. The small-world model of Watts and Strogatz: (a) an example of a network with
N  64 vertices,   2, p  0.1, and (b) average degree distribution over 10 WS networks
with 10,000 vertices,   25 and p  0.3.
180
L. da F. Costa et al.
100
P(k)
101
102
103
104
1
10
(a)
100
1000
k
(b)
Figure 8. The scale-free network of Barabasi and Albert. (a) an example and (b) average
degree distribution over 10 Barabasi-Albert networks formed by 10,000 vertices using m  5.
The degree distribution follows a power law, in contrast to that presented in gure 5.
while others have few connections, with the absence of a characteristic degree. More
specically, the degree distribution has been found to follow a power law for large k,
Pk  k
10
ku
11
Thus, the most connected vertices have greater a probability of receiving new
vertices. This is known as the rich get richer paradigm.
Figure 8(a) shows an example of a Barabasi-Albert network.
3.5. Networks with community structure
Some real networks, such as social and biological networks, present modular
structure [10]. These networks are formed by sets or communities of vertices such
that most connections are found between vertices inside the same community, while
connections between vertices of dierent communities are less common. A model
to generate networks with this property was proposed by Girvan and Newman [10].
181
(a)
(b)
182
L. da F. Costa et al.
space  and link them with a given probability which decays with the distance, for
instance
Pi ! j  esij ;
12
where sij is the geographical distance of the vertices and  xes the length scale of the
edges. This model generates a Poisson degree distribution as observed for random
graphs and can be used to model road networks (see gure 9(b)). Alternatively, the
network development might start with a few nodes while new nodes and connections
are added at each subsequent time step (spatial growth). Such a model is able
to generate a wide range of network topologies including small-world and linear
scale-free networks [93].
d :
13
NN  1 i6j ij
A problem with this denition is that it diverges if there are unconnected vertices
in the network. To circumvent this problem, only connected pairs of vertices are
included in the sum. This avoids the divergence, but introduces a distortion for
networks with many unconnected pairs of vertices, which will show a small value
of average distance, expected only for networks with a high number of connections.
183
Latora and Marchiori [94] proposed a closely related measurement that they called
global eciency:
X1
1
E
,
14
NN  1 i6j dij
where the sum takes all pairs of vertices into account. This measurement quanties
the eciency of the network in sending information between vertices, assuming
that the eciency for sending information between two vertices i and j is proportional to the reciprocal of their distance. The reciprocal of the global eciency is the
harmonic mean of the geodesic distances:
1
:
15
E
As equation (15) does not present the divergence problem of equation (13), it is
therefore a more appropriate measurement for graphs with more than one connected
component.
The determination of shortest distances in a network is only possible with global
information on the structure of the network. This information is not always available. When global information is unavailable, navigation in a network must happen
using limited, local information and a specic algorithm. The eective distance
between two vertices is thus generally larger than the shortest distance, and dependent on the algorithm used for navigation as well as network structure [95].
4.2. Vulnerability
In infrastructure networks (like WWW, the Internet, energy supply, etc), it is important to know which components (vertices or edges) are crucial to optimum functioning. Intuitively, the critical vertices of a network are their hubs (vertices with higher
degree), however there are situations in which they are not necessarily the most vital
for the performance of the system which the network underlies. For instance, all
vertices of a network in the form of a binary tree have equal degree, therefore there
is no hub, but disconnection of vertices closer to the root and the root itself have
a greater impact than of those near the leaves. This suggests that networks have a
hierarchical property, which means that the most crucial components are those in
higher positions in the hierarchy.
A way to nd critical components of a network is by looking for the most vulnerable vertices. If we associate the performance of a network with its global eciency,
equation (14), the vulnerability of a vertex can be dened as the drop in performance
when the vertex and all its edges are removed from the network [96]
Vi 
E  Ei
;
E
16
where E is the global eciency of the original network and Ei is the global eciency
after the removal of the vertex i and all its edges. As suggested by Goldshtein et al.
[96], the ordered distribution of vertices with respect to their vulnerability Vi is
related to the network hierarchy, thus the most vulnerable (critical) vertex occupies
the highest position in the network hierarchy.
184
L. da F. Costa et al.
17
00
A characteristic of the Erdos-Renyi model is that the local structure of the network
near a vertex tends to be a tree. More precisely, the probability of loops involving a
small number of vertices goes to 0 in the large network size limit. This is in marked
contrast with the profusion of short loops which appear in many real-world networks. Some measurements proposed to study the cyclic structure of networks and
the tendency to form sets of tightly connected vertices are described in the following.
3N
,
N3
18
k>j>i
20
k>j>i
where the aij are the elements of the adjacency matrix A and the sum is taken over
all triples of distinct vertices i, j, and k only one time.
The second denition of the clustering coecient of a given vertex i [8] is given by:
Ci 
N i
,
N3 i
21
185
where N(i) is the number of triangles involving vertex i and N3(i) is the number
of connected triples having i as the central vertex:
X
aij aik ajk ,
22
N i 
k>j
X
aij aik ,
23
N3 i 
k>j
If ki is the number of neighbors of vertex i, then N3(i)  ki(ki  1)/2. N(i) counts the
number of edges between neighbors of i. Representing the number of edges between
neighbors of i as li, equation (21) can be rewritten as:
Ci
2li
:
ki ki  1
24
186
L. da F. Costa et al.
network, with the exponent  being called its hierarchical exponent [100]. Soer and
Vazquez [101] found that this dependence of the clustering coecient with k is to
some extent due to the degree correlations (section 6) of the networks, with vertices
of high degree connecting with vertices of low degree. They suggested a new denition of clustering coecient without degree correlation bias:
l
C~ i  i ,
!i
30
X 1
2
a a ,
ki ki  1 k>j Sijk ij ik
31
where Sijk is the size of the smallest cycle which passes through vertices i, j and k.
Note that if vertices j and k are connected, the smallest cycle is a triangle and Sijk  3.
If there is no loop passing through i, j and k, then these vertices are treelike
connected and Sijk  1. The cyclic coecient of a network is the average of the
cyclic coecient of all its vertices:
	
1X
	:
N i i
32
187
C4 i  Pk Pk
i
i
j1
lj1
lj1
qi j, l
ai j, l qi j, l
33
where j and l label neighbors of node i, qi(j, l) are the number of common neighbors
between j and l. Also, ai(j, l)  (kj  
i(j, l))(kl  
i(j, l)) with 
i(j, l)  1  qi(j, l)  	jl
and 	jl  1 if neighbors j and l are connected with each other and 0 otherwise.
The estimated distribution of cycles can be used for network characterization
[106]. Rozenfeld et al. [104] proposed a loopiness exponent determined in terms of
the characteristic cycle length. Moreover, a way to compute cycles in networks is by
using the adjacency matrix A [107]. The number of loops of order 3 is given as,
1 X 3
N3 
A ii ,
34
6 i
and for orders 4 and 5,
"
#
X 2
X 2
1 X 4
2
A ii  2
A ii A ii 
A ii ,
N4 
8 i
i
i
35
"
#
X
X
1 X 5
2
3
3
N5 
A ii  5
A ii A ii  5
A ii :
10 i
i
i
36
Such loops have been studied in the context of autonomous systems by Bianconi
et al. [108], who showed that the statistical distribution of loops of order 3, 4 and 5
remains stable during the network evolution. Other works have addressed the statistical estimation of loops in complex networks, including the work by Bianconi and
Marsili, who studied the size of loops and Hamiltonian cycles in random scale-free
networks [109], and the impact of degree correlations in loops present in scale-free
networks [110].
5.4. Rich-Club coecient
In science, inuential researchers of some areas tend to form collaborative groups
and publish papers together [111]. This tendency is observed in other real networks
and reect the tendency of hubs to be well connected with each other. This phenomenon, known as rich-club, can be measured by the rich-club coecient, introduced by
Zhou and Mondragon [112]. The rich-club of degree k of a network G is the set
of vertices with degree greater than k, Rk  fv 2 N Gjkv > kg. The rich-club
coecient of degree k is given by
X
1
k 
a
37
jRkjjRkj  1 i, j2Rk ij
(the sum corresponds to two times the number of edges between vertices in the club).
This measurement is similar to that dened before for the clustering coecient
(see equation (24)), giving the fraction of existing connections among vertices with
degree higher than k.
188
L. da F. Costa et al.
k2
:
hkiN
38
40
hkiPk, k0 
:
kPk
41
P
0
0
0
Notice that
k0 Pk jk  1. For undirected networks, P(k, k )  P(k , k) and
0
0
0
0
k P(k|k ) P(k )  kP(k |k) P(k). For directed networks, k is the degree at the tail of
189
the edge, k0 is the degree at the head, both k and k0 may be in-, out-, or total degrees,
and in general P(k, k0 ) 6 P(k0 , k). For weighted networks the strength s can be used
instead of k.
P(k, k0 ) and P(k|k0 ) characterize formally the vertex degree correlations, but they
are dicult to evaluate experimentally, especially for fat-tailed distributions, as a
consequence of the nite network size and the resulting small sample of vertices with
high degree. This problem can be addressed by computing the average degree of the
nearest neighbors of vertices with a given degree k [116], which is given by
X
knn k 
k0 Pk0 jk:
42
k0
j>i
43
where M is the total number of edges. If r > 0 the network is assortative; if r<0, the
network is disassortative; for r  0 there is no correlation between vertex degrees.
Degree correlations can be used to characterize networks and to validate the
ability of network models to represent real network topologies. Newman [117]
computed the Pearson correlation coecient for some real and model networks
and discovered that, although the models reproduce specic topological features
such as the power law degree distribution or the small-world property, most of
00
them (e.g., the Erdo-Renyi and Barabasi-Albert models) fail to reproduce the
00
assortative mixing (r  0 for the Erdo-Renyi and Barabasi-Albert models).
Further, it was found that the assortativity depends on the type of network. While
social networks tend to be assortative, biological and technological networks are
often disassortative [24]. The latter property is undesirable for practical purposes,
because disassortative networks are known to be resilient to simple target attack, at
the least. So, for instance, in disease propagation, social networks would ideally be
vulnerable (i.e. the network is dismantled into connected components, isolating
the focus of disease) and technological and biological networks should be resilient
against attacks. The degree correlations are related to the network evolution process
and, therefore, should be taken into account in the development of new models as
done, for instance, in the papers by Catanzaro et al. [118] on social networks, Park
and Newman [119] on the Internet, and Berg et al. [120] on protein interaction
networks. Degree correlations also have strong inuence on dynamic processes
like instability [121], synchronization [122, 123] and spreading [115, 124, 125]. For
additional discussions about dynamic process as in networks see Ref. [24].
190
L. da F. Costa et al.
7.1. Assortativity
For networks with dierent types of vertices, a type mixing matrix E can be dened,
with elements est such that est is the number of edges connecting vertices of type s
to vertices of type t (or the total strength of the edges connecting the two vertices of
the given types, for weighted networks). It can be normalized as
E
E^ 
,
44
kEk
where kXk (cardinality) represents the sum of all elements of matrix X.
The probability of a vertex of type s having a neighbor of type t therefore is
e^
Ptype tjs  P st :
u e^su
45
P
Note that t Ptype tjs  1:
(type)
P
(t|s) and E^ can be used to quantify the tendency in the network of vertices of
some type to connect to vertices of the same type, called assortativity. We can dene
an assortativity coecient [23, 126] as:
P type
P
sjs  1
~
,
46
Q s
NT  1
where NT is the number of dierent vertex types in the network. It can be seen that
~  1, where Q
~  1 for a perfectly assortative network (only edges between
0Q
~  0 for random mixing. But there is a problem
vertices of the same type) and Q
~ regardless of
with this denition because each vertex type has the same weight in Q,
the number of vertices of that type. An alternative denition that avoids this problem
[127] is:
Tr E^  kE^ 2 k
Q
:
47
1  kE^ 2 k
It is interesting to associate the vertex type to its degree. The Pearson correlation
coecient of vertex degrees, equation (43), can be considered as an assortativity
coecient for this case.
7.2. Bipartivity degree
A special case of disassortativity is that of bipartite networks. A network is called
bipartite if its vertices can be separated into two sets such that edges exist only
191
between vertices of dierent sets. It is a known fact that a network is bipartite if and
only if it has no loops of odd length (e.g. [128]). Although some networks are
bipartite by construction, others, like a network of sexual contacts, are only approximately bipartite. A way to quantify how much a network is bipartite is therefore
needed. A possible measurement is based on the number of edges between vertices
of the same subset in the best possible division [128],
P
ij aij #i, # j
P
b1
,
48
ij aij
where #(i) maps a vertex i to its type and  is the Kronecker delta. The smallest value
of b for all possible divisions is the bipartivity of the network. The problem with this
measurement is that its computation is NP-complete, due to the necessity of evaluating b for the best possible division. A measurement that approximates b but is
computationally easier was proposed in [128], based on a process of marking the
minimum possible number of edges as responsible for the creation of loops of odd
length.
Another approach is based on the subgraph centrality [129] (section 12.3). The
subgraph centrality of the network, equation (91), is divided in part due to even
closed walks and part due to odd closed walks (a closed walk is a walk, possibly with
repetition of vertices, ending on the starting vertex). As odd closed walks are not
possible in bipartite networks, the fraction of the subgraph centrality of the network
due to even closed walks can be used as the bipartivity degree [129]:
PN
SCeven
j1 cosh j
 PN  ,
49
j
SC
j1 e
where SC is the subgraph centrality of the network (section 12.3), SCeven is the
subgraph centrality due to the even closed walks and the j are the eigenvalues
of the adjacency matrix of the network.
The maximum value of entropy is obtained for a uniform degree distribution and the
minimum value Hmin  0 is achieved whenever all vertices have the same degree [134].
192
L. da F. Costa et al.
Network entropy has been related to the robustness of networks, i.e. their resilience
to attacks [134], and the contribution of vertices to the network entropy is correlated
with lethality in protein interactions networks [135].
Sole and Valverde [136] suggested the use of the remaining degree distribution
to compute the entropy. The remaining degree of a vertex at one end of an edge is the
number of edges connected to that vertex not counting the original edge. The remaining degree distribution can be computed as
qk 
k  1Pk  1
:
hki
51
52
where the sum is taken over all shortest paths p(i, b) from i to b.
The average search information characterizes the ease or diculty of navigation in
a network and is given by [139]
1 X
S 2
Si, b:
55
N ib
This value depends on the structure of the network. As discussed by Rosvall et al.
[139], city networks are more dicult to navigate than their random counterparts.
In order to measure how dicult it is to locate vertices in the network starting
from a given vertex i, the access information is used,
1X
Ai 
Si, b,
56
N b
193
which measures the average number of questions needed to locate another vertex
starting from i. To quantify how dicult is to nd the vertex b starting from the
other vertices in the network, the hide information is used,
1X
Hb 
Si, b:
57
N i
P
P
Note that the average value of Ai and Hb for a network is S : i Ai  b Hb  SN.
Considering the exchange of messages in the network, it is possible to dene
entropies in order to quantify the predictability of the message ow. Assuming
that messages always ow through shortest paths and all pairs of vertices
exchange the same number of messages at the same rate, the following entropies
can be dened [137]:
X
Ti  
aji cij log2 cij ,
58
ij
X
Ri  
aji bij log2 bij ,
59
ij
where aji is an element of the adjacency matrix, cij is the fraction of messages targeted
at vertex i that comes through vertex j, and bij is the fraction of messages that goes
through vertex i coming from vertex j. In addition, T i is the target entropy of vertex i
and Ri is the road entropy of vertex i. Low values of these entropies mean that the
vertex from where the next message originates (to vertex i or passing through vertex i)
can be easily predicted.
As a general measurement of the ows of messages, we can dene target and road
entropies for the network as averages among all vertices
1X
T
T i,
60
N i
1X
Ri :
61
R
N i
As shown in [137], these quantities are related to the organization of the network: a
network with a low value of T has a star structure and a low value of R means that
the network is composed by hubs connected in a string.
Further works related to searchability in networks have been reported by Trusina
et al. [140], who dened search information weighted by the trac on the network,
and Rosvall et al. [141], who studied networks with higher order organization like
modular or hierarchical structure.
8.3. Energy of complex networks
By using concepts of statistical mechanics [130], it is possible to dene the energy
associated with networks and the respective partition function. Based on such concepts, Bianconi [142] proposed a theoretical approach to describe the emergence of
scale-free degree distribution or nite-scale degree distribution in complex networks.
In particular, the energy associated to a degree distribution Nk is given as
EfNk g  logN G ,
62
194
L. da F. Costa et al.
k!Nk :
63
eS fNk g N Nk Q
64
where {Nk} is the number of ways in which it is possible to distribute 2L edges into
a degree sequence {k1, . . . , kN}. By using such concepts, Bianconi showed that the
optimal degree distribution with respect to the free energy minimization is obtained
for scale-free degree distribution [142].
9. Centrality measurements
In networks, the greater the number of paths in which a vertex or edge participates,
the higher the importance of this vertex or edge for the network. Thus, assuming that
the interactions follow the shortest paths between two vertices, it is possible to
quantify the importance of a vertex or a edge in terms of its betweenness centrality
[143] dened as:
Bu 
X i, u, j
,
i, j
ij
65
where (i, u, j) is the number of shortest paths between vertices i and j that pass
through vertex or edge u, (i, j) is the total number of shortest paths between i and j,
and the sum is over all pairs i, j of distinct vertices.
When one takes into account the fact that the shortest paths might not be known
and instead a search algorithm is used for navigation (see section 4.1), the betweenness of a vertex or edge must be dened in terms of the probability of it being visited
by the search algorithm. This generalization, which was introduced by Arenas et al.
[144], subsumes the betweenness centrality based on random walks as proposed
by Newman [145].
The central point dominance is dened as [143]
CPD 
1 X
Bmax  Bi ,
N1 i
66
where Bmax is the largest value of betweenness centrality in the network. The central
point dominance will be 0 for a complete graph and 1 for a star graph in which there
is a central vertex included in all paths. Other centrality measurements can be found
in the interesting survey by Koschutzki et al. [146].
195
The eigenvalues and associated eigenvectors of a network are related to the diameter, the number of cycles and connectivity properties of the network [147, 148].
The quantity Dl  NMl is the number of paths returning to the same vertex in the
graph passing through l edges. Note that these paths can contain already visited
vertices. In a tree-like graph, a return walk is only possible going back through the
already visited edges, the presence of odd moments is a sure sign of cycles in the
graph; in particular, as a walk can go through three edges and return to its starting
vertex only by following three dierent edges (if self-connections are not allowed),
D3 is related with the number of triangles in the network [148].
In addition, spectral analysis allows the determination that a network is bipartite
(if it does not contain any odd cycle [129], see section 7.2), characterizing models of
real networks [149, 150], and visualizing networks [151]. In addition, spectral analysis
of networks is important to determine communities and subgraphs, as discussed in
the next section.
11. Community identication and measurements
Many real networks present an inhomogeneous connecting structure characterized
by the presence of groups whose vertices are more densely interconnected to one
another than with the rest of the network. This modular structure has been found
in many kinds of networks such as social networks [152, 153], metabolic networks
[154] and in the worldwide ight transportation network [89]. Figure 10 presents
a network with a well-dened community structure.
Community identication in large networks is particularly useful because vertices
belonging to the same community are more likely to share properties and dynamics.
In addition, the number and characteristics of the existing communities provide
subsidies for identifying the category of a network as well as understanding its
dynamic evolution and organization. In the case of the World Wide Web, for
instance, pages related to the same subject are typically organized into communities,
so that the identication of these communities can help the task of seeking for
information. Similarly, in the case of the Internet, information about communities
formed by routers geographically close one another can be considered in order to
improve the ow of data.
196
L. da F. Costa et al.
Figure 10. A network with community structure represented by the dashed lines. The
communities are the groups of more intensely interconnected vertices.
197
The situation Q  1 identies networks formed by disconnected modules. This quantity has been used in many community-nding algorithms, as briey reviewed in
the following.
Though there are many ways to dene modularity, a generally accepted denition
of a module does not exist [160]. The denitions described above estimate the modularity in terms of a given partitioning. Ziv et al. [161] proposed the modularity to be
dened in terms of information entropy (see section 8). This algorithm, which has
been called the Network Information Bottleneck, tends to allow performance better
than the algorithm based on betweenness centrality of Girvan and Newman [159].
It should be noted that this review of community nding methods focused on the
subject of how specic network measurements have been adopted to identify the
communities. Since we do not attempt to provide a comprehensive study of this
important subject, the interested reader should refer to recent papers by Newman
[162] and Danon et al. [163] for further information and a more complete review on
community nding methods. The following discussion has been organized into
subsections according to the nature of the adopted methodology.
11.1. Spectral methods
Spectral methods are based on the analysis of the eigenvectors of matrices derived
from the networks [164]. These methods have been discussed in a recent survey by
Newman [165]. The quantity measured corresponds to the eigenvalues of matrices
associated with the adjacency matrix. These matrices can be the Laplacian matrix
(also known as Kirchho matrix),
L  D  A,
70
71
P
where D is the diagonal matrix of vertex degrees with elements dii  j aij , dij  0
for i 6 j.
A particular method, called spectral bisection [165167], is based on the diagonalization of the Laplacian matrix. If the network is separated into c disconnected
components, L will be block diagonal and have c degenerated eigenvectors, all
corresponding to eigenvalue 0. However, if the separation is not clear, the diagonalization of L will produce one eigenvector with eigenvalues 0 and c  1 eigenvalues
slightly dierent from 0. The spectral bisection considers the case when c  2 and the
division of the network is obtained assigning positive components of the eigenvector
associated with the second eigenvalue (the positive eigenvalue most close to 0) to one
community and the negative ones to another community. Particularly, the second
eigenvalue, called algebraic connectivity, is a measurement of how good the division
is, with small values corresponding to better divisions. Although spectral bisection is
easy to implement, it tends to be a poor approach for detecting communities in real
networks [165]. There are many alternative methods based on spectral analysis [168],
to be found in refs [162, 163].
Recently, Newman [169] proposed a method which reformulates the modularity
concept in terms of the eigenvectors of a new characteristic matrix for the network,
198
L. da F. Costa et al.
called modularity matrix. For each subgraph g, its modularity matrix B(g) has
elements
X 
ki kj
ki ku
g
bij  aij 
 ij
aiu 
,
72
2M
2M
u2N g
for vertices i and j in g. Thus, in order to split the network in communities, rst the
modularity matrix is constructed and its most positive eigenvalue and corresponding
eigenvector are determined. According to the signs of the elements of this vector, the
network is divided into two parts (vertices with positive elements are assigned to a
community and vertices with negative elements to another). Next, the process is
repeated recursively to each community until a split which makes zero or a negative
contribution to the total modularity is reached. Following this idea, Newman proposed a new denition of communities as indivisible subgraphs, i.e. subgraphs whose
division would not increase the modularity. Currently, this method is believed to be
the most precise, as it is able to nd a division with the highest value of modularity
for many networks [169].
199
1.0
0.8
0.6
0.4
Girvan-Newman
Random walks
Extremal optimization
0.2
0.0
0
10
Figure 11. Comparison of precision between the methods developed by Girvan and Newman
(see section 11.2.1), the same method based on random walks [179], and the method developed
by Duch and Arenas, based on extremal optimization (see section 11.4.1). Each point in this
graph is an average of 100 realizations of networks created by using the model described
in section 3.5, with 128 vertices organized into 4 communities and varying the density of
connections inside and outside communities.
This measurement is based on the fact that edges which connect communities tend
to exhibit a small value for this coecient. The clustering coecient of edge (i, j) is
calculated as
Cij 
Zij  1
;
minki  1, kj  1
73
where Zij is the number of triangles to which (i, j) belongs. This method can be
generalized to more complex loops, e.g., squares. Though this method is simple
and fast (O(M4/N2)), it fails whenever the network has a small average clustering
coecient, because the value of Cij will be small for all edges. This suggests that the
method will work well only when applied to networks with a high average clustering
coecient, such as social networks [163].
200
L. da F. Costa et al.
s
X
aik  ajk 2 ,
74
k6i, j
201
This alteration on the local modularity makes the method more precise while not
aecting its execution time.
11.4.1. Extremal optimization. The extremal optimization method proposed by
Duch and Arenas [178] is a heuristic search for optimizing the value of the modularity Q. The local modularity represents the contribution of individual vertex i to the
modularity Q. If ci is the community of vertex i, the local modularity is given by
X
X
qi 
aij ci , cj ki
eci ck ;
78
ck
where eci cj are the elements of the community mixing matrix (page 196) and  is the
Kronecker delta. In order to keep the value of this contribution in the interval [1, 1]
and independent of vertex degree, it should be normalized by the degree of the
vertex, i.e. q^i  qi =ki . The value of q^ i is used as the tness for the extremal optimization algorithm. A heuristic search is performed to obtain the maximum value of the
modularity. Initially, the network is split into two random partitions with the same
number of vertices. After each step, the system self-organizes by moving the vertex
with lowest tness from one partition to another. The process stops when the
maximum value of Q is reached. After that, all links between both partitions are
deleted and the optimization of Q proceeds recursively considering every resulting
connected component. The process of community identication nishes when the
value of Q cannot be improved further.
Although this method is not particularly fast, scaling as O(N2 log N), it can achieve
high modularity values [178]. By comparing the precision of some methods as presented in gure 11, we can see that the extremal optimization method is more precise
than the methods based on removing edges with highest betweenness centrality
value. Moreover, it is clear that the computation of betweenness centrality by counting the number of shortest paths passing through each edge is more precise than
calculating this coecient by random walks [159].
11.5. Local methods
More recently, some methods have been developed to detect the local community of
a vertex based only on local information about the network topology. One such
method was proposed by Bagrow and Bolt [180], which is based on the change of
the hierarchical degree between two consecutive distances (see section 13). Starting
from a vertex v0, the vertices of successive hierarchical rings are added to the community, as long as the relation between the successive hierarchical degrees is greater
than a specied threshold 
kd v0 
> :
kd1 v0 
79
When the expansion reaches a distance d for which the above condition fails, the
community stops growing.
Despite its favorable speed, this approach has an important limitation: the division
is precise only when v0 is equidistant from all parts of its enclosing communitys
boundary [181]. In order to overcome this drawback, it has been suggested [180] that
the algorithm be executed N times starting from each vertex and then achieve a
202
L. da F. Costa et al.
consensus about the detected communities. However, this approach increases the
execution time of the algorithm.
Another local method was proposed by Clauset [181] which is based on computing
the local modularity. The idea is that of a step-by-step growth of the community
together with the exploration of the network. The community C starts with only the
original vertex v0. When a vertex is explored, a list of its neighbors is known. The set
U is a list of all vertices that are not in C but are adjacent to some of its vertices; the
set B (the boundary of C) is the subset of vertices in C that are adjacent of at least one
vertex in U. The local modularity is dened as the ratio of the number of edges with
one end point in B and neither end point in U to the number of edges with end points
in B. Considering undirected networks, this can be written as
P
i2B, j2C aij
R P
:
80
i2B, j aij
The algorithm consists in choosing iteratively from the set U the vertex that would
result in the largest increase (or smallest decrease) in the value of R when added to C.
The iteration stops when a pre-dened number of vertices was included in the community.
11.6. Method selection
Despite the many interesting alternative methods, including those briey reviewed
above, it should be noted that the problem of community nding remains a challenge
because no single method is fast and sensitive enough to ensure ideal results for
general, large networks, a problem which is compounded by the lack of a clear
denition of communities. If communities are to be identied with high precision,
the spectral method proposed by Newman [169] is a good choice. However, if priority is assigned to speed, methods such as those using greedy algorithms (runs in O(N
log2 N)) should be considered [176]. In brief, the choice of the best method to be used
depends on the conguration of the problem and the kind of desired results [163].
One fact that should have become clear from our brief review of community
nding approaches is the essential importance of the choice of the measurements
adopted to express the separation of the communities. As a matter of fact, such
measurements ultimately represent an objective denition of communities.
Therefore, an interesting perspective for further research would be to consider the
possible adaptation and combination of some of the measurements reported in this
survey with the specic objective of community characterization.
11.7. Roles of vertices
After community identication, it is possible to determine the role of vertices [154] by
using the z-score of the within-module degree, zi, and the participation coecient, Pi.
The z-score measures how well-connected vertex i is to the other vertices in the
community, being dened by
zi 
qi  qsi
,
qsi
81
203
Figure 12. A network such as that in (a) includes several subgraphs, such as cycles (b) and
trees (c).
where qi is the number of connections i makes with other vertices in its own
community si, q si is the average of q over all vertex in si, and qsi is the standard
deviation of q in si.
The participation coecient measures how well-distributed the edges of vertex i
are among dierent communities,
NM  2
X
qis
,
82
Pi  1 
ki
s1
where qis is the number of edges from vertex i to community s and ki is the degree of
vertex i. This value is zero if all edges are within its own community and it is close to
one if its edges are uniformly distributed among all communities. Based on these two
index, a zP parameter-space can be constructed, allowing the classication of vertices
into dierent roles (see e.g. [154]).
12. Subgraphs
A graph g is a subgraph of the graph G if N g  N G and Eg  EG, with the
edges in Eg extending over vertices in N g. If g contains all edges of G that connect
vertices in N g, the subgraph g is said to be implied by N g. Important subgraphs
include loops, trees (connected graphs without loops) and complete subnetworks
(cliques). Figure 12 shows a network and some subnetworks. There are many
ways to dene subgraphs in networks. An interesting way to describe the topology
of real networks in terms of subgraphs is by using the k-core decomposition. The
k-core is obtained by removing from the network all vertices with degree smaller
than k. After such a removal, some vertices in the resulting network may have degree
less than k; such vertices are removed and the network is analyzed again. When no
further removal is possible, the non-empty resulting subgraph is called k-core of the
original network [182]. An important application of such a concept is in network
visualization [183]. In this approach, the network is peeled layer by layer and the
structure is displayed from the outmost shells. An algorithm for this type of visualization is publicly available, namely the Large Network Visualization tool [183].
Recent studies about the properties and applications of k-core decomposition
in real networks have been performed [184, 185]. Important statistical properties
of k-core are discussed by Dorogvtesev et al. [182] and investigations about topology
of the Internet using k-core decomposition are presented by Carmi et al. [186].
204
L. da F. Costa et al.
Network motifs are subgraphs that appear more frequently in a real network than
could be statistically expected [189191] (see gure 13). Figure 14 shows some
(a)
(b)
Figure 13. In a real network (a), the number of motifs (represented here by three vertices
linked by dashed lines) is greater than in an equivalent random network (b).
(a)
(b)
(c)
(d)
(e)
(f)
(g)
(h)
(i)
Figure 14. Some types of motifs: (a) three-vertex feedback loop, (b) three chain, (c) feedforward loop, (d) bi-parallel, (e) four-vertex feedback loop, (f) bi-fan, (g) feedback with two
mutual dyads, (h) fully connected triad and (i) uplinked mutual dyad.
205
Nreal
 hNrand
i
i
i
irand
83
84
206
L. da F. Costa et al.
subgraph , dened as the ratio between geometric and arithmetic mean of its
weights, can be used:
g  P
Igng
wij
i, j2Eg
86
All possible subgraphs of the weighted graph can be categorized into sets of
topologically equivalent subgraphs.y Let M be one such
P set of topologically equivalent subgraphs.
The
intensity
of
M
is
given
by
I
M
g2M Ig and its coherence by
P
M  g2M g. An intensity score ZIM can be accordingly dened by
ZIM
IM  hIrand
i
M
87
Irand
M
M  hrand
i
M
rand
M
88
i and Irand
are the mean and the standard deviation of the intensities in
where hIrand
M
M
a randomized graph ensemble; hrand
i and rand
are the average and the standard
M
M
deviation of the coherence in the randomized ensemble. When the network is transformed to its unweighted version, ZIM and ZM tend to Z (see equation (83)).
1
X
Ak 
k0
k!
ii
89
where (Ak)ii is the ith diagonal element of the kth power of the adjacency matrix A,
and the factor k! assures that the sum converges and that smaller subgraphs have
more weight in the sum. Subgraph centrality can be easily computed [193] from the
spectral decomposition of the adjacency matrix,
SCi 
N
X
vj i2 ej ,
90
j1
yTwo subgraphs are topologically equivalent if the only dierence is the weight of the existing
edges.
207
where j is the jth eigenvalue and vj(i) is the ith element of the associated eigenvector.
This set of eigenvectors should be orthogonalized. The subgraph centrality of a
graph is given by [129]:
SC 
N
N
1X
1X
SCi 
ej :
N i1
N i1
91
(a)
(b)
Figure 15. Example of morphological operations: (a) Dilation: the dilation of the initial
subnetwork (dark gray vertices) corresponds to the dark and light gray vertices; (b)
Erosion: the erosion of the original subnetwork, given by the dark gray vertices in (a),
results in the subnetwork represented by the black vertices in (b).
208
L. da F. Costa et al.
92
"d g  "". . . g . . . :
|{z}
93
The rst operation converges to the entire network G and the second converges to an
empty network.
The d-ring of subgraph g, denoted Rd(g), is the subgraph implied by the set of
vertices
N d gnN d1 g;
the rs-ring of g, denoted Rrs(g), is the subgraph implied by
N s gnN r1 g:
Note that Rd(g)  Rdd(g). The same denitions can be extended to a single vertex
considering the subgraph implied by that vertex, and to an edge considering the
subgraph formed by the edge and the two vertices that it connects. In the case of
a single vertex i the abbreviations Rd(i) and Rrs(i) are used. For example, in gure 16,
R1(15) includes the vertices {8, 14, 16, 17}; R2(15) includes {1, 13, 18, 19}; for the
graph g implied by the vertices {1, 15, 22} (in black), R1(g) includes the vertices
in white: {2, 3, 4, 5, 6, 7, 8, 9, 14, 16, 17}.
The hierarchical degree of a subgraph g at distance d, henceforth represented
as kd(g), can be dened as the number of edges connecting rings Rd(g) to Rd1(g).
Note that k0(i) is equal to ki.
16
22
15
23
7
17
24
4
6
14
10
13
18
19
25
26
21
12
11
27
Figure 16. The subgraph of interest is dened by black vertices, g  {1, 15, 22}. The rst
hierarchical level of g is given by the rst dilation around g, represented by the white vertices;
the second hierarchical level is obtained dilating the subnetwork again, represented by the gray
vertices. The hierarchical degree of the rst level is given by the number of edges from white to
light gray vertices, k1(g)  12, and the hierarchical degree of the second level is the number of
edges from light gray to dark gray vertices, k2(g)  2.
209
Another measurement which can be hierarchically extended is the clustering coefcient. The rs-clustering coecient of g, Crs(g), can be dened as the number of edges
in the respective rs-ring nrs, divided by the total of possible edges between the vertices
in that ring, i.e. for undirected networks
Crs g 
2nrs g
:
jN Rrs gjjN Rrs gj  1
94
kd1 g
:
jN Rd gj
95
It is also possible to dene the divergence ratio, which corresponds to the reciprocal
of the convergence ratio
dvd g 
jN Rd gj
:
kd1 g
96
97
210
L. da F. Costa et al.
hMc i ldf ,
98
where the average mass hMc i is dened as the number of vertices in the cluster and
df is the fractal cluster dimension.
For a network whose vertices have a typical number of connections, both
exponents are the same, but this is not the case for scale-free networks.
Another scaling relation is found with a renormalization procedure based on the
box counting method [201]. A renormalized network is created with each box of
the original network transformed into a vertex and two new vertices are connected
if at least one edge exists between vertices of the corresponding boxes in the
original network. By considering the degree k0 of each vertex of the renormalized
network versus the maximum degree k in each box of the original network we
have that:
k
k0  ld
B k,
99
The exponents  (of the power law of the degree distribution), dB and dk are
related by [201]:
 1
dB
:
dk
100
211
%  a
:
1  a
103
This value is an absolute quantity, in the sense that values of 
 greater than zero
imply larger reciprocity than the random version (reciprocal networks), while values
below zero imply smaller reciprocity than a random network (antireciprocal networks). This concept can be easily extended to weighted networks by substituting
aij for wij in the above expressions.
15.3. Matching index
A matching index can be assigned to each edge in a network in order to quantify the
similarity between the connectivity of the two vertices adjacent to that edge [207].
A low value of the matching index identies an edge that connects two dissimilar
regions of the network, thus possibly playing an important role as a shortcut between
distant network regions [207]. The matching index of edge (i, j) is computed as the
number of matching connections of vertices i and j (i.e. connections to the same other
212
L. da F. Costa et al.
vertex k), divided by the total number of connections of both vertices (excluding
connections between i and j),
P
k6i, j aik ajk
P
ij  P
:
104
k6j aik 
k6i ajk
For directed networks, matching connections are only those in the same direction,
and incoming and outgoing connections of vertices i and j should be considered
separately. The matching index has also been adapted to consider all the immediate
neighbors of a node, instead of a single edge [208].
100 3
10
101
100
101
100
102
101
(b)
106 3
10
WS model
CN model
(a)
100
105
104
103
106
105
104
103
106
105
104
103
101
102
WS model
CN model
100
0.35
0.30
0.25
0.20
0.15
0.10
0.05
0.00
0.35
0.30
0.25
0.20
0.15
0.10
0.05
0.00
0.0
0.1
0.2
0.3
0.4
(c)
0.1
WS model
CN model
101
100
101
102
(d)
102
WS model
CN model
103
106
105
104
103
102
106
105
104
103
102
106
105
104
103
102
WS Model
p = 0.0002, N = 500
p = 0.0002, N = 1000
p = 0.0002, N = 2000
p = 0.02, N = 500
p = 0.02, N = 1000
p = 0.02, N = 2000
p = 0.10, N = 500
p = 0.10, N = 1000
p = 0.10, N = 2000
CN Model
p = 05%, N = 500
p = 05%, N = 1000
p = 05%, N = 2000
p = 10%, N = 500
p = 10%, N = 1000
p = 10%, N = 2000
p = 15%, N = 500
p = 15%, N = 1000
p = 15%, N = 2000
Figure 17. Trajectories dened by pairs of measurements. Each point corresponds to 10 network model realizations. Network sizes used are 500,
1000, and 2000; average degrees vary from 4 to 204 in steps of 20; for the community model, pout/pin is 5%, 10%, and 15%; for the WS model
the rewiring probability values are 0.0002, 0.02, and 0.1.
101
214
L. da F. Costa et al.
decreases faster for WS model than for GN, while C~ remains larger for the former
model than in the other cases. This eect can be explained by the fact that the
~
WS model is more regular than the GN and has larger C.
In the case of ER, CN and BA models, the values of  and C~ are smaller than for
the other models. For  the connections are not limited by proximity, adjacency or
geography. At the same time, loops of order three appear when new edges are added
~
to them, increasing C.
Another interesting fact observed from gure 17(a) is that all curves converge on
the same point, corresponding to fully connected graphs, as the networks become
denser. Therefore,  and C~ tend to unit value at that stage.
16.1.2. Average clustering coecient and average hierarchical clustering coecient
of second level. The combination of C~ and the average hierarchical clustering
coecient of second levely hC2 ii, where the average is taken over all vertices in
the network, see gure 17(b), tends to follow a power law for all trajectories except in
the case of the GN and WS models, whose curves have a minimum value for hC2 ii.
Nonetheless, the highest growth rate is observed for the trajectories of WS model
after the minimum value of hC2 ii is reached.
Another interesting characteristic of this combination of measurements is that C~ is
greater than hC2 ii. This can be explained by the fact of hC2 ii is related to the
presence of loops of order ve without additional connections between their vertices
[200]. Since loops of higher orders are less likely to appear in the considered
networks, C~ tends to become larger than hC2 ii.
16.1.3. Pearson correlation coecient and central point dominance. For all considered network models, except for the GN case, the Pearson correlation coecient of
vertex degrees represented by r (section 6), is close to zero even with the addition of
new edges, as can be seen in gure 17(c), which shows the trajectories dened by the
pair of measurements r and central point dominance CPD (section 9) as the average
degree increases. This property can be explained by the fact that in ER, CN and WS
models the edges are placed irrespectively to vertex degree, while the BA model is
based on preferential growth [117], which leads to non-assortative mixing (i.e. no
correlation between vertex degrees). The r value for the GN model is greater than
zero in almost all cases because its growing dynamics is based on the geographic
proximity of vertices. As the position of vertices is randomly chosen, some regions
may by result be highly populated, implying the respective vertices have a high
probability of becoming highly interconnected. On the other hand, vertices belonging
to the regions barely populated have small chances to become hubs while still
having a good chance of being connected. These two opposite behaviors tend to
imply a r value greater than zero.
The central point dominance is a measurement of the maximum betweenness
of any point in the network [143] (see section 9 for further details). By observing
gure 17(c), one can see that most network models exhibit average values of this
measurement close to zero, except for the BA, GN and WS cases. In BA networks,
yNote that C~ is identical to the average hierarchical clustering coecient at the rst level.
215
values signicantly larger than zero only occur in the beginning of the growth process
(i.e. in the presence of few edges).
For WS models, the way in which they are normally constructed (see section 3)
directly contributes to producing a network with modular structure, hence a high
CPD value. Nevertheless, when new edges are added, the network gets denser and
the value of this measurement goes to zero. In CN models, the CPD coecient
depends on the relation between the average vertex degrees inside and outside
communities, i.e. when the network is highly modular, the CPD value tends to
become larger.
16.1.4. Average hierarchical degree of second level and average hierarchical divergence ratio of third level. As shown in gure 17(d), all curves obtained for the
average hierarchical degree of second levely hk2 ii and the average hierarchical
divergence ratio of level three hdv3 ii have similar behavior. When the networks
are sparse and new edges are added, increasing the average vertex degree, the average
hierarchical counterpart increases until a maximum value. Afterwards, since the
networks have a nite size, further increase of the connectivity tends to reduce the
number of hierarchical levels in the networks and, as consequence, the average
hierarchical vertex degrees of levels higher than one tend to decrease. The hierarchical divergence ratio of level three decreases with larger average vertex degree.
16.1.5. Discussion. As presented in gure 17, each measurement is specically sensitive to the eects of addition of new edges to a network. Interestingly, the sensitivity also depends strongly on the network model. Some trajectories were closer to
one another for specic network models as a consequence of inherent structural
similarities. This eect is particularly pronounced in trajectories dened by the average clustering coecient and average shortest path length, where two classes of
trajectories appear, one for ER, BA and CN, and another for GN and WS.
The analysis of network dynamics provides insights about model similarities.
If network trajectories evolve in a similar fashion, it is possible to infer that these
networks have similar structure concerning the respective pair of measurements.
However, for other measurements, this similarity may be weaker or non-existent.
For instance, in the space dened by the average clustering coecient and average
shortest path length, the curves obtained for ER and BA evolve in similar fashion.
This behavior is not observed in the space dened by the central point dominance
and Pearson correlation coecient. Also, by inspecting the trajectories, it is possible
to determine the correlation between measurements during the networks evolution.
For instance, the dynamics of the average clustering coecient and the average
hierarchical clustering coecient of second level present correlation for ER, BA
and CN.
The trajectory-based study described here can be immediately extended to real
network analysis and modeling. In the case of the WWW, for instance, by inspecting
its evolution in the measurements space it is possible to develop more precise models
yNotice that hk2 ii (average taken over all vertices i in the network) depends on the network
connectivity.
216
L. da F. Costa et al.
1.5
1.3
3.0
Central point dominance
1.4
3.5
BA
ER
GN
WS
1.2
1.1
1.0
0.9
2.0
1.5
1.0
0.5
0.8
0.7
0.0
2.5
BA
ER
GN
WS
0.5
1.0
1.5
2.0
2.5
0.0
0.0
0.5
1.0
1.5
2.0
(a)
(b)
2.5
217
models are direct consequences of the fact that these networks are inherently characterized by low overall average shortest path length. A similar situation is veried
for the clustering coecient, which tends to be small in those two types of networks.
The evolution of the trajectory regarding the clustering coecient and average shortest path length for the GN and WS networks is a direct consequence of the fact that
the progressive edge rewiring tends to strongly reduce those two measurements.
The marked dierence of sensitivity of measurements to perturbations depending
on the type of network model suggests that quantications of the sensitivity (e.g. the
standard deviation or entropy) can be potentially useful as additional measurements
for network identication.
218
L. da F. Costa et al.
Figure 19. Example of two measurements which, though correlated, can still contribute to
category identication.
with those obtained when the three models are considered together. This is the case,
for instance, of the low correlation observed between the measurement average
shortest path length  and log-log degree distribution straightness st for each of
the three individual models and high correlation otherwise obtained when these
three models are considered jointly. This interesting behavior can be immediately
explained by considering gure 20, which illustrates that three low correlation groups
can result in global alignment, therefore implying the relatively strong overall negative correlation. Such situations indicate that the individual and global correlations
can provide complementary information about dierent types of relationships.
It is also clear from the results in table 3 that particularly high correlations were
obtained between the average shortest path length  and the vertex degree at the
second hierarchical level hk2 ii. This fact suggests that this specic hierarchical
vertex degree may be considered, at least for the three considered types of networks,
as an estimation of the average shortest path length, allowing substantial computational saving. Another interesting result is that the highest correlations were obtained
for the BA model, as a possible consequence of the presence of the respective hubs.
For instance, the correlation between the average shortest path length and the
average clustering coecient was found to be equal to 0.63 for the BA models.
This is a consequence of the fact that additional links tend to be established with the
hubs and therefore contribute to higher clustering and shortest paths.
219
Table 3. Correlations between measurements for the BA, ER and GN models and All
jointly. The values were estimated from 1000 realizations for each model of networks
with N  1000 and hki  4.
st
C~
CPD
hk2 ii
hC2 ii
hdv3 ii
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
BA
ER
GN
All
st
C~
CPD
hk2 ii
hC2 ii
hdv3 ii
1.00
1.00
1.00
1.00
0.22
0.01
0.13
0.71
0.06
0.01
0.04
0.31
0.01
0.06
0.10
0.69
0.09
0.61
0.05
0.87
0.01
0.04
0.08
0.96
0.02
0.03
0.00
0.37
0.01
0.03
0.02
0.14
1.00
1.00
1.00
1.00
0.29
0.07
0.00
0.82
0.38
0.04
0.02
0.96
0.23
0.10
0.02
0.44
0.30
0.03
0.28
0.80
0.02
0.04
0.09
0.86
0.26
0.10
0.28
0.74
1.00
1.00
1.00
1.00
0.63
0.08
0.03
0.88
0.39
0.03
0.03
0.02
0.63
0.08
0.02
0.43
0.58
0.45
0.59
0.99
0.57
0.01
0.09
0.97
1.00
1.00
1.00
1.00
0.58
0.07
0.23
0.41
0.99
0.90
0.65
0.79
0.74
0.16
0.18
0.91
0.91
0.25
0.03
0.79
1.00
1.00
1.00
1.00
0.60
0.06
0.13
0.85
0.59
0.02
0.07
0.05
0.52
0.01
0.00
0.18
1.00
1.00
1.00
1.00
0.76
0.19
0.11
0.49
0.94
0.16
0.50
0.27
1.00
1.00
1.00
1.00
0.69
0.04
0.21
0.96
1.00
1.00
1.00
1.00
220
L. da F. Costa et al.
BA
ER
GN
4
1.00
0.95
0.90
0.85
0.80
Figure 20. Example of scatterplot showing the low correlation between log-log degree density straightness and the average shortest path length for all the individual models and the high
correlation for all models together. The networks have N  1000 and hki  4; 500 realizations
of each model were used.
221
around this center, implying that additional statistical measurements other than the
mean need to be used for proper characterization of the network under analysis.
Therefore, any objective attempt at characterizing, comparing or classifying complex
networks needs to take into account statistical distributions in phase spaces such as
that in gure 21. Such an important task can be eectively accomplished by using
traditional and well-established concepts and methods from Multivariate Statistics
(e.g., [68, 69, 211]) and Pattern Recognition (e.g. [68, 69, 212]).
It should be observed that while too small sets of measurements can prove to be
insucient to characterize a network, many highly correlated features (as illustrated
in the previous section) may not contribute substantially to the overall understanding
of the connectivity. It is therefore interesting to consider statistical methods
capable of reducing the dimensionality of the feature space while retaining the contribution of the more meaningful measurements. As far as the choice and interpretation of network measurements are concerned, two multivariate methods stand out as
being particularly useful, namely Principal Component Analysis  PCA (e.g. [68, 69])
and Canonical Variable Analysis (e.g., [69, 211]). While the former procedure allows
the reduction of the dimensionality of the measurement space, obtained in terms of
projections so as to concentrate the variation of the data along the rst new axes
(i.e. those associated to the highest covariance matrix eigenvalues), the latter method
implements such projections so as to achieve best separation, in terms of inter and
intra-class distances (see below), between the involved classes of networks under
analysis. In both these methods, the variables associated to each of the axes in the
new, dimensionally reduced feature space, correspond to linear combinations of the
original measurements. Consequently, some indication about the contribution of
each measurement for the description of the statistical distribution of the studied
networks can be obtained by considering the absolute values of the respective
weights in the linear combination. Such a procedure can be applied in order to
help identify the most meaningful measurements.
The current section presents and illustrates in a self-contained and accessible
fashion these two dimensionality reduction methods from multivariate statistics
(PCA and canonical analysis). The potential for applications of these methods is
00
illustrated with respect to three reference complex network models  namely Erdos
and Renyi random graph (ER), Barabasi-Albert (BA) and Geographical Network
model (GN), against which some real-world networks are classied.
18.1. Principal component analysis
Let the connectivity properties of a set of R complex networks, irrespective of their
type or origin, be described in terms of P scalar measurements xi, i  1, 2, . . . , P,
organized as the feature vector x~  x1 , x2 , . . . , xP T . The covariance matrix K can
be estimated as
T
~ x~  hxi
~
x~  hxi
,
106
K
R
~ is the average feature vector, each element of which corresponds to the
where hxi
average of the respective measurement. As K is a real and symmetric P 
 P matrix,
a set of P decreasing eigenvalues i and respectively associated eigenvectors v~i can
222
L. da F. Costa et al.
The original feature vectors x~ can now be transformed into a new coordinates
reference through the following linear transformation corresponding to axes
rotation:
X~ Tx~
108
Figure 22 shows the eect of projecting the cloud of points in gure 21 onto the
two main axes so that the variance of the samples is maximized. Although useful for
implementing dimensionality reduction  which favors visualization, redundancy
reduction, and computational savings  the principal component analysis method is
limited as it does not explicitly consider the category of each individual. This limitation is overcome in the canonical variable analysis described below.
223
Figure 22. The principal component projection of the distribution of measurements in gure
21. Measurement values were rst normalized by subtracting the corresponding mean value
and dividing by the standard deviation to avoid biases due to the dierent absolute values. The
rst and second PCA variable have projecting vectors (0.005, 0.707, 0.707) and (0.006,
~ ) respectively.
0.707, 0.707) in the space dened by (r,C,
Let us consider that the R complex networks of interest can be divided into Nc
classes, each one with Ni objects and identied as Ci, i  1, 2, . . . , Nc, and that each
object  is represented by its respective feature vector !
x  x1 , x2 , . . . , xP T (see the
previous section). The total scatter matrix, S, expressing the overall dispersion of
the measurements [68] is dened as follows
S
T
!
~ !
~ :
x  hxi
x  hxi
110
1
X 
!
T
~i !
~i ,
x  hxi
x  hxi
111
2Ci
Nc
X
Si :
112
i1
Finally, the interclass scatter matrix, characterizing the dispersion between each pair
of classes, is given as
Sinter 
Nc
X
i1
T
~ i  hxi
~ hxi
~ i  hxi
~ :
Ni hxi
113
224
L. da F. Costa et al.
114
The objective of the canonical analysis method is to maximize the interclass dispersion while minimizing the intraclass scattering (e.g. [211]). This can be achieved
through the following linear transformation
!;
115
!
x  x
!, !
!T
!
where   
1 2 , . . . , P  is chosen so that 1 maximizes the ratio
T
!
1 Sinter !
1
,
T
!
!
 S
1
116
intra 1
and !
j , j  2, 3, . . . , P, maximizes a similar ratio and
T
!
j Sintra !
117
j  0:
2 , . . . , !
P correspond to the eigenvectors of
It can be shown that the vectors !
1 , !
1
the matrix Sintra Sinter.
Figure 23 illustrates a phase space of reduced dimensionality (a) containing
two distributions of observations, as well as the respective PCA (b) and canonical
analysis (c) projections considering two dimensions. The potential of the canonical
approach for implementing dimensionality reduction while favoring well-separated
clusters is evident from this example.
yOptimal performance is guaranteed in case the involved mass and conditional properties
are completely known (see section 19 and [68, 69]).
225
4
Second canonical variable
(a)
2
4
distribution 1
distribution 2
4
2
0
2
distribution 1
distribution 2
4
2
Figure 23. A phase space (scatterplot) containing two distributions of points (a) and respective PCA (b) and canonical (c) projections. Note that neither the projections into the three
main planes (a) nor the PCA projection (b) can separate the distributions, which is suitably
accomplished by the canonical projection (c).
with unknown classication, the most likely category c to be assigned to it is the one
for which the respectively observed feature vector x~ produces the highest value of
~  . In case the probability functions are not available, it is still possible to use
P pxjC
approximate classication methods such as k-nearest neighbors (e.g. [69]), which
consists of identifying the set of the k individuals which are closer (i.e. smaller
distance between feature vectors) to the sample to be classied, and take as the
resulting category that corresponding to the most frequent class among the nearest
neighbors.
Let us illustrate the above concepts and methodology in terms of a situation
involving three categories C1, C2 and C3 of complex networks, namely
Geographical Network (GN), Watts-Strogatz small-world network (WS) and
00
Erdos and Renyi random graph (ER), characterized in terms of their normalized
average shortest path length l and Pearson correlation coecient of vertex degrees r.
The corresponding scatterplot is shown in gure 24(a).
226
L. da F. Costa et al.
Usually, we do not know the mass and conditional probabilities of each type of
networks, so they have to be estimated from the available data. This stage can be
understood as the training phase of the Bayesian decision theory method. There
are two main ways to estimate the probabilities required: parametric and
non-parametric. In the former, the mathematical form of the probability functions
is known (e.g., normal distribution) and the respective parameters (mean and
227
118
where x~i is the vector that stores the network set of measurements and ca is the
class of networks associated to the model a.
Figure 24(b) illustrates the parametric approach, considering three normal density
distributions, applied to the data in gure 24(a). These distributions were dened by
having their parameters (namely average vector and covariance matrix) estimated
from the respective experimental measurements. The separating frontiers are shown
in the projection at the bottom of the gure. The decision regions obtained by using
non-parametric estimation through Parzen windows are shown in gure 24(c).
Note that a very high dimensional feature space implies that a substantially high
number of individuals must be considered in order to obtain properly estimated
(i.e. not too sparse) densities. Therefore, it is essential to limit the number of measurements to a small set of more discriminative features. An interesting alternative
involves the use of canonical projections in order to reduce the dimensionality of
the problem. A key open question which is briey addressed in this section regards
which of the several topological measurements available for complex networks
characterization can yield the best characterization and discrimination among the
principal network models.
228
L. da F. Costa et al.
Network (GN) models. The following experimental networks are considered in our
experiments:
US Airlines Transportation Network (USATN): The USATN is composed by 332
US airports in 1997, connected by ights. The data was collected from the Pajek
datasets [215]. This kind of network exhibits a power law behavior as described in
[89, 216].
229
(a)
(b)
(c)
(d)
Figure 25. The Delaunay geographical network (DLN) for several numbers of random
rewirings: original (a) and after 60 (b), 120 (c) and 200 (d) rewirings.
Progressively rewired (degree preserving) versions of this network were also considered in order to illustrate the evolution of trajectories in decision spaces. Figure 25
illustrates four of these successive congurations.
A total of three sets of 300 realization of each reference model (BA, ER and GN)
were generated. The networks for each set were designed to have average vertex
degrees near the experimental value. The model and experimental networks were
characterized in terms of the following measurements: straightness st, average vertex
degree hki, Pearson correlation coecient of vertex degrees r, average clustering
~ average shortest path length , central point dominance CPD, average
coecient C,
hierarchical degree of second level hk2 ii, average hierarchical clustering coecient of
second level hC2 ii and average hierarchical divergence ratio of the third level hdv3 ii.
In order to provide a general and representative view of the eect of these measurements in the classication of real networks, we considered the following combinations of measurements:
(i) {, st},
~ },
(ii) {hki, C,
230
(iii)
(iv)
(v)
(vi)
(vii)
(viii)
L. da F. Costa et al.
Table 4 shows the results, i.e. the theoretical model and respective average vertex
degree which have been associated to each experimental network by the classication
procedure, obtained for each of these congurations. More specically, each experimental network was classied as having the same category as the theoretical model
dening the decision region in the canonical projection space where the feature
vector of the experimental data was mapped.
A number of interesting facts can be inferred from table 4. To begin with, the
compatibility between the type of network model expected and obtained for each of
the experimental networks varies considerably for each case. The best compatibility
was obtained for the DLN, i.e. the identied model was compatible with the expected
type (geographical) for all considered combinations of measurements. Compatible
average vertex degrees have also been obtained for cases (iii), (vi)(viii). Figure 26
illustrates the location of this network in the scatterplot dened by the canonical
projection of the combination of all measurements. In this gure, which also shows
the separating frontiers of the decision regions, the experimental network DLN
(represented as M) resulted closer to GN with average vertex degree of 6. PPIN
implied the highest number of incompatible classications which, instead of being
identied as a BA network (as could be expected [217]), was understood as GN
except for the cases {, st} and {st, r, CPD}. A similar situation was veried regarding the average vertex degrees. Figures 27(c) and (d) show the resulting position of
this network within the scatterplots obtained by canonical projection of the combination of all measurements (c) and all except those hierarchical (d). Note the good
agreement between the resulting categories obtained for these two cases. In both
cases, the PPIN resulted very close to the GN with average vertex degree of 3.03.
A particularly interesting result has been obtained for the USATN, which tended
to appear well away from all theoretical groups in most cases, as illustrated in the
~ }. Intermediate
scatterplot shown in gure 27(a) with respect to the case {hki, C,
results were obtained for the other networks. For instance, TRNE has been classied
as expected (i.e. as a BA network) in 2 cases, identied as an ER in only one case and
as a GN in 5 cases. Figure 27(b) shows the position of this network in the scatterplot
dened for all measurements. Note that TRNE appears almost in the middle of the
ER and GN types for average vertex degree of 2.45.
It is also possible to use hierarchical clustering algorithms (e.g. [68, 69, 200])
in order to obtain additional information about the relationship between the analyzed networks. Figure 28 shows the dendrogram obtained for the situation depicted
in gure 27(c) by using Wards agglomerative method. In this method the networks,
initially treated as individual clusters, are progressively merged in order to guarantee
minimal dispersion inside each cluster. The linkage distance is shown along the
y-axis, indicating the point where the clusters are merged (the sooner two clusters
are merged, the most similar they are). The similarity between the cases belonging to
BA hki 6.0
BA hki 2.0
ER hki 2.0
GN hki 4.0
BA hki 3.59
BA hki 2.45
BA hki 3.03
GN hki  6.0
GN hki  4.0
GN hki 3.03
GN hki 2.45
GN hki 3.59
(ii)
GN hki 6.0
GN hki 2.0
GN hki 4.0
BA hki 6.0
(iii)
(iv)
GN hki 4.0
ER hki 2.0
BA hki 4.0
BA hki 4.0
BA hki 10.0
* Class identified well away from all considered theoretical models (see, for instance, figure 27(c)).
BA hki 10.0
US Airlines transportation
network (USATN)
hki  12.8
Autonomous
System (AS)
hki  3.59
Transcriptional regulation
Network of the E. coli
(TRNE) hki  2.45
Protein-Protein interaction
Network of the
Saccharomyces
Cerevisiae (PPIN)
hki  3.03
Delaunay Network
(DLN) hki  6.0
(i)
Expected network
GN hki 4.0
GN hki 3.03
ER hki 2.45
GN hki 3.59
(v)
GN hki 6.0
GN hki 3.03
ER hki 2.45
GN hki 3.59
(vi)
GN hki 6.0
ER hki 2.0
GN hki 2.45
BA hki 6.0
(vii)
GN hki 6.0
GN hki 3.03
ER hki 2.45
GN hki 3.59
(viii)
The classes assigned to the real networks by considering each combination of measurements. The classes in bold mean wrong identied model and,
in italic style, wrong average vertex degree.
Experimental network
Table 4.
232
L. da F. Costa et al.
Table 5.
Measurement
Symbol
Equation
E
h
V
C and C~
Cw
	
kmax
knn(k)
r
~ Q
Q,
b and 
H(i)
S
Ai
Hi
T
R
Bi
CPD
Ml
Q
Pi
zi
SPi
SC
Crs
cvd(i)
dvd(i)
% and 
ij
(13)
(14)
(15)
(17)
(18) and (25)
(27)
(32)
(40)
(42)
(43)
(46) and (47)
(48) and (49)
(50)
(55)
(56)
(57)
(60)
(61)
(65)
(66)
(68)
(69)
(82)
(83)
(84)
(91)
(94)
(95)
(96)
(101) and (102)
(104)
each of the three types of networks is reected by the fact that three respective main
branches are obtained in the dendrogram in gure 28. The GN cluster incorporates
the experimental protein-protein network, to which it is most closely related by the
measurements. Note that the GN group, including the protein-protein network, is
signicantly dierent from the ER and BA models at the right-hand side of the
gure, as indicated by the high linkage distance at which these two groups (i.e. the
GN and ER/BA) are merged.
The results discussed above illustrate the classication procedure and its potential
for identifying the category of networks of unknown nature. The fact that the
assigned category sometimes varies according to the choice of measurements suggests
the presence of specic topological features in some experimental networks which are
not fully compatible with any of the assumed theoretical reference models. Indeed,
the consideration of a more comprehensive set of measurements can, in principle,
provide a more meaningful subclassication of the networks. Such a possibility is
particularly important in the case of scale-free networks, which are known to involve
233
subtypes [24]. For instance, TRNE has been identied in our experiments as having
BA type while considering two measurements (i.e. {st, }), but was understood as a
~ }) and as ER when we
GN model by considering three measurements (i.e. {hki, C,
~
considered six measurements (i.e. {hki, C, , st, r, CPD}).
It should be always kept in mind that the consideration of an excessive number
of measurements may ultimately compromise the quality of the classication.
Methodologies such as the canonical analysis followed by Bayesian classication
can be used to identify the features which contribute particularly to the correct
classications. This can be done by considering the measurements which contribute
more intensely to the canonical projections providing the largest number of correct
classications. A simpler methodology involves the application of the principal component analysis to remove the redundancies between the measurements. In the case
of a reduced number of measurements, it is also possible to consider all the respective
combinations and identify which of them yields the best classications. Another
interesting possibility for investigating complex network connectivity is to consider
outliers analysis (e.g. [208]). The reader interested in additional information on
multivariate statistics and feature selection is referred to the specialized literature
(e.g., [68, 69, 211, 212]) for more in-depth discussion and coverage. Many other
methods from multivariate statistical analysis, including hierarchical clustering and
structural equation modeling, can also be valuable for investigations in complex
network research. Though the potential of hierarchical clustering for suggesting
relationships between the classes is briey illustrated in the following, further information about such methods can be found in textbooks such as [68, 69, 211213, 220].
(a)
Figure 27. Examples of classication by canonical variable analysis and Bayesian decision theory: (a) US Airlines Transportation Network
(USATN); (b) the Transcriptional Regulation Network of the E. coli (TRNE); and (c) the Protein-Protein Interaction Network of the
Saccharomyces Cerevisiae (PPIN), considering all measurements; (d) the same protein network as in (c) but excluding the hierarchical measurements. Note the presence of the separating frontiers between the decision regions in the scatterplots. The arrows indicate the mapped experimental
networks.
(d)
(b)
(c)
234
L. da F. Costa et al.
235
Figure 28. Dendrogram obtained for the protein-protein interaction network considering all
measurements except those hierarchical. Note that the BA, ER and GN networks resulted in
well-separated branches, while the protein-protein network was included into the latter group.
236
L. da F. Costa et al.
ments can behave very dierently with respect to such induced changes. Because one
of the most challenging issues related to network categorization regards the choice of
the features to be taken into account, we provided a self-contained discussion about
how multivariate statistics concepts and methods can be applied for that aim. More
specically, we showed how high dimensional measurement spaces can be eectively
projected, by using principal component analysis, into lower-dimensional spaces
favoring visualization and application of computationally intensive measurements.
We also described how two useful methods, namely canonical analysis and Bayesian
decision theory, can be combined to provide the means for semi-automated identication of the eective linear combinations of measurements, in the sense of allowing
good discrimination between network categories. The potential of such multivariate
methodologies was illustrated for theoretical models and experimental networks. The
results clearly suggested that considering a comprehensive set of measurements can
provide more complete characterization of the topological properties of the networks
to the point of requiring a revision of the traditional classication of experimental
networks into subclasses or new models.
All in all, this survey provides for the rst time, an integrated presentation and
discussion of a comprehensive set of measurements previously covered in separate
works. In addition, it addresses important issues related to the application of these
measurements for characterization and classication of networks, including dynamic
representations in terms of trajectories, redundancy between measurements as quantied by correlations, perturbation eects and a powerful multivariate framework for
classication of networks of unknown category. The systematic application of such
concepts and tools is poised to yield a wealth of new results in the study of complex
networks.
Acknowledgements
We are grateful to Lucas Antiqueira, Carlos A.-A. Castillo-Ocaranza, Ernesto
Estrada, A. D az-Guilera, Shalev Itzkovitz, Marcus Kaiser, Xiang Lee, Jon
Machta, Adilson E. Motter, Osvaldo N. Oliveira-Jr, Andrea Scharnhorst,
Matheus Viana, and Duncan Watts for comments and suggestions. Luciano da
F. Costa is grateful to FAPESP (procs. 99/12765-2 and 05/00587-5), CNPq (proc.
308231/03-1) and the Human Frontier Science Program (RGP39/2002) for nancial
support. Francisco A. Rodrigues is grateful to FAPESP (proc. 04/00492-1) and
Paulino R. Villas Boas is grateful to CNPq (proc. 141390/2004-2).
References
[1]
[2]
[3]
[4]
[5]
[6]
[7]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
[38]
[39]
[40]
[41]
[42]
[43]
[44]
[45]
237
238
L. da F. Costa et al.
[46] S. Wasserman and K. Faust, Social Network Analysis (Cambridge University Press, Cambridge, 1994).
[47] P. Hage and F. Harary, Island Networks: Communication, Kinship and Classication
Structures in Oceania (Cambridge University Press, New York, 1996).
[48] W.E. Baker, Networking Smart: How To Build Relationships for Personal and Organizational Success (Backinprint.com, 2000).
[49] W.E. Baker, Achieving Success Through Social Capital: Tapping Hidden Resources in Your
Personal and Business Networks (Jossey-Bass, London, 2000).
[50] R.R. McNeill and W.H. McNeill, The Human Web: A Birds-Eye View of World History
(W.W. Norton & Company, New York, 2003).
[51] P.R. Monge and N.S. Contractor, Theories of Communication Networks (Oxford
Univeristy Press, New York, 2003).
[52] P.J. Carrington, J. Scott and S. Wasserman (Eds), Models and Methods in Social Network
Analysis (Cambridge University Press, Cambridge, 2005).
[53] L.C. Freeman, The Development of Social Network Analysis: A Study in the Sociology of
Science (Empirical Press, New York, 2004).
[54] P. Csermely, Weak Links: Stabilizers of Complex Systems from Proteins to Social
Networks (Springer, Berlin, 2006).
[55] D. Messner, The Network Society: Economic Development and International Competitiveness as Problems of Social Governance (Frank Cass Publishers, Portland, 1997).
[56] Ross Dawson, Living Networks: Leasing your Company, Customers, and Partners in the
Hyper-Connected Economy (Prentice Hall, New Jersey, 2003).
[57] C. Westland, Financial Dynamics: A System for Valuing Technology Companies (John
Wiley & Sons, London, 2003).
[58] Y. Benkler, The Wealth of Networks: How Social Production Transforms Markets and
Freedom (Yale University Press, Yale, 2006).
[59] M. Dodge and R. Kitchin, Mapping Cyberspace (Routledge, New York, 2001).
[60] M. Dodge and R. Kitchin, Atlas of Cyberspace (Addison-Wesley, Great Britian, 2001).
[61] P. Baldi, P. Frasconi and P. Smyth, Modeling the Internet and the Web: Probabilistic
Methods and Algorithms (John Wiley & Sons, England, 2003).
[62] R. Pastor-Satorras and A. Vespignani, Evolution and Structure of the Internet:
A Statistical Physics Approach (Cambridge University Press, Cambridge, 2004).
[63] D. Stauer and A. Aharony, Introduction to Percolation Theory (Taylor and Francis,
London, 1994).
[64] A. Bunde and S. Havlin, Fractals in Science (Springer, Berlin, 1995).
[65] A. Bunde and S. Havlin, Fractals and Disordered Systems (Springer, Berlin, 1996).
[66] Y. Bar-Yam, Dynamics of Complex Systems (Perseus Books, New York, 1992).
[67] N. Boccara, Modeling Complex Systems (Springer-Verlag New York, 2004).
[68] L. da F. Costa and R.M. Cesar Jr, Shape Analysis and Classication: Theory and Practice
(CRC Press, New York, 2001).
[69] R.O. Duda, P.E. Hart and D.G. Stork, Pattern Classication (John Wiley & Sons, Inc.,
New York, 2001).
[70] E. Ziv, R. Koytche, M. Middendorf and C. Wiggins, Phys. Rev. E 71 016110 (2005).
[71] M. Barthelemy, A. Barrat, R. Pastor-Satorras and A. Vespignani, Physica A 346 34
(2005).
[72] S. Milgram, Psy. Today 1 60 (1967).
[73] R. Monasson, Eur. Phys. J. B 12 (1999).
[74] M.E.J. Newman and D.J. Watts, Phys. Rev. Lett. A 263 341 (1999).
[75] E.A. Bender and E.R. Can, J. Combinat. Theory, Ser. A 24 296 (1978).
[76] M. Molloy and B. Reed, Rand. Struct. Algor. 6 161 (1995).
[77] M. Molloy and B. Reed, Prob. Comp. 7 295 (1998).
[78] M.E.J. Newman (edited by S. Bornholdt and H.G. Schuster), Handbook of Graphs and
Networks: From the Genome to the Internet, (Wiley-VCH, New York, 2003).
[79] M.E.J. Newman, D.J. Watts and S.H. Strogatz, Proc. Nat. Acad. Sci. USA 99 2566
(2002).
[80] M.E.J. Newman, S.H. Strogatz and D.J. Watts, Phys. Rev. E 64 26118 (2001).
239
[81] R. Milo, N. Kashtan, S. Itzkovitz, M.E.J. Newman and U. Alon. Cond. Mat. 0312028
(2003).
[82] W. Aiello, F. Chung, and L. Lu, Proceedings of the thirty-second annual ACM symposium
on Theory of computing (Portland, Oregon, ACM Press, 2000), pp. 171180.
[83] F. Chung and L. Lu, Proc. Nat. Acad. Sci. USA 99 15879 (2002).
[84] R. Cohen and S. Havlin, Phys. Rev. Lett. 90 58701 (2003).
[85] M.T. Gastner and M.E.J. Newman, The European Physical Journal B, 49 247 (2006).
[86] R. Albert, I. Albert and G.L. Nakarado, Phys. Rev. E 69 025103 (2004).
[87] R. Kinney, P. Crucitti, R. Albert and V. Latora, Eur. Phys. J. B 46 101 (2005).
[88] A. Barrat, M. Barthelemy, R. Pastor-Satorras and A. Vespignani, Proc. Nat. Acad. Sci.
USA 101 3747 (2004).
[89] R. Guimera`, S. Mossa, A. Turtschi and L.A.N. Amaral, Proc. Nat. Acad. Sci. USA 102
7794 (2005).
[90] Y. Hayashi, Physics 0512011 (2005).
[91] V. Latora and M. Marchiori, Physica A 314 109 (2002).
[92] O. Sporns, Complexity 8 (2002).
[93] M. Kaiser and C.C. Hilgetag, Phys. Rev. E 69 036103 (2004).
[94] V. Latora and M. Marchiori, Phys. Rev. Lett. 87 198701 (2001).
[95] R. Guimera`, A. D az-Guilera, F. Vega-Redondo, A. Cabrales and A. Arenas, Phys. Rev.
Lett. 89 248701 (2002).
[96] V. Goldshtein, G.A. Koganov and G.I. Surdutovich, Cond. Mat. 0409298 (2004).
[97] V. Latora and M. Marchiori, Phys. Rev. E 71 015103R (2005).
[98] M.E.J. Newman, Phys. Rev. E 64 016131 (2001).
[99] J.-P. Onnela, J. Saramaki, J. Kertesz and K. Kaski, Phys. Rev. E 71 065103(R) (2005).
[100] E. Ravasz and A.-L. Barabasi, Phys. Rev. E 67 026112 (2003).
[101] S.N. Soer and A. Vazquez, Phys. Rev. E 71 057101 (2005).
[102] H.J. Kim and J.M. Kim, Phys. Rev. E 72 036109 (2005).
[103] G. Caldarelli, R. Pastor-Satorras and A. Vespignani, Eur. Phys. J. B 38 183 (2004).
[104] H.D. Rozenfeld, J.E. Kirk, E.M. Bollt and D. ben Avraham, J. Phys. A: Math. Gen. 38
4589 (2005).
[105] P.G. Lind, M.C. Gonzalez and H.J. Herrmann, Phys. Rev. E 72 056127 (2005).
[106] K. Klemm and P.F. Stadler, Cond. Mat. 0506493 (2005).
[107] G. Bianconi and A. Capocci, Phys. Rev. Lett. 90 078701 (2003).
[108] G. Bianconi, G. Caldarelli and A. Capocci, Phys. Rev. E 71 066116 (2005).
[109] G. Bianconi and M. Marsili, J. Stat. Mech.: Theory Exper. P06005 (2005).
[110] G. Bianconi and M. Marsili, Phys. Rev. E 73 066127 (2006).
[111] V. Colizza, A. Flammini, M.A. Serrano and A. Vespignani, Nature Phys. 2 110 (2006).
[112] S. Zhou and R.J. Mondragon, Commun. Lett. IEEE 8 180 (2004).
[113] S.N. Dorogovtsev and J.F.F. Mendes, Cond. Mat. 0404593 (2004).
[114] S. Maslov and K. Sneppen, Science 296 910 (2002).
[115] M. Boguna and R. Pastor-Satorras, Phys. Rev. E 66 047104 (2002).
[116] R. Pastor-Satorras, A. Vazquez and A. Vespignani, Phys. Rev. Lett. 87 258701 (2001).
[117] M.E.J. Newman, Phys. Rev. Lett. 89 208701 (2002).
[118] M. Catanzaro, G. Caldarelli and L. Pietronero, Phys. Rev. E 70 037101 (2004).
[119] J. Park and M.E.J. Newman, Phys. Rev. E 68 026112 (2003).
[120] J. Berg, M. Lassig and A. Wagner, BMC Evolut. Biol. 4 51 (2004).
[121] M. Brede and S. Sinha, Cond. Mat. 0507710 (2005).
[122] M. di Bernardo, F. Garofalo and F. Sorrentino, Cond. Mat. 0506236 (2005).
[123] M. di Bernardo, F. Garofalo and F. Sorrentino, Int. J. Bifurc. Chaos, in press (2006).
[124] N. Madar, T. Kalisky, R. Cohen, D. ben Avraham and S. Havlin, Eur. Phys. J. B 38 269
(2004).
[125] T. Zhou, Z.-Q. Fu and B.-H. Wang, Prog. Nat. Sci. 16 452 (2006).
[126] S. Gupta, R.M. Anderson and R.M. May, AIDS 03 807 (1989).
[127] M.E.J. Newman, Phys. Rev. E 67 026126 (2003).
[128] P. Holme, F. Liljeros, C.R. Edling and B.J. Kim, Phys. Rev. E 68 056107 (2003).
[129] E. Estrada and J.A. Rodr guez-Velaquez, Phys. Rev. E 72 046105 (2005).
240
[130]
[131]
[132]
[133]
[134]
[135]
[136]
[137]
[138]
[139]
[140]
[141]
[142]
[143]
[144]
[145]
[146]
[147]
[148]
[149]
[150]
[151]
[152]
[153]
[154]
[155]
[156]
[157]
[158]
[159]
[160]
[161]
[162]
[163]
[164]
[165]
[166]
[167]
[168]
[169]
[170]
[171]
[172]
L. da F. Costa et al.
F. Reif, Fundamentals of Statistical and Thermal Physics (McGraw-Hill, London, 1965).
L. Brillouin, Science and Information Theory (Dover Phoenix Editions, 2004).
L.E. Reichl, A Modern Course in Statistical Physics (Wiley-Interscience, 1998).
C.E. Shannon and W. Weaver, The Mathematical Theory of Communication (University
of Illinois, 1963).
B. Wang, H. Tang, C. Guo and Z. Xiu, Cond. Mat. 0506725 (2005).
L. Demetrius and T. Manke, Physica A 346 682 (2004).
R.V. Sole and S. Valverde, Lecture Notes in Phyics (Springer, Berlin, 2004), pp. 169190.
K. Sneppen, A. Trusina and M. Rosvall, Europhys. Lett. 69 853 (2005).
M. Rosvall and K. Sneppen, Cond. Mat. 0604036 (2006).
M. Rosvall, A. Trusina, P. Minhagen and K. Sneppen, Phys. Rev. Lett. 94 028701
(2005).
A. Trusina, M. Rosvall and K. Sneppen, Phys. Rev. Lett. 94 238701 (2005).
M. Rosvall, A. Gronlund, P. Minnhagen and K. Sneppen, Phys. Rev. E 72 046117
(2005).
G. Bianconi, Cond. Mat. 0606365 (2006).
L.C. Freeman, Sociometry 40 35 (1977).
A. Arenas, A. Cabrales, A. D az-Guilera, R. Guimera` and F. Vega-Redondo, Statistical
Mechanics of Complex Networks, volume 625 of Lecture Notes in Physics (Springer,
Berlin, 2003).
M.E.J. Newman, Soci. Networ. 27 39 (2005).
D. Koschutzki, K.A. Lehmann, L. Peeters, S. Richter, D. Tenfelde-Podehl and
O. Zlotowski, Lecture Notes in Computer Science, 3418 (2005).
I.J. Farkas, I. Derenyi, A.-L. Barabasi and T. Vicsek, Phys. Rev. E 64 026704 (2001).
K.-I. Goh, B. Kahng and D. Kim, Phys. Rev. E 64 051903 (2001).
V. Rosato and F. Tiriticco, Eur. Lett. 66 471 (2004).
M.L. Mehta, Random Matrices (Academic Press, London, 1991).
A.J. Seary and W.D. Richards, Dynamic Social Network Modeling and Analysis
(National Academy Press, 2003), pp. 209228.
A. Arenas, L. Danon, A. D az-Guilera, P.M. Gleiser and R. Guimera`, Eur. Phys. J. B 38
373 (2004).
P.M. Gleiser and L. Danon, Adv. Complex Syst. 6 (2003).
R. Guimera` and L.A.N. Amaral, Nature 433 895 (2005).
F. Radicchi, C. Castellano, F. Cecconi, V. Loreto and D. Parisi, Proc. Nat. Acad. Sci.
USA 101 2658 (2004).
J. Reichardt and S. Bornholdt, Cond. Mat. 0603718 (2006).
J. Reichardt and S. Bornholdt, Cond. Mat. 0606220 (2006).
J. Reichardt and S. Bornholdt, Phys. Rev. Lett. 93 218701 (2004).
M.E.J. Newman and M. Girvan, Phys. Rev. E 69 026113 (2004).
G. Schlosser and G.P. Wagner, Modularity in Development and Evolution (University of
Chicago Press, Chicago, 2004).
E. Ziv, M. Middendorf and C.H. Wiggins, Physical Review E 71 046117 (2005).
M.E.J. Newman, Eur. Phys. J. B 38 321 (2004).
L. Danon, J. Duch, A. Arenas and A. D az-Guilera, J. Statist. Mech.: Theory Exper.
P09008 (2005).
A.J. Seary and W.D. Richards, Proceedings of the International Conference on Social
Networks, volume 1 (1995).
M.E.J. Newman, Physics 0605087 (2006).
M Fiedler, Czechosl. Math. J. 23 298 (1973).
A. Pothen, H. Simon and K.P. Liou, SIAM J. Matrix Anal. Appl. 11 430 (1990).
A. Capocci, V.D.P. Servedio, G. Caldarelli and F. Colaiori, Phys. A 352 669 (2005).
M.E.J. Newman, Proc. Nat. Acad. Sci. USA 103 8577 (2006).
J.R. Tyler, D.M. Wilkinson and B.A. Huberman, Proceedings of the First International
Conference on Communities and Technologies (2003).
L. da F. Costa, Phys. Rev. E 70 056106 (2004).
M.R. Anderberg, Cluster analysis for applications (Academic Press, London, 1973).
241
[173] A.K. Jain and R.C. Dubes, Algorithms for clustering data (Prentice Hall, New York,
1988).
[174] H.C. Romesburg, Cluster analysis for researchers (Robert E. Krieger, London, 1990).
[175] J. Hopcroft, O. Khan, B. Kulis and B. Selman. Proc. Nat. Acad. Sci. USA 101 5249
(2004).
[176] A. Clauset, M.E.J. Newman and C. Moore, Phys. Rev. E 70 066111 (2004).
[177] L. Danon, A. D az-Guilera and A. Arenas, Physics. 0601144 (2006).
[178] J. Duch and A. Arenas, Phys. Rev. E 72 027104 (2005).
[179] M.E.J. Newman, Phys. Rev. E 69 026113 (2004).
[180] J.P. Bagrow and E.M. Bollt, Phys. Rev. E 72 046108 (2005).
[181] A. Clauset, Phys. Rev. E 72 026132 (2005).
[182] S.N. Dorogovtsev, A.V. Goltsev and J.F.F. Mendes, Phys. Rev. Lett. 96 40601 (2006).
[183] J.I. Alvarez-Hamelin, L. DallAsta, A. Barrat and A. Vespignani, cs.NI/0504107 (2005).
[184] S.N. Dorogovtsev, J.F.F. Mendes, A.M. Povolotsky and A.N. Samukhin, Phys. Rev.
Lett. 95 195701 (2005).
[185] A.V. Goltsev, S.N. Dorogovtsev and J.F.F. Mendes, Phys. Rev. E 73 056101 (2006).
[186] S. Carmi, S. Havlin, S. Kirkpatrick, Y. Shavitt and E. Shir, cs.NI/0607080 (2006).
[187] S. Wuchty and E. Almaas, Proteomics 5 444 (2005).
[188] M. Altaf-Ul-Amin, K. Nishikata, T. Koma, T. Miyasato, Y. Shinbo, M. Arifuzzaman,
C. Wada, M. Maeda, T. Oshima, H. Mori and S. Kanaya, Genome Inform. 14 498
(2003).
[189] S.S. Shen-Orr, R. Milo, S. Mangan and U. Alon, Nature Genet. 31 64 (2002).
[190] R. Milo, S. Shen-Orr, S. Itzkovitz, N. Kashtan, D. Chklovskii and U. Alon, Science 298
824 (2002).
[191] M. Middendorf, E. Ziv and C.H. Wiggins, Proc. Nat. Acad. Sci. 102 3192 (2005).
[192] R. Milo, S. Itzkovitz, N. Kashtan, R. Levitt, S. Shen-Orr, I. Ayzenshtat, M. Sheer and
U. Alon, Science 303 1538 (2004).
[193] E. Estrada and J.A. Rodr guez-Velazquez, Phys. Rev. E 71 056103 (2005).
[194] L. Vincent, Sig. Process. 16 365 (1989).
[195] E.R. Dougherty and R.A. Lotufo, Hands-on Morphological Image Processing (SPIE
Press, New York, 2003).
[196] H. Heijmans, P. Nacken, A. Toet and L. Vincent, J. Visual Commun. Image Repres.
3 24 (1990).
[197] M.P. Viana and L. da F. Costa, Cond. Mat. 0504346 (2005).
[198] L. da F. Costa, Phys. Rev. Lett. 93 098702 (2004).
[199] L. da F. Costa and L.E.C. da Rocha, Eur. Phys. J. B 50 (2006).
[200] L. da F. Costa and F.N. Silva, J. Stat. Phys. in press (2004).
[201] C. Song, S. Havlin and H.A. Makse, Nature 433 392 (2005).
[202] B. Machta and J. Machta, Phys. Rev. E 71 026704 (2005).
[203] B. Codenotti and M. Leoncini, Introduction to Parallel Processing (Addison-Wesley,
London, 1993).
[204] H. Meyer-Ortmanns, Cond. Mat. 0311109 (2003).
[205] J.C. Claussen, q-bio, MN/0410024 (2004).
[206] D. Garlaschelli and M.I. Loredo, Phys. Rev. Lett. 93 268701 (2004).
[207] M. Kaiser and C.C. Hilgetag, Biol. Cybern. 90 311 (2004).
[208] L. da F. Costa, M. Kaiser and C. Hilgetag, Physics 0607272 (2006).
[209] P.L. Krapivsky and S. Redner, J. Phys. A: Math. Gen. 35 (2002).
[210] A.L. Edwards, An Introduction to Linear Regression and Correlation (W.H. Freeman and
Co, San Francisco, 1993).
[211] G.J. McLachlan, Discriminant Analysis and Statistical Pattern Recognition (Wiley,
London, 2004).
[212] K. Fukunaga, Introduction to Statistical Pattern Recognition (Academic Press,
New York, 1990).
[213] R.A. Johnson and D.W. Wichern, Applied Multivariate Statistical Analysis (Prentice
Hall, New York, 2002).
242
L. da F. Costa et al.
[214] P.R.R. Prado, F.F. Franco, M.H. Manfrin and L. da F. Costa, Proceedings of the
Third Brazilian Symposium of Mathematical and Computacional Biology I (E-papers
publishing, Rio de Janeiro, 2004) pp. 329340.
[215] V. Batagelj and A. Mrvar, Pajek datasets. University of Ljubljana, Slovenia, http://
vlado.fmf.uni-lj.si/pub/networks/data (2006).
[216] R. Guimera` and L.A.N. Amaral, Eur. Phys. J. B 38 381 (2004).
[217] H. Jeong, S.P. Mason, A.-L. Barabasi and Z.N. Oltvai, Nature 411 41 (2001).
[218] S.H. Yook, H. Jeong and A.-L. Barabasi, Proc. Nat. Acad. Sci. USA 99 13382 (2002).
[219] D. Stoyan, W.S. Kendall and J. Mecke, Stochastic Geometry and Its Applications
(John Wiley and Sons, London, 1996).
[220] J.F. Hair, R.E. Anderson, R.L. Tatham and W.C. Black, Multivariate Data Analysis
(Prentice-Hall Int. Inc., New Jersey, 1998).