Normalization and convergence

Here again is our example input to the NetRank algorithm:

3 nodes

The table below shows iterations 0-7 of the NetRank calculation for this graph:

Node x
NR0(x)
NR1(x)
NR2(x)
NR3(x)
NR4(x)
NR5(x)
NR6(x)NR7(x)
1
1
2 2 3 4 5
7 9
2
1
1 2 2 3 4 5 7
3
1
1 1 2 2 3 4 5

The scores are different and bigger with each iteration. Evidently these numbers will change and grow for as long as the NetRank calculation continues to iterate. These observations lead us to two questions:

  • In what sense are the influence scores improving with each iteration?
  • How many iterations must be done before the algorithm can stop?

We answer these questions by introducing normalization and convergence.

Normalization is a calculation that makes it easier to compare one column of numbers to another column. Normalizing column NRi of the above table involves two steps: (1) add the numbers in column NRi, and (2) divide each of the numbers in column NRi by that sum.

Totals
of each
column
above
34
5
7 912
1621
  Dividing by above totals yields normalized data below
Node x
normalized
NR0(x)
normalized
NR1(x)
normalized
NR2(x)
normalized
NR3(x)
normalized
NR4(x)
normalized
NR5(x)
normalized
NR6(x)
normalized
NR7(x)
1
1/3
2/4 2/5 3/7 4/9 5/12
7/16 9/21
2
1/3
1/4 2/5 2/7 3/9 4/12 5/16 7/21
3
1/3
1/4 1/5 2/7 2/9 3/12 4/16 5/21

The table above shows fractions to highlight the arithmetic of normalization. We write the exact same table a second time using decimals, in order to make it easier to compare one column of numbers to another (which is the point of normalization).

Totals
of each
column
34
5
7 912
1621
  The normalized fractions above are rewritten below as decimals
Node x
normalized
NR0(x)
normalized
NR1(x)
normalized
NR2(x)
normalized
NR3(x)
normalized
NR4(x)
normalized
NR5(x)
normalized
NR6(x)
normalized
NR7(x)
1
0.33
0.50 0.40 0.43 0.44 0.42
0.44 0.43
2
0.33
0.25 0.40 0.29 0.33 0.33 0.31 0.33
3
0.33
0.25 0.20 0.29 0.29 0.25 0.25 0.24

Convergence is a property illustrated by the above table of decimals. With each iteration, the changes from one column to the next become smaller and smaller. If we were to expand the above table to show column NR16, we would see that, after normalization, NR16(1) = 0.43, NR16(2) = 0.32, and NR16(3) = 0.25. Furthermore, the normalized NRi values would never change (to the nearest 0.01) after iteration 16.

This point -- when further iterations of a calculation do not change the answer according to our desired level of precision (e.g., 0.01) -- is the point of convergence.

By normalizing the results of the NetRank calculation, we find that after enough iterations the values will converge. How many iterations is enough depends on (1) our desired level of precision, and (2) the input graph. More precision and larger graphs will require more iterations for convergence to occur.

Conclusion: Having started with the following input graph:

3 nodes

We have used NetRank to calculate the relative influence of each node of the graph.

  • The normalized NetRank score of node 1 is 0.43
  • The normalized NetRank score of node 2 is 0.32
  • The normalized NetRank score of node 3 is 0.25
 
Joomla Templates by Joomlashack