Traps and pitfalls of topic-biased PageRank / P. Boldi, R. Posenato, M. Santini, S. Vigna - In: Algorithms and Models for the Web-Graph : Fourth International Workshop, WAW 2006, Banff, Canada, November 30-December 1, 2006 : Revised Papers / [edited by] W. Aiello, A. Broder, J. Janssen, E. Milios. - Berlin : Springer, 2008. - ISBN 9783540788072. - pp. 107-116. Paper presented at the 4th Workshop on Algorithms and Models for the Web-Graph, held in Banff, Canada, in 2006 [10.1007/978-3-540-78808-9_10].
Traps and pitfalls of topic-biased PageRank.
P. Boldi; M. Santini; S. Vigna
2008
Abstract
We discuss a number of issues in the definition, computation and comparison of PageRank values that have been addressed sparsely in the literature, often with contradictory approaches. We study the difference between weakly and strongly preferential PageRank, which patch the dangling nodes with different distributions, extending analytical formulae known for the strongly preferential case, and corroborating our results with experiments on a snapshot of 100 million pages of the .uk domain. The experiments show that the two PageRank versions are poorly correlated, and that results about each one cannot be blindly applied to the other; moreover, our computations highlight some new concerns about the use of exchange-based correlation indices (such as Kendall's τ) on approximated rankings.