We discuss a number of issues in the definition, computation and comparison of PageRank values that have been addressed sparsely in the literature, often with contradictory approaches. We study the difference between weakly and strongly preferential PageRank, which patch the dangling nodes with different distributions, extending analytical formulae known for the strongly preferential case, and corroborating our results with experiments on a snapshot of 100 millions of pages of the .uk domain. The experiments show that the two PageRank versions are poorly correlated, and results about each one cannot be blindly applied to the other; moreover, our computations highlight some new concerns about the usage of exchange-based correlation indices (such as Kendall's τ) on approximated rankings.

Traps and pitfalls of topic-biased PageRank / P. Boldi, R. Posenato, M. Santini, S. Vigna - In: Algorithms and Models for the Web-Graph : Fourth International Workshop, WAW 2006, Banff, Canada, November 30-December 1, 2006 : Revised Papers / [a cura di] W. Aiello, A. Broder, J. Janssen, E. Milios. - Berlin : Springer, 2008. - ISBN 9783540788072. - pp. 107-116 (( Intervento presentato al 4. convegno Workshop on Algorithms and Models for the Web-Graph tenutosi a Banff, Canada nel 2006 [10.1007/978-3-540-78808-9_10].

Traps and pitfalls of topic-biased PageRank.

P. Boldi
Primo
;
M. Santini
Penultimo
;
S. Vigna
Ultimo
2008

Abstract

We discuss a number of issues in the definition, computation and comparison of PageRank values that have been addressed sparsely in the literature, often with contradictory approaches. We study the difference between weakly and strongly preferential PageRank, which patch the dangling nodes with different distributions, extending analytical formulae known for the strongly preferential case, and corroborating our results with experiments on a snapshot of 100 millions of pages of the .uk domain. The experiments show that the two PageRank versions are poorly correlated, and results about each one cannot be blindly applied to the other; moreover, our computations highlight some new concerns about the usage of exchange-based correlation indices (such as Kendall's τ) on approximated rankings.
PageRank ; Kendall's τ ;
Settore INF/01 - Informatica
2008
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/47667
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 2
social impact