Although web crawlers have been around for twenty years by now, there is virtually no freely available, open-source crawling software that guarantees high throughput, over- comes the limits of single-machine tools and at the same time scales linearly with the amount of resources available. This paper aims at filling this gap.
BUbiNG: Massive crawling for the masses / P. Boldi, A. Marino, M. Santini, S. Vigna - In: WWW 2014 Companion : Proceedings of the 23rd International Conference on World Wide Web[s.l] : ACM, 2014. - ISBN 9781450327459. - pp. 227-228 (( Intervento presentato al 23. convegno WWW tenutosi a Seoul nel 2014 [10.1145/2567948.2577304].
BUbiNG: Massive crawling for the masses
P. BoldiPrimo
;A. MarinoSecondo
;M. SantiniPenultimo
;S. VignaUltimo
2014
Abstract
Although web crawlers have been around for twenty years by now, there is virtually no freely available, open-source crawling software that guarantees high throughput, over- comes the limits of single-machine tools and at the same time scales linearly with the amount of resources available. This paper aims at filling this gap.File | Dimensione | Formato | |
---|---|---|---|
p227-boldi.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
314.71 kB
Formato
Adobe PDF
|
314.71 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.