Although web crawlers have been around for twenty years by now, there is virtually no freely available, open-source crawling software that guarantees high throughput, over- comes the limits of single-machine tools and at the same time scales linearly with the amount of resources available. This paper aims at filling this gap.

BUbiNG: Massive crawling for the masses / P. Boldi, A. Marino, M. Santini, S. Vigna - In: WWW 2014 Companion : Proceedings of the 23rd International Conference on World Wide Web[s.l] : ACM, 2014. - ISBN 9781450327459. - pp. 227-228 (( Intervento presentato al 23. convegno WWW tenutosi a Seoul nel 2014 [10.1145/2567948.2577304].

BUbiNG: Massive crawling for the masses

P. Boldi
Primo
;
A. Marino
Secondo
;
M. Santini
Penultimo
;
S. Vigna
Ultimo
2014

Abstract

Although web crawlers have been around for twenty years by now, there is virtually no freely available, open-source crawling software that guarantees high throughput, over- comes the limits of single-machine tools and at the same time scales linearly with the amount of resources available. This paper aims at filling this gap.
Computer Networks and Communications; Software
Settore INF/01 - Informatica
2014
Google
International World Wide Web Conference Steering Committee (IW3C2)
Microsoft
NAVER
SK Planet
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
p227-boldi.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 314.71 kB
Formato Adobe PDF
314.71 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/495566
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 64
  • ???jsp.display-item.citation.isi??? 47
social impact