Minimal perfect hash functions have been shown to be useful to compress data in several data management tasks. In particular, order-preserving minimal perfect hash functions have been used to retrieve the position of a key in a given list of keys: however, the ability to preserve any given order leads to an unavoidable Ω(n log n) lower bound on the number of bits required to store the function. Recently, it was observed that very frequently the keys to be hashed are sorted in their intrinsic (i.e., lexicographical) order. This is typically the case of dictionaries of search engines, list of URLs of web graphs, etc. We refer to this restricted version of the problem as monotone minimal perfect hashing. We analyse experimentally the data structures proposed in our paper "Monotone Minimal Perfect Hashing: Searching a Sorted Table with O(1) Accesses", and along our way we propose some new methods that, albeit asymptotically equivalent or worse, perform very well in practise, and provide a balance between access speed, ease of construction, and space usage.

Theory and practise of monotone minimal perfect hashing / D. Belazzougui, R. Pagh, P. Boldi, S. Vigna - In: ACM-SIAM Symposium on discrete algorithms : January 4-6, 2009, New York Marriott Downtime / [a cura di] M. Claire. - Philadelphia : SIAM, 2009. - pp. 132-144 (( Intervento presentato al 10. convegno Workshop on Algorithm Engineering and Experiments (ALENEX) tenutosi a New York nel 2009.

Theory and practise of monotone minimal perfect hashing

P. Boldi
Penultimo
;
S. Vigna
Ultimo
2009

Abstract

Minimal perfect hash functions have been shown to be useful to compress data in several data management tasks. In particular, order-preserving minimal perfect hash functions have been used to retrieve the position of a key in a given list of keys: however, the ability to preserve any given order leads to an unavoidable Ω(n log n) lower bound on the number of bits required to store the function. Recently, it was observed that very frequently the keys to be hashed are sorted in their intrinsic (i.e., lexicographical) order. This is typically the case of dictionaries of search engines, list of URLs of web graphs, etc. We refer to this restricted version of the problem as monotone minimal perfect hashing. We analyse experimentally the data structures proposed in our paper "Monotone Minimal Perfect Hashing: Searching a Sorted Table with O(1) Accesses", and along our way we propose some new methods that, albeit asymptotically equivalent or worse, perform very well in practise, and provide a balance between access speed, ease of construction, and space usage.
Algoritmi ; strutture dati succinte
Settore INF/01 - Informatica
2009
SIAM
ACM
http://www.siam.org/proceedings/alenex/2009/alx09_013_belazzouguid.pdf
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/49218
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 12
  • ???jsp.display-item.citation.isi??? ND
social impact