A prefix search returns the strings out of a given collection S that start with a given prefix. Traditionally, prefix search is solved by data structures that are also dictionaries, that is, they actually contain the strings in S. For very large collections stored in slow-access memory, we propose extremely compact data structures that solve weak prefix searches—they return the correct result only if some string in S starts with the given prefix. Our data structures for weak prefix search use O(|S |log ℓ) bits in the worst case, where ℓ is the average string length, as opposed to O(|S|ℓ) bits for a dictionary. We show a lower bound implying that this space usage is optimal.

Fast prefix search in little space, with applications / D. Belazzougui, P. Boldi, R. Pagh, S. Vigna - In: Algorithms – ESA 2010 : 18th Annual European Symposium, Liverpool, UK, September 6-8, 2010 : roceedings, part I / [a cura di] M. de Berg, U. Meyer. - Berlin : Springer, 2010. - ISBN 9783642157752. - pp. 427-438 (( Intervento presentato al 18. convegno Annual European Symposium on Algorithms tenutosi a Liverpool, UK nel 2010 [10.1007/978-3-642-15775-2_37].

Fast prefix search in little space, with applications

P. Boldi
Secondo
;
S. Vigna
Ultimo
2010

Abstract

A prefix search returns the strings out of a given collection S that start with a given prefix. Traditionally, prefix search is solved by data structures that are also dictionaries, that is, they actually contain the strings in S. For very large collections stored in slow-access memory, we propose extremely compact data structures that solve weak prefix searches—they return the correct result only if some string in S starts with the given prefix. Our data structures for weak prefix search use O(|S |log ℓ) bits in the worst case, where ℓ is the average string length, as opposed to O(|S|ℓ) bits for a dictionary. We show a lower bound implying that this space usage is optimal.
Settore INF/01 - Informatica
2010
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/154385
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 36
  • ???jsp.display-item.citation.isi??? 26
social impact