A prefix search returns the strings out of a given collection S that start with a given prefix. Traditionally, prefix search is solved by data structures that are also dictionaries, that is, they actually contain the strings in S. For very large collections stored in slow-access memory, we propose extremely compact data structures that solve weak prefix searches—they return the correct result only if some string in S starts with the given prefix. Our data structures for weak prefix search use O(|S |log ℓ) bits in the worst case, where ℓ is the average string length, as opposed to O(|S|ℓ) bits for a dictionary. We show a lower bound implying that this space usage is optimal.
Fast prefix search in little space, with applications / D. Belazzougui, P. Boldi, R. Pagh, S. Vigna - In: Algorithms – ESA 2010 : 18th Annual European Symposium, Liverpool, UK, September 6-8, 2010 : roceedings, part I / [a cura di] M. de Berg, U. Meyer. - Berlin : Springer, 2010. - ISBN 9783642157752. - pp. 427-438 (( Intervento presentato al 18. convegno Annual European Symposium on Algorithms tenutosi a Liverpool, UK nel 2010 [10.1007/978-3-642-15775-2_37].
Fast prefix search in little space, with applications
P. BoldiSecondo
;S. VignaUltimo
2010
Abstract
A prefix search returns the strings out of a given collection S that start with a given prefix. Traditionally, prefix search is solved by data structures that are also dictionaries, that is, they actually contain the strings in S. For very large collections stored in slow-access memory, we propose extremely compact data structures that solve weak prefix searches—they return the correct result only if some string in S starts with the given prefix. Our data structures for weak prefix search use O(|S |log ℓ) bits in the worst case, where ℓ is the average string length, as opposed to O(|S|ℓ) bits for a dictionary. We show a lower bound implying that this space usage is optimal.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.