Approximate Subtree Identification in Heterogeneous XML Documents Collections

Sanz, I.; Mesiti, M.; Guerrini, G.; Berlanga Llavori, R.

doi:10.1007/11547273_1

Due to the heterogeneous nature of XML data for internet applications, exact matching of queries is often inadequate. The need arises to quickly identify subtrees of XML documents in a collection that are similar to a given pattern. In this paper we discuss different similarity measures between a pattern and subtrees of documents in the collection. We then introduce an efficient algorithm that uses indexing structures to identify document subtrees approximately conforming to the pattern.

Approximate Subtree Identification in Heterogeneous XML Documents Collections / I. Sanz, M. Mesiti, G. Guerrini, R. Berlanga Llavori. - In: Database and XML Technologies / [edited by] S. Bressan, S. Ceri, E. Hunt, Z.G. Ives, Z. Bellahsène, M. Rys, R. Unland. - Berlin : Springer, 2005 Aug 28. - ISBN 9783540285830. - pp. 192-206 (Paper presented at the 3rd XML Database Symposium, held in Trondheim in 2005).



Keywords: XML; Approximate retrieval; subtree matching
Scientific-disciplinary sectors of the contribution (display only): Sector INF/01 - Informatica (Computer Science)
Publication date: 28 Aug 2005
DOI: https://dx.doi.org/10.1007/11547273_1
Type: Book Part (author)
Appears in types: 03 - Contribution in volume

Files in this item:

File	Size	Format
Sanz2005_Chapter_ApproximateSubtreeIdentificati.pdf (restricted access; type: Publisher's version/PDF)	368.32 kB	Adobe PDF


Use this identifier to cite or link to this document: https://hdl.handle.net/2434/6030

