In this paper we propose a matching algorithm for measuring the structural similarity between an XML document and a DTD. The matching algorithm, by comparing the document structure against the one the DTD requires, is able to identify commonalities and differences. Differences can be due to the presence of extra elements with respect to those the DTD requires and to the absence of required elements. The evaluation of commonalities and differences gives raise to a numerical rank of the structural similarity. Moreover, in the paper, some applications of the matching algorithm are discussed. Specifically, the matching algorithm is exploited for the classification of XML documents against a set of DTDs, the evolution of the DTD structure, the evaluation of structural queries, the selective dissemination of XML documents, and the protection of XML document contents.

A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications / E. Bertino, G. Guerrini, M. Mesiti. - In: INFORMATION SYSTEMS. - ISSN 0306-4379. - 29:1(2004 Mar), pp. 23-46.

A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications

M. Mesiti
Ultimo
2004

Abstract

In this paper we propose a matching algorithm for measuring the structural similarity between an XML document and a DTD. The matching algorithm, by comparing the document structure against the one the DTD requires, is able to identify commonalities and differences. Differences can be due to the presence of extra elements with respect to those the DTD requires and to the absence of required elements. The evaluation of commonalities and differences gives raise to a numerical rank of the structural similarity. Moreover, in the paper, some applications of the matching algorithm are discussed. Specifically, the matching algorithm is exploited for the classification of XML documents against a set of DTDs, the evolution of the DTD structure, the evaluation of structural queries, the selective dissemination of XML documents, and the protection of XML document contents.
Structural similarity ; XML document handling and querying ; document classification ; structure evolution ; structural queries ; selective dissemination of documents ; document protection
Settore INF/01 - Informatica
mar-2004
Article (author)
File in questo prodotto:
File Dimensione Formato  
AmatchingAlgorithm.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 460.31 kB
Formato Adobe PDF
460.31 kB Adobe PDF Visualizza/Apri
BGM03.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 482.56 kB
Formato Adobe PDF
482.56 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/173359
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 99
  • ???jsp.display-item.citation.isi??? 65
social impact