This chapter discusses existing approaches to evaluate and measure structural similarity in sources of XML documents. A relevant peculiarity of XML documents, indeed, is that information on the document structure is available in the document itself. In the chapter we present different approaches aiming at evaluating structural similarity at three different levels: among documents, between a document and a schema, and among schemas. The most relevant applications of such measures are for document classification and schema extraction, and for document and schema structural clustering, though other interesting applications such as document change detection and structural querying can be devised, and will be discussed throughout the chapter.
|Titolo:||Evaluation and Applications of Structural Similarity Measures in Sources of XML Documents|
|Parole Chiave:||XML, Similarity measures, clustering, classification|
|Settore Scientifico Disciplinare:||Settore INF/01 - Informatica|
|Data di pubblicazione:||2006|
|Digital Object Identifier (DOI):||10.4018/978-1-59140-655-6|
|Tipologia:||Book Part (author)|
|Appare nelle tipologie:||03 - Contributo in volume|