Declarative web data extraction and annotation / C. Bernardoni, G. Fiumara, M. Marchi, A. Provetti. In: Proceedings of the 20th Workshop on Logic Programming, WLP 2006 / edited by M. Fink, H. Tompits, S. Woltran. [s.l.]: Institut für Informationssysteme Arbeitsbereich, 2006, pp. 137-144. (Paper presented at the 20th Workshop on Logic Programming, WLP 2006, held in Vienna in 2006.)
Declarative web data extraction and annotation
M. Marchi (penultimate author); A. Provetti (last author)
2006
Abstract
We propose a software architecture for semantics-based annotation of data extracted from Web sources. Starting from the LiXto suite, which enables semi-automated extraction of XML data from regular documents, we present a solution for attaching background information to individual tags by means of so-called decorations. Decoration is carried out as an inferential activity in the formal context of Answer Set Programming. We discuss a motivating example that will serve as a validation of our approach.

File | Size | Format |
---|---|---|
25-final.pdf (open access; description: final version; type: publisher's version/PDF) | 131.09 kB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.