The acquisition and integration of data contained in spreadsheet tables is a complex task because they do not impose any regular structure on the organization of the data, or constraints on valid values. Moreover, mistakes can occur due to the passage from a format to another one or misspelt words in the original sources. The automatic extraction of their content, interpretation and integration is thus a complex task. In this paper, we outline the characteristics of a semi-automatic, interactive tool conceived for creating a knowledge base by extracting semantic information from heterogeneous spreadsheets.
A Web Tool for the Semantic Integration of Heterogeneous and Complex Spreadsheet Tables / S. Bonfitto, L. Cappelletti, E. Casiraghi, P. Perlasca, F. Trovato, G. Valentini, M. Mesiti (CEUR WORKSHOP PROCEEDINGS). - In: SEBD 2021 : Italian Symposium on Advanced Database Systems / [a cura di] S. Greco, M. Lenzerini, E. Masciari, A. Tagarelli *. - [s.l] : CEUR, 2021. - pp. 116-127 (( Intervento presentato al 29. convegno Italian Symposium on Advanced Database Systems tenutosi a Pizzo Calabro nel 2021.
A Web Tool for the Semantic Integration of Heterogeneous and Complex Spreadsheet Tables
S. Bonfitto
Membro del Collaboration Group
;L. CappellettiMembro del Collaboration Group
;E. CasiraghiMembro del Collaboration Group
;P. PerlascaMembro del Collaboration Group
;G. ValentiniMembro del Collaboration Group
;M. MesitiSupervision
2021
Abstract
The acquisition and integration of data contained in spreadsheet tables is a complex task because they do not impose any regular structure on the organization of the data, or constraints on valid values. Moreover, mistakes can occur due to the passage from a format to another one or misspelt words in the original sources. The automatic extraction of their content, interpretation and integration is thus a complex task. In this paper, we outline the characteristics of a semi-automatic, interactive tool conceived for creating a knowledge base by extracting semantic information from heterogeneous spreadsheets.File | Dimensione | Formato | |
---|---|---|---|
SEBD_pubblicato.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
1.51 MB
Formato
Adobe PDF
|
1.51 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.