A Web Tool for the Semantic Integration of Heterogeneous and Complex Spreadsheet Tables / S. Bonfitto, L. Cappelletti, E. Casiraghi, P. Perlasca, F. Trovato, G. Valentini, M. Mesiti (CEUR WORKSHOP PROCEEDINGS). - In: SEBD 2021 : Italian Symposium on Advanced Database Systems / edited by S. Greco, M. Lenzerini, E. Masciari, A. Tagarelli. - [s.l.] : CEUR, 2021. - pp. 116-127 (Paper presented at the 29th Italian Symposium on Advanced Database Systems, held in Pizzo Calabro in 2021).

A Web Tool for the Semantic Integration of Heterogeneous and Complex Spreadsheet Tables

S. Bonfitto
Member of the Collaboration Group
;
L. Cappelletti
Member of the Collaboration Group
;
E. Casiraghi
Member of the Collaboration Group
;
P. Perlasca
Member of the Collaboration Group
;
G. Valentini
Member of the Collaboration Group
;
M. Mesiti
Supervision
2021

Abstract

Acquiring and integrating the data contained in spreadsheet tables is a complex task because spreadsheets impose neither a regular structure on the organization of the data nor constraints on valid values. Moreover, errors can arise from conversion between formats or from misspelt words in the original sources. The automatic extraction, interpretation and integration of spreadsheet content is therefore difficult. In this paper, we outline the characteristics of a semi-automatic, interactive tool conceived for creating a knowledge base by extracting semantic information from heterogeneous spreadsheets.
Heterogeneous Spreadsheet Tables; Semantic Table Interpretation; User Interfaces; Machine Learning
Sector INF/01 - Computer Science
2021
http://ceur-ws.org/Vol-2994/paper11.pdf
Book Part (author)
Files in this item:
SEBD_pubblicato.pdf (open access; type: Publisher's version/PDF; size: 1.51 MB; format: Adobe PDF)
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/880638