In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the 'Expo2015' event as target entity.
Exploratory analysis of large web datasets / S. Castano, A. Ferrara, S. Montanelli - In: Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI), 2015 IEEE 1st International Forum on[s.l] : IEEE, 2015. - ISBN 9781467381666. - pp. 243-248 (( Intervento presentato al 1. convegno International Forum on Research and Technologies for Society and Industry tenutosi a Torino nel 2015 [10.1109/RTSI.2015.7325105].
Exploratory analysis of large web datasets
S. CastanoPrimo
;A. FerraraSecondo
;S. MontanelliUltimo
2015
Abstract
In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the 'Expo2015' event as target entity.File | Dimensione | Formato | |
---|---|---|---|
07325105.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
794.59 kB
Formato
Adobe PDF
|
794.59 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.