In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the 'Expo2015' event as target entity.

Exploratory analysis of large web datasets / S. Castano, A. Ferrara, S. Montanelli - In: Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI), 2015 IEEE 1st International Forum on[s.l] : IEEE, 2015. - ISBN 9781467381666. - pp. 243-248 (( Intervento presentato al 1. convegno International Forum on Research and Technologies for Society and Industry tenutosi a Torino nel 2015 [10.1109/RTSI.2015.7325105].

Exploratory analysis of large web datasets

S. Castano
Primo
;
A. Ferrara
Secondo
;
S. Montanelli
Ultimo
2015

Abstract

In the era of big data, the capability to identify very quickly prominent summary information about a target entity of interest, like a person or an event, from large datasets is essential, and exploratory analysis techniques help in this direction. In this paper, we provide a solution based on smart entity views and on pre-defined analysis operators which exploit keywords available in the entity view together with similarity information to produce summary information about the view contents from both a thematic and analytics perspective. In particular, smart entity views can be analyzed according to the following exploratory paradigms: entity expansion, entity visualization, and entity analytics. The proposed approach is discussed by referring to a case study of twitter dataset related to the 'Expo2015' event as target entity.
Settore INF/01 - Informatica
2015
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
07325105.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 794.59 kB
Formato Adobe PDF
794.59 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/387148
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
social impact