Crowdclustering has been recently proposed to engage humans in automated categorization tasks and it shows to be effective especially when digital resources are involved, with complex features to be abstracted for an automated procedure, like images or multimedia resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of digital resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2 , crowd workers actively participate to clustering activities i) by resolving tasks in which they are asked to visually recognize groups of similar resources and ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified as it will be discussed in the experimental evaluation.

Crowdclustering digital resources / S. Castano, A. Ferrara, S. Montanelli - In: Italian Symposium on Advanced Database Systems / [a cura di] M.A. Bochicchio, G. Mecca. - [s.l] : Matematicamente.it, 2016. - ISBN 9788896354889. - pp. 7-18 (( Intervento presentato al 24. convegno Italian Symposium on Advanced Database Systems tenutosi a Ugento nel 2016.

Crowdclustering digital resources

S. Castano;A. Ferrara;S. Montanelli
2016

Abstract

Crowdclustering has been recently proposed to engage humans in automated categorization tasks and it shows to be effective especially when digital resources are involved, with complex features to be abstracted for an automated procedure, like images or multimedia resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of digital resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2 , crowd workers actively participate to clustering activities i) by resolving tasks in which they are asked to visually recognize groups of similar resources and ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified as it will be discussed in the experimental evaluation.
No
English
crowdclustering; cluster similarity evaluation; consensusbased crowdsourcing
Settore INF/01 - Informatica
Intervento a convegno
Esperti anonimi
Pubblicazione scientifica
Italian Symposium on Advanced Database Systems
M.A. Bochicchio, G. Mecca
Matematicamente.it
2016
7
18
12
9788896354889
Volume a diffusione nazionale
Italian Symposium on Advanced Database Systems
Ugento
2016
24
Convegno nazionale
Intervento inviato
Aderisco
S. Castano, A. Ferrara, S. Montanelli
Book Part (author)
reserved
273
Crowdclustering digital resources / S. Castano, A. Ferrara, S. Montanelli - In: Italian Symposium on Advanced Database Systems / [a cura di] M.A. Bochicchio, G. Mecca. - [s.l] : Matematicamente.it, 2016. - ISBN 9788896354889. - pp. 7-18 (( Intervento presentato al 24. convegno Italian Symposium on Advanced Database Systems tenutosi a Ugento nel 2016.
info:eu-repo/semantics/bookPart
3
Prodotti della ricerca::03 - Contributo in volume
File in questo prodotto:
File Dimensione Formato  
castanoetal.pdf

accesso riservato

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 1.61 MB
Formato Adobe PDF
1.61 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/456283
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact