Engaging humans in the resolution of classification tasks has been shown to be effective especially when digital resources are considered, with complex features to be abstracted for an automated procedure, like images or multimedia web resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of web resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2, crowd workers actively participate to clustering activities (i) by resolving tasks in which they are asked to visually recognize groups of similar resources and (ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified. For experimentation and evaluation, the HC2 approach has been deployed on the Argo platform providing crowdsourcing techniques for consensus-based task execution.

Human-in-the-loop web resource classification / S. Castano, A. Ferrara, S. Montanelli (LECTURE NOTES IN COMPUTER SCIENCE). - In: On the Move to Meaningful Internet Systems: OTM 2016 Conferences / [a cura di] C. Debruyne, H. Panetto, R. Meersman, T.S. Dillon, E. Kühn, D. O'Sullivan, C. A. Ardagna. - [s.l] : Springer, 2016. - ISBN 9783319484716. - pp. 229-244 (( convegno On The Move (OTM) tenutosi a Rhodes nel 2016 [10.1007/978-3-319-48472-3_13].

Human-in-the-loop web resource classification

S. Castano;A. Ferrara;S. Montanelli
2016

Abstract

Engaging humans in the resolution of classification tasks has been shown to be effective especially when digital resources are considered, with complex features to be abstracted for an automated procedure, like images or multimedia web resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of web resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2, crowd workers actively participate to clustering activities (i) by resolving tasks in which they are asked to visually recognize groups of similar resources and (ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified. For experimentation and evaluation, the HC2 approach has been deployed on the Argo platform providing crowdsourcing techniques for consensus-based task execution.
Crowdclustering; Large-scale social computing; Consensus based crowdsourcing
Settore INF/01 - Informatica
2016
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
chp%3A10.1007%2F978-3-319-48472-3_13.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 1.01 MB
Formato Adobe PDF
1.01 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/456266
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
social impact