Engaging humans in the resolution of classification tasks has been shown to be effective especially when digital resources are considered, with complex features to be abstracted for an automated procedure, like images or multimedia web resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of web resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2, crowd workers actively participate to clustering activities (i) by resolving tasks in which they are asked to visually recognize groups of similar resources and (ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified. For experimentation and evaluation, the HC2 approach has been deployed on the Argo platform providing crowdsourcing techniques for consensus-based task execution.
Human-in-the-loop web resource classification / S. Castano, A. Ferrara, S. Montanelli (LECTURE NOTES IN COMPUTER SCIENCE). - In: On the Move to Meaningful Internet Systems: OTM 2016 Conferences / [a cura di] C. Debruyne, H. Panetto, R. Meersman, T.S. Dillon, E. Kühn, D. O'Sullivan, C. A. Ardagna. - [s.l] : Springer, 2016. - ISBN 9783319484716. - pp. 229-244 (( convegno On The Move (OTM) tenutosi a Rhodes nel 2016 [10.1007/978-3-319-48472-3_13].
Human-in-the-loop web resource classification
S. Castano;A. Ferrara;S. Montanelli
2016
Abstract
Engaging humans in the resolution of classification tasks has been shown to be effective especially when digital resources are considered, with complex features to be abstracted for an automated procedure, like images or multimedia web resources. In this paper, we propose the HC2 crowdclustering approach for unsupervised classification of web resources, by allowing the classification categories to dynamically emerge from the crowd. In HC2, crowd workers actively participate to clustering activities (i) by resolving tasks in which they are asked to visually recognize groups of similar resources and (ii) by labeling recognized clusters with prominent keywords. To increase flexibility, HC2 can be interactively configured to dynamically set the balance between human engagement and automated procedures in cluster formation, according to the kind and nature of resources to be classified. For experimentation and evaluation, the HC2 approach has been deployed on the Argo platform providing crowdsourcing techniques for consensus-based task execution.File | Dimensione | Formato | |
---|---|---|---|
chp%3A10.1007%2F978-3-319-48472-3_13.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
1.01 MB
Formato
Adobe PDF
|
1.01 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.