Folksonomies - networks of users, resources, and tags allow users to easily retrieve, organize and browse web contents. However, their advantages are still limited according to the noisiness of user provided tags. To overcome this problem, we propose an approach for identifying related tags in folksonomies. The approach uses tag co-occurrence statistics and Laplacian score feature selection to create probability distribution for each tag. Consequently, related tags are determined according to the distance between their distributions. In this regards, we propose a distance metric based on Jensen-Shannon Divergence. The new metric named AJSD deals with the noise in the measurements due to statistical fluctuations in tag co-occurrences. We experimentally evaluated our approach using WordNet and compared it to a common tag relatedness approach based on the cosine similarity. The results show the effectiveness of our approach and its advantage over the adversary method.

Tag relatedness using Laplacian score feature selection and adapted Jensen-Shannon divergence / H. Mousselly Sergieh, M. Döller, E. Egyed Zsigmond, G. Gianini, H. Kosch, J. Pinon - In: Multimedia modeling : 20th Anniversary international conference, MMM 2014 : Dublin, Ireland, january 6-10, 2014 : proceedings. Part 1 / [a cura di] H. Mousselly-Sergieh, M. Döller, E. Egyed-Zsigmond, G. Gianini, H. Kosch, J. -M. Pinon. - Berlin : Springer, 2014. - ISBN 9783319041131. - pp. 159-171 (( Intervento presentato al 20. convegno Anniversary International Conference (MMM) tenutosi a Dublin nel 2014 [10.1007/978-3-319-04114-8_14].

Tag relatedness using Laplacian score feature selection and adapted Jensen-Shannon divergence

G. Gianini;
2014

Abstract

Folksonomies - networks of users, resources, and tags allow users to easily retrieve, organize and browse web contents. However, their advantages are still limited according to the noisiness of user provided tags. To overcome this problem, we propose an approach for identifying related tags in folksonomies. The approach uses tag co-occurrence statistics and Laplacian score feature selection to create probability distribution for each tag. Consequently, related tags are determined according to the distance between their distributions. In this regards, we propose a distance metric based on Jensen-Shannon Divergence. The new metric named AJSD deals with the noise in the measurements due to statistical fluctuations in tag co-occurrences. We experimentally evaluated our approach using WordNet and compared it to a common tag relatedness approach based on the cosine similarity. The results show the effectiveness of our approach and its advantage over the adversary method.
Folksonomies; JSD; Laplacian Score; Tag Relatedness
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
2014
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/230481
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 9
  • ???jsp.display-item.citation.isi??? ND
social impact