Folksonomies - networks of users, resources, and tags allow users to easily retrieve, organize and browse web contents. However, their advantages are still limited mainly due to the noisiness of user provided tags. To overcome this issue, we propose an approach for characterizing related tags in folksonomies: we use tag co-occurrence statistics and Laplacian score based feature selection in order to create empirical co-occurrence probability distribution for each tag; then we identify related tags on the basis of the dissimilarity between their distributions. For this purpose, we introduce variant of the Jensen-Shannon Divergence, which is more robust to statistical noise. We experimentally evaluate our approach using WordNet and compare it to a common tag-relatedness approach based on the cosine similarity. The results show the effectiveness of our approach and its advantage over the competing method.

Tag relatedness in image folksonomies / H. Mousselly Sergieh, E. Egyed Zsigmond, G. Gianini, M. Döller, J. Pinon, H. Kosch. - In: DOCUMENT NUMÉRIQUE. - ISSN 1279-5127. - 17:2(2014), pp. 33-54. [10.3166/dn.17.2.33-54]

Tag relatedness in image folksonomies

G. Gianini;
2014

Abstract

Folksonomies - networks of users, resources, and tags allow users to easily retrieve, organize and browse web contents. However, their advantages are still limited mainly due to the noisiness of user provided tags. To overcome this issue, we propose an approach for characterizing related tags in folksonomies: we use tag co-occurrence statistics and Laplacian score based feature selection in order to create empirical co-occurrence probability distribution for each tag; then we identify related tags on the basis of the dissimilarity between their distributions. For this purpose, we introduce variant of the Jensen-Shannon Divergence, which is more robust to statistical noise. We experimentally evaluate our approach using WordNet and compare it to a common tag-relatedness approach based on the cosine similarity. The results show the effectiveness of our approach and its advantage over the competing method.
folksonomy; tag relatedness; JSD; AJSD; Laplacian score; feature selection
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
2014
http://dn.revuesonline.com/article.jsp?articleId=19707
Article (author)
File in questo prodotto:
File Dimensione Formato  
AJSDpreprint_ver02.pdf

accesso aperto

Descrizione: Bozza preprint
Tipologia: Pre-print (manoscritto inviato all'editore)
Dimensione 837.63 kB
Formato Adobe PDF
837.63 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/240562
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact