We present iSA (integrated sentiment analysis), a novel algorithm designed for social networks and Web 2.0 sphere (Twitter, blogs, etc.) opinion analysis, i.e. developed for the digital environments characterized by abundance of noise compared to the amount of information. Instead of performing an individual classification and then aggregate the predicted values, iSA directly estimates the aggregated distribution of opinions. Based on supervised hand-coding rather than NLP techniques or ontological dictionaries, iSA is a language-agnostic algorithm (based on human coders' abilities). iSA exploits a dimensionality reduction approach which makes it scalable, fast, memory efficient, stable and statistically accurate. The cross-tabulation of opinions is possible with iSA thanks to its stability. Through empirical analysis it will be shown when iSA outperforms machine learning techniques of individual classification (e.g. SVM, Random Forests, etc) as well as the only other alternative for aggregated sentiment analysis known as ReadMe.

iSA : a fast, scalable and accurate algorithm for sentiment analysis of social media content / A. Ceron, L. Curini, S.M. Iacus. - In: INFORMATION SCIENCES. - ISSN 0020-0255. - 367-368(2016), pp. 105-124.

iSA : a fast, scalable and accurate algorithm for sentiment analysis of social media content

A. Ceron
Primo
;
L. Curini
Secondo
;
S.M. Iacus
2016

Abstract

We present iSA (integrated sentiment analysis), a novel algorithm designed for social networks and Web 2.0 sphere (Twitter, blogs, etc.) opinion analysis, i.e. developed for the digital environments characterized by abundance of noise compared to the amount of information. Instead of performing an individual classification and then aggregate the predicted values, iSA directly estimates the aggregated distribution of opinions. Based on supervised hand-coding rather than NLP techniques or ontological dictionaries, iSA is a language-agnostic algorithm (based on human coders' abilities). iSA exploits a dimensionality reduction approach which makes it scalable, fast, memory efficient, stable and statistically accurate. The cross-tabulation of opinions is possible with iSA thanks to its stability. Through empirical analysis it will be shown when iSA outperforms machine learning techniques of individual classification (e.g. SVM, Random Forests, etc) as well as the only other alternative for aggregated sentiment analysis known as ReadMe.
Opinion mining; Sentiment analysis; Twitter analysis
Settore MAT/06 - Probabilita' e Statistica Matematica
Settore SECS-S/01 - Statistica
Settore SPS/04 - Scienza Politica
2016
Article (author)
File in questo prodotto:
File Dimensione Formato  
Information Sciences.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 963.77 kB
Formato Adobe PDF
963.77 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/392517
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 49
  • ???jsp.display-item.citation.isi??? 37
social impact