In the last years, online users have been sharing more and more opinions, reviews, and comments on the web. Opinion mining is the automatic process of getting the subject of such opinions, and recently it has been attracting great commercial and academic interest. Several methods were presented for performing opinion mining in Bangla language, however they reported limited performance. In the present article, we considered the only two publicly datasets available for opinion mining in the Bangla language. We machine translated the datasets into the English language and we preprocessed them by extracting textual frequency based features. Then, we designed two stacked contractive auto-encoders based architectures to perform opinion mining in Bangla language, one for each dataset. The classifiers were trained on the machine translated version on the two datasets in a stacked learning fashion. The proposed classifiers achieved improved performance, with respect to accuracy (>= 96%), precision (>= 93%), recall (>= 94%), and F1 score (>= 94%), reported in the past state of the art works. Furthermore, the experimental results showed that both the machine translation procedure and the stacked learning frameworks improved the final classification performance.

Opinion mining from machine translated Bangla reviews with stacked contractive auto-encoders / M. Bodini. - In: JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING. - ISSN 1868-5137. - 14:9(2023 Sep), pp. 12119-12131. [10.1007/s12652-022-03760-w]

Opinion mining from machine translated Bangla reviews with stacked contractive auto-encoders

M. Bodini
2023

Abstract

In the last years, online users have been sharing more and more opinions, reviews, and comments on the web. Opinion mining is the automatic process of getting the subject of such opinions, and recently it has been attracting great commercial and academic interest. Several methods were presented for performing opinion mining in Bangla language, however they reported limited performance. In the present article, we considered the only two publicly datasets available for opinion mining in the Bangla language. We machine translated the datasets into the English language and we preprocessed them by extracting textual frequency based features. Then, we designed two stacked contractive auto-encoders based architectures to perform opinion mining in Bangla language, one for each dataset. The classifiers were trained on the machine translated version on the two datasets in a stacked learning fashion. The proposed classifiers achieved improved performance, with respect to accuracy (>= 96%), precision (>= 93%), recall (>= 94%), and F1 score (>= 94%), reported in the past state of the art works. Furthermore, the experimental results showed that both the machine translation procedure and the stacked learning frameworks improved the final classification performance.
Aspect-based sentiment analysis; Auto-encoders; Bangla language; Machine translation; Opinion mining; Stacked learning; Text classification;
Settore INF/01 - Informatica
set-2023
25-feb-2022
Article (author)
File in questo prodotto:
File Dimensione Formato  
s12652-022-03760-w.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.8 MB
Formato Adobe PDF
1.8 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/912773
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 2
social impact