Opinion mining from machine translated Bangla reviews with stacked contractive auto-encoders / M. Bodini. - In: JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING. - ISSN 1868-5137. - 14:9(2023 Sep), pp. 12119-12131. [10.1007/s12652-022-03760-w]
Opinion mining from machine translated Bangla reviews with stacked contractive auto-encoders
M. Bodini
2023
Abstract
In recent years, online users have been sharing more and more opinions, reviews, and comments on the web. Opinion mining is the automatic process of extracting the subject of such opinions, and it has recently attracted great commercial and academic interest. Several methods have been presented for opinion mining in the Bangla language; however, they reported limited performance. In the present article, we considered the only two publicly available datasets for opinion mining in the Bangla language. We machine translated the datasets into English and preprocessed them by extracting textual frequency-based features. Then, we designed two architectures based on stacked contractive auto-encoders to perform opinion mining in the Bangla language, one for each dataset. The classifiers were trained on the machine-translated versions of the two datasets in a stacked learning fashion. The proposed classifiers achieved accuracy (>= 96%), precision (>= 93%), recall (>= 94%), and F1 score (>= 94%) that improved on the results reported in previous state-of-the-art works. Furthermore, the experimental results showed that both the machine translation procedure and the stacked learning framework improved the final classification performance.

File | Size | Format |
---|---|---|
s12652-022-03760-w.pdf (open access; type: Publisher's version/PDF) | 1.8 MB | Adobe PDF |
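For readers unfamiliar with the contractive auto-encoders named in the abstract, the objective usually given for them in the general literature can be sketched as follows. This is the textbook formulation and not necessarily the exact loss used in the article; the symbols $f$ (encoder), $g$ (decoder), $h = f(x)$ (hidden code), $D$ (training set), and $\lambda$ (penalty weight) are notation assumed here for illustration.

$$
\mathcal{J}_{\mathrm{CAE}}(\theta) = \sum_{x \in D} \Big( \lVert x - g(f(x)) \rVert_2^2 + \lambda \, \lVert J_f(x) \rVert_F^2 \Big),
\qquad
\lVert J_f(x) \rVert_F^2 = \sum_{i,j} \left( \frac{\partial h_j(x)}{\partial x_i} \right)^2
$$

Under the further assumption of a sigmoid encoder $h = \sigma(Wx + b)$, the penalty reduces to $\sum_j \big(h_j (1 - h_j)\big)^2 \sum_i W_{ji}^2$, which is cheap to compute. In a stacked arrangement, each auto-encoder is typically trained on the hidden codes produced by the one before it.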
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.