Cross-language speech emotion recognition is receiving increased attention due to its extensive real-world applicability. This work proposes a language-agnostic speech emotion recognition algorithm focusing on Italian and German languages. We combine mel-scaled and temporal modulation spectral representations which are subsequently modeled by means of Gaussian mixture models. Emotion prediction is carried out via a Kullback Leibler divergence scheme. Importantly, we apply the proposed methodology on two problem settings, i.e. one including positive vs. negative emotion classification and a second one where all Big Six emotional states are considered. A thorough experimental campaign demonstrated the efficacy of such a method, as well as its superiority over other generative modeling schemes and state of the art approaches.

Toward Language-Agnostic Speech Emotion Recognition / S. Ntalampiras. - In: AES. - ISSN 1549-4950. - 68:1/2(2020), pp. 7-13. [10.17743/jaes.2019.0045]

Toward Language-Agnostic Speech Emotion Recognition

S. Ntalampiras
2020

Abstract

Cross-language speech emotion recognition is receiving increased attention due to its extensive real-world applicability. This work proposes a language-agnostic speech emotion recognition algorithm focusing on Italian and German languages. We combine mel-scaled and temporal modulation spectral representations which are subsequently modeled by means of Gaussian mixture models. Emotion prediction is carried out via a Kullback Leibler divergence scheme. Importantly, we apply the proposed methodology on two problem settings, i.e. one including positive vs. negative emotion classification and a second one where all Big Six emotional states are considered. A thorough experimental campaign demonstrated the efficacy of such a method, as well as its superiority over other generative modeling schemes and state of the art approaches.
Settore INF/01 - Informatica
2020
AES
Article (author)
File in questo prodotto:
File Dimensione Formato  
JAES final.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 1.07 MB
Formato Adobe PDF
1.07 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/712619
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 19
  • ???jsp.display-item.citation.isi??? 11
social impact