Expressive speech modeling is a new trend in speech processing, including emotional speech synthesis and recognition. So far, emotion recognition from speech signal has been mainly achieved using supervised classifiers. However, clustering techniques seem well fitted to resolve such a problem, especially in huge databases, where speech labeling may be a hard and tedious task. This paper presents a novel approach for emotion recognition from speech signal, based on fuzzy clustering, including probabilistic, possibilistic and graded-possibilistic c-means. In comparison to crisp clustering, mainly using kmeans, fuzzy c-means look more fitted for this problem, and potentially offer an innovative way to analyze emotions conveyed by speech using membership degrees.

Emotion recognition from speech signal using fuzzy clustering / S. Rovetta, Z. Mnasri, F. Masulli, A. Cabri (ATLANTIS STUDIES IN UNCERTAINTY MODELLING). - In: Proceedings of the 11th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT 2019) / [a cura di] V. Novak, V. Marik, M. Stepnicka, M. Navara, P. Hurtik. - [s.l] : Atlantis Press, 2019 Jan. - ISBN 978-94-6252-770-6. - pp. 197-206 (( Intervento presentato al 11. convegno Conference of the European Society for Fuzzy Logic and Technology tenutosi a Prague nel 2019 [10.2991/eusflat-19.2019.19].

Emotion recognition from speech signal using fuzzy clustering

A. Cabri
2019

Abstract

Expressive speech modeling is a new trend in speech processing, including emotional speech synthesis and recognition. So far, emotion recognition from speech signal has been mainly achieved using supervised classifiers. However, clustering techniques seem well fitted to resolve such a problem, especially in huge databases, where speech labeling may be a hard and tedious task. This paper presents a novel approach for emotion recognition from speech signal, based on fuzzy clustering, including probabilistic, possibilistic and graded-possibilistic c-means. In comparison to crisp clustering, mainly using kmeans, fuzzy c-means look more fitted for this problem, and potentially offer an innovative way to analyze emotions conveyed by speech using membership degrees.
Emotion recognition; speech signal; kmeans; fuzzy clustering; membership function
Settore INF/01 - Informatica
gen-2019
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
Pubblicazione_7.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 2.34 MB
Formato Adobe PDF
2.34 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/954094
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
social impact