Expressive speech modeling is a new trend in speech processing, including emotional speech synthesis and recognition. So far, emotion recognition from speech signal has been mainly achieved using supervised classifiers. However, clustering techniques seem well fitted to resolve such a problem, especially in huge databases, where speech labeling may be a hard and tedious task. This paper presents a novel approach for emotion recognition from speech signal, based on fuzzy clustering, including probabilistic, possibilistic and graded-possibilistic c-means. In comparison to crisp clustering, mainly using kmeans, fuzzy c-means look more fitted for this problem, and potentially offer an innovative way to analyze emotions conveyed by speech using membership degrees.
Emotion recognition from speech signal using fuzzy clustering / S. Rovetta, Z. Mnasri, F. Masulli, A. Cabri (ATLANTIS STUDIES IN UNCERTAINTY MODELLING). - In: Proceedings of the 11th Conference of the European Society for Fuzzy Logic and Technology (EUSFLAT 2019) / [a cura di] V. Novak, V. Marik, M. Stepnicka, M. Navara, P. Hurtik. - [s.l] : Atlantis Press, 2019 Jan. - ISBN 978-94-6252-770-6. - pp. 197-206 (( Intervento presentato al 11. convegno Conference of the European Society for Fuzzy Logic and Technology tenutosi a Prague nel 2019 [10.2991/eusflat-19.2019.19].
Emotion recognition from speech signal using fuzzy clustering
A. Cabri
2019
Abstract
Expressive speech modeling is a new trend in speech processing, including emotional speech synthesis and recognition. So far, emotion recognition from speech signal has been mainly achieved using supervised classifiers. However, clustering techniques seem well fitted to resolve such a problem, especially in huge databases, where speech labeling may be a hard and tedious task. This paper presents a novel approach for emotion recognition from speech signal, based on fuzzy clustering, including probabilistic, possibilistic and graded-possibilistic c-means. In comparison to crisp clustering, mainly using kmeans, fuzzy c-means look more fitted for this problem, and potentially offer an innovative way to analyze emotions conveyed by speech using membership degrees.File | Dimensione | Formato | |
---|---|---|---|
Pubblicazione_7.pdf
accesso aperto
Tipologia:
Publisher's version/PDF
Dimensione
2.34 MB
Formato
Adobe PDF
|
2.34 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.