Processing generalized sound events with the purpose of predicting the emotion they might evoke is a relatively young research field. Tools, datasets, and methodologies to address such a challenging task are still under development, far from any standardized format. This work aims to cover this gap by revealing and exploiting potential similarities existing during the perception of emotions evoked by sound events and music. o this end we propose (a) the usage of temporal modulation features and (b) a transfer learning module based on an Echo State Network assisting the prediction of valence and arousal measurements associated with generalized sound events. The effectiveness of the proposed transfer learning solution is demonstrated after a thoroughly designed experimental phase employing both sound and music data. The results demonstrate the importance of transfer learning in the specific field and encourage further research on approaches which manage the problem in a cooperative way.
Emotion prediction of sound events based on transfer learning / S. Ntalampiras, I. Potamitis (COMMUNICATIONS IN COMPUTER AND INFORMATION SCIENCE). - In: Engineering Applications of Neural Networks / [a cura di] G. Boracchi, L. Iliadis,C. Jayne, A. Likas. - [s.l] : Springer Verlag, 2017. - ISBN 9783319651712. - pp. 303-313 (( Intervento presentato al 18. convegno EANN tenutosi a Athens nel 2017 [10.1007/978-3-319-65172-9_26].
Emotion prediction of sound events based on transfer learning
S. Ntalampiras;
2017
Abstract
Processing generalized sound events with the purpose of predicting the emotion they might evoke is a relatively young research field. Tools, datasets, and methodologies to address such a challenging task are still under development, far from any standardized format. This work aims to cover this gap by revealing and exploiting potential similarities existing during the perception of emotions evoked by sound events and music. o this end we propose (a) the usage of temporal modulation features and (b) a transfer learning module based on an Echo State Network assisting the prediction of valence and arousal measurements associated with generalized sound events. The effectiveness of the proposed transfer learning solution is demonstrated after a thoroughly designed experimental phase employing both sound and music data. The results demonstrate the importance of transfer learning in the specific field and encourage further research on approaches which manage the problem in a cooperative way.| File | Dimensione | Formato | |
|---|---|---|---|
|
38 EANN2017.pdf
accesso riservato
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
336.99 kB
Formato
Adobe PDF
|
336.99 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




