This work presents a novel framework for the automatic assessment of the unpleasantness caused by audio events to a human listener which is a relatively new research problem. Melfrequency cepstral coefficients and temporal modulation parameters were employed to characterize 75 sound stimuli varying from animal calls to baby cries. The final assessment is made by means of a clustering scheme realized by Gaussian mixture models. The proposed framework leads to the best performance in terms of mean squared error and correlation between predicted and measured unpleasantness levels reported so far in the literature.
On predicting the unpleasantness level of a sound event / S. Ntalampiras, I. Potamitis (INTERSPEECH). - In: Celebrating the Diversity of Spoken Languages / [a cura di] H. Li, P. Ching. - [s.l] : International Speech and Communication Association, 2014. - ISBN 9781634394352. - pp. 1782-1785 (( Intervento presentato al 15. convegno INTERSPEECH tenutosi a Singapore nel 2014.
On predicting the unpleasantness level of a sound event
S. Ntalampiras;
2014
Abstract
This work presents a novel framework for the automatic assessment of the unpleasantness caused by audio events to a human listener which is a relatively new research problem. Melfrequency cepstral coefficients and temporal modulation parameters were employed to characterize 75 sound stimuli varying from animal calls to baby cries. The final assessment is made by means of a clustering scheme realized by Gaussian mixture models. The proposed framework leads to the best performance in terms of mean squared error and correlation between predicted and measured unpleasantness levels reported so far in the literature.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




