
Emotional quantification of soundscapes by learning between samples / S. Ntalampiras. - In: MULTIMEDIA TOOLS AND APPLICATIONS. - ISSN 1380-7501. - 79:41-42(2020 Nov), pp. 30387-30395. [10.1007/s11042-020-09430-3]

Emotional quantification of soundscapes by learning between samples

S. Ntalampiras
2020

Abstract

Predicting the emotional responses of humans to soundscapes is a relatively recent field of research with a wide range of promising applications. This work presents the design of two convolutional neural networks, ArNet and ValNet, responsible for quantifying the arousal and valence evoked by soundscapes, respectively. We build on the knowledge acquired from applying traditional machine learning techniques to this domain and design a suitable deep learning framework. Moreover, we propose the use of artificially created mixed soundscapes, whose distributions lie between those of the available samples, a process that increases the variance of the dataset and leads to significantly better performance. The reported results outperform the state of the art on a soundscape dataset following Schafer's standardized categorization, which considers both the sound's identity and the respective listening context.
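The between-samples augmentation described in the abstract can be illustrated with a mixup-style linear interpolation of waveforms and their arousal/valence labels. This is only a minimal sketch of the general idea, not the paper's exact procedure; the function name, the Beta-distributed mixing weight, and the label format are all assumptions made for illustration.

```python
import numpy as np

def mix_soundscapes(x1, y1, x2, y2, alpha=0.2, rng=None):
    """Create an artificial soundscape lying between two samples.

    x1, x2: mono waveform arrays of equal length.
    y1, y2: (arousal, valence) label pairs.
    A mixing weight is drawn from Beta(alpha, alpha), so the mixed
    sample's distribution sits between those of the originals.
    """
    rng = rng or np.random.default_rng()
    lam = rng.beta(alpha, alpha)
    x_mix = lam * x1 + (1.0 - lam) * x2
    y_mix = lam * np.asarray(y1, dtype=float) + (1.0 - lam) * np.asarray(y2, dtype=float)
    return x_mix, y_mix

# Example: blend two one-second mono clips at 16 kHz
sr = 16000
x1, x2 = np.zeros(sr), np.ones(sr)
x_mix, y_mix = mix_soundscapes(x1, (0.2, 0.5), x2, (0.8, 0.1))
```

Because the mixing weight is convex (between 0 and 1), each mixed waveform and label stays inside the convex hull of the originals, which is one way to realize "distributions located between the available samples".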
acoustic ecology; audio signal processing; affective computing
Settore INF/01 - Informatica (Computer Science)
Published: Nov 2020
Available online: 15 Aug 2020
Article (author)
Files in this item:

File: 38 Ntalampiras2020_Article_EmotionalQuantificationOfSound.pdf
Access: open access
Description: Online first
Type: Publisher's version/PDF
Size: 1.22 MB (Adobe PDF)

File: Ntalampiras2020_Article_EmotionalQuantificationOfSound.pdf
Access: open access
Type: Publisher's version/PDF
Size: 1.23 MB (Adobe PDF)

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/758861
Citations
  • PMC: N/A
  • Scopus: 6
  • Web of Science: 6