Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios

Barumerli, R.; Almenari, A.; Geronazzo, M.; Di Nunzio, G.; Avanzini, F.

doi:10.18154/RWTH-CONV-239730

This paper compares and reproduces the predictions of two publicly available computational auditory models for speaker localization in different simulated environments. The direction of arrival (DOA) of sound sources in the horizontal plane can be extracted from binaural spatial cues arising from room and user acoustics. Since our predictions account for the specificity of both models at the level of peripheral processing, the proposed solution for DOA extraction also provides a common multi-conditional training for the Gaussian Mixture Model (GMM) approach. A set of acoustic simulations of adverse conditions (i.e., multi-speaker or highly reverberant scenarios) supports the evaluation of the robustness of the synthetic auditory process. Our analysis reproduces two case studies from the scientific literature in order to investigate the reliability of localization predictions in the frontal horizontal plane. Finally, a newly defined acoustic scenario makes it possible to identify differences between the auditory models' outcomes in the entire horizontal plane. The results show good agreement with previous literature, and our machine learning approach highlights the peculiarities of each approach to auditory peripheral processing.
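The idea of multi-conditional GMM training for DOA extraction can be illustrated with a minimal sketch. It is not the authors' implementation and uses neither of the two auditory models: the binaural cues are toy values (a Woodworth spherical-head approximation for the ITD and an assumed sinusoidal ILD), the noise levels stand in for acoustic conditions, and each mixture component is simply fitted to one training condition instead of being estimated with EM. All function names and constants are hypothetical.

```python
import math
import random
import statistics

SPEED_OF_SOUND = 343.0  # m/s
HEAD_RADIUS = 0.0875    # m, assumed spherical-head radius


def toy_cues(azimuth_deg, noise_std, rng):
    """Return a noisy (ITD in ms, ILD in dB) pair for a frontal azimuth."""
    theta = math.radians(azimuth_deg)
    # Woodworth spherical-head ITD approximation, converted to milliseconds.
    itd_ms = 1e3 * HEAD_RADIUS / SPEED_OF_SOUND * (theta + math.sin(theta))
    # Assumed toy ILD curve, not derived from measured HRTFs.
    ild_db = 10.0 * math.sin(theta)
    return (itd_ms + rng.gauss(0.0, 0.02 * noise_std),
            ild_db + rng.gauss(0.0, noise_std))


def fit_component(samples):
    """Fit a diagonal Gaussian: per-feature (mean, standard deviation)."""
    return [(statistics.fmean(col), statistics.stdev(col))
            for col in zip(*samples)]


def log_gauss(x, mean, std):
    """Log density of a univariate normal distribution."""
    return -0.5 * math.log(2.0 * math.pi * std * std) - 0.5 * ((x - mean) / std) ** 2


def mixture_loglik(x, components):
    """Log-likelihood of x under an equal-weight mixture (log-sum-exp)."""
    logs = [sum(log_gauss(xi, m, s) for xi, (m, s) in zip(x, comp))
            - math.log(len(components))
            for comp in components]
    top = max(logs)
    return top + math.log(sum(math.exp(v - top) for v in logs))


def train(azimuths, noise_levels, n_per_condition, rng):
    """One mixture per azimuth, one component per training condition."""
    return {
        az: [fit_component([toy_cues(az, noise, rng)
                            for _ in range(n_per_condition)])
             for noise in noise_levels]
        for az in azimuths
    }


def localize(x, models):
    """Pick the azimuth whose mixture gives the highest log-likelihood."""
    return max(models, key=lambda az: mixture_loglik(x, models[az]))


rng = random.Random(0)
AZIMUTHS = [-60, -30, 0, 30, 60]
models = train(AZIMUTHS, noise_levels=(0.5, 1.0, 2.0),
               n_per_condition=200, rng=rng)

# Evaluate on a noise level that was not part of the training conditions.
trials = [(az, toy_cues(az, 1.5, rng)) for az in AZIMUTHS for _ in range(50)]
accuracy = sum(localize(x, models) == az for az, x in trials) / len(trials)
print(f"accuracy on unseen condition: {accuracy:.2f}")
```

Pooling several conditions into each azimuth model is what lets the classifier cope with a condition it has not seen; a real system would replace `toy_cues` with cues produced by an auditory front end.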

Auditory models comparison for horizontal localization of concurrent speakers in adverse acoustic scenarios / R. Barumerli, A. Almenari, M. Geronazzo, G. Di Nunzio, F. Avanzini. - In: Proceedings of the 23rd International Congress on Acoustics. - First edition. - [s.l.] : EAA, 2019. - ISBN 9783939296157. - pp. 7651-7658 (Paper presented at the 23rd International Congress on Acoustics, held in Aachen in 2019) [10.18154/RWTH-CONV-239730].




Scientific-disciplinary sectors of the contribution: INF/01 - Computer Science; ING-INF/05 - Information Processing Systems
Publication date: 2019
DOI: https://dx.doi.org/10.18154/RWTH-CONV-239730
Type: Book Part (author)
Appears in types: 03 - Book contribution

Files in this item:

File	Size	Format
barumerli_ica19.pdf (open access; type: Publisher's version/PDF)	391.24 kB	Adobe PDF


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/711163

