This paper deals with the issue of individualizing the head-related transfer function (HRTF) rendering process for auditory elevation perception: is it possible to find a nonindividual, personalized HRTF set that allows a listener to have an equally accurate localization performance than with his/her individual HRTFs? We propose a psychoacoustically motivated, anthropometry based mismatch function between HRTF pairs, that exploits the close relation between the listener's pinna geometry and localization cues. This is evaluated using an auditory model that computes a mapping between HRTF spectra and perceived spatial locations. Results on a large number of subjects in the CIPIC and ARI HRTF databases suggest that there exists a non-individual HRTF set which allows a listener to have an equally accurate vertical localization than with individual HRTFs. Furthermore, we find the optimal parametrization of the proposed mismatch function, i.e. the one that best reflects the information given by the auditory model. Our findings show that the selection procedure yields statistically significant improvements with respect to dummy-head HRTFs or random HRTF selection, with potentially high impact from an applicative point of view.

Do we need individual head-related transfer functions for vertical localization? : The case study of a spectral notch distance metric / M. Geronazzo, S. Spagnol, F. Avanzini. - In: IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING. - ISSN 2329-9290. - 26:7(2018 Jul), pp. 1243-1256. [10.1109/TASLP.2018.2821846]

Do we need individual head-related transfer functions for vertical localization? : The case study of a spectral notch distance metric

F. Avanzini
Ultimo
2018

Abstract

This paper deals with the issue of individualizing the head-related transfer function (HRTF) rendering process for auditory elevation perception: is it possible to find a nonindividual, personalized HRTF set that allows a listener to have an equally accurate localization performance than with his/her individual HRTFs? We propose a psychoacoustically motivated, anthropometry based mismatch function between HRTF pairs, that exploits the close relation between the listener's pinna geometry and localization cues. This is evaluated using an auditory model that computes a mapping between HRTF spectra and perceived spatial locations. Results on a large number of subjects in the CIPIC and ARI HRTF databases suggest that there exists a non-individual HRTF set which allows a listener to have an equally accurate vertical localization than with individual HRTFs. Furthermore, we find the optimal parametrization of the proposed mismatch function, i.e. the one that best reflects the information given by the auditory model. Our findings show that the selection procedure yields statistically significant improvements with respect to dummy-head HRTFs or random HRTF selection, with potentially high impact from an applicative point of view.
spatial audio; head-related transfer functions (HRTFs); auditory models; individualized HRTFs; HRTF selection; vertical localization; spectral notch metric
Settore INF/01 - Informatica
lug-2018
2-apr-2018
Article (author)
File in questo prodotto:
File Dimensione Formato  
geronazzo_taslp18_preprint.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 2.83 MB
Formato Adobe PDF
2.83 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/568229
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 24
  • ???jsp.display-item.citation.isi??? 15
social impact