This paper proposes an image-guided HRTF selection procedure that exploits the relation between features of the pinna shape and HRTF notches. Using a 2D image of a user's pinna, the procedure selects from a database the HRTF set that best fits the anthropometry of that user. The proposed procedure is designed to be quickly applied and easy to use for a user without previous knowledge on binaural audio technologies. The entire process is evaluated by means of (i) an auditory model for sound localization in the mid-sagittal plane available from previous literature, and (ii) a short localization test in virtual reality. Using both virtual and real subjects from a HRTF database, predictions and the experimental evaluation aimed to assess the vertical localization performance with HRTF sets selected by the proposed procedure. Our results report a statistically significant improvement in predictions of the auditory model for localization performance with selected HRTFs compared to KEMAR HRTFs, which is a commercial standard in many binaural audio solutions. Moreover, the proposed localization test with human listeners reflect the model's predictions, further supporting the applicability of our perceptually-motivated metrics with anthropometric data extracted by pinna images.
Applying a Single-notch Metric to Image-guided Head-related Transfer Function Selection for Improved Vertical Localization / M. Geronazzo, E. Peruch, F. Prandoni, F. Avanzini. - In: AES. - ISSN 1549-4950. - 67:6(2019 Jun 09), pp. 414-428. [10.17743/jaes.2019.0010]
Applying a Single-notch Metric to Image-guided Head-related Transfer Function Selection for Improved Vertical Localization
F. AvanziniUltimo
2019
Abstract
This paper proposes an image-guided HRTF selection procedure that exploits the relation between features of the pinna shape and HRTF notches. Using a 2D image of a user's pinna, the procedure selects from a database the HRTF set that best fits the anthropometry of that user. The proposed procedure is designed to be quickly applied and easy to use for a user without previous knowledge on binaural audio technologies. The entire process is evaluated by means of (i) an auditory model for sound localization in the mid-sagittal plane available from previous literature, and (ii) a short localization test in virtual reality. Using both virtual and real subjects from a HRTF database, predictions and the experimental evaluation aimed to assess the vertical localization performance with HRTF sets selected by the proposed procedure. Our results report a statistically significant improvement in predictions of the auditory model for localization performance with selected HRTFs compared to KEMAR HRTFs, which is a commercial standard in many binaural audio solutions. Moreover, the proposed localization test with human listeners reflect the model's predictions, further supporting the applicability of our perceptually-motivated metrics with anthropometric data extracted by pinna images.File | Dimensione | Formato | |
---|---|---|---|
geronazzo_jaes19_prefinal.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
1.72 MB
Formato
Adobe PDF
|
1.72 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.