This paper proposes an image-guided HRTF selection procedure that exploits the relation between features of the pinna shape and HRTF notches. Using a 2D image of a subject's pinna, the procedure selects from a database the HRTF set that best fits the anthropometry of that subject. The proposed procedure is designed to be quickly applied and easy to use for a user without previous knowledge on binaural audio technologies. The entire process is evaluated by means of an auditory model for sound localization in the mid-sagittal plane available from previous literature. Using virtual subjects from a HRTF database, a virtual experiment is implemented to assess the vertical localization performance of the database subjects when they are provided with HRTF sets selected by the proposed procedure. Results report a statistically significant improvement in predictions of localization performance for selected HRTFs compared to KEMAR HRTF which is a commercial standard in many binaural audio solutions; moreover, the proposed analysis provides useful indications to refine the perceptually-motivated metrics that guides the selection.

Improving elevation perception with a tool for image-guided head-related transfer function selection / M. Geronazzo, E. Peruch, F. Prandoni, F. Avanzini (Proceedings of the International Conference on Digital Audio Effects). - In: DAFx 2017 : Proceedings / [a cura di] A. Torin, B. Hamilton, S. Bilbao, M. Newton. - Edinburgh : University of Edinburgh, 2017. - pp. 397-404 (( Intervento presentato al 20. convegno International Conference on Digital Audio Effects tenutosi a Edinburgh nel 2017.

Improving elevation perception with a tool for image-guided head-related transfer function selection

F. Avanzini
2017

Abstract

This paper proposes an image-guided HRTF selection procedure that exploits the relation between features of the pinna shape and HRTF notches. Using a 2D image of a subject's pinna, the procedure selects from a database the HRTF set that best fits the anthropometry of that subject. The proposed procedure is designed to be quickly applied and easy to use for a user without previous knowledge on binaural audio technologies. The entire process is evaluated by means of an auditory model for sound localization in the mid-sagittal plane available from previous literature. Using virtual subjects from a HRTF database, a virtual experiment is implemented to assess the vertical localization performance of the database subjects when they are provided with HRTF sets selected by the proposed procedure. Results report a statistically significant improvement in predictions of localization performance for selected HRTFs compared to KEMAR HRTF which is a commercial standard in many binaural audio solutions; moreover, the proposed analysis provides useful indications to refine the perceptually-motivated metrics that guides the selection.
Computer Science Applications; Computer Vision and Pattern Recognition; Signal Processing
Settore INF/01 - Informatica
2017
Arturia
Cedar
et al.
Eventide
Soundtoys
Steinberg
http://www.dafx17.eca.ed.ac.uk/papers/DAFx17_Proceedings.pdf
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
geronazzo_dafx17.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 426.39 kB
Formato Adobe PDF
426.39 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/554917
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? ND
social impact