A Survey on Machine Learning Techniques for Head-Related Transfer Function Individualization / D. Fantini, M. Geronazzo, F. Avanzini, S. Ntalampiras. - In: IEEE OPEN JOURNAL OF SIGNAL PROCESSING. - ISSN 2644-1322. - 6:(2025), pp. 30-56. [10.1109/ojsp.2025.3528330]

A Survey on Machine Learning Techniques for Head-Related Transfer Function Individualization

D. Fantini (first author); M. Geronazzo; F. Avanzini (second-to-last author); S. Ntalampiras (last author)
2025

Abstract

Machine learning (ML) has become pervasive in various research fields, including binaural synthesis personalization, which is crucial for sound in immersive virtual environments. Researchers have mainly addressed this topic by estimating the individual head-related transfer function (HRTF). HRTFs are utilized to render audio signals at specific spatial positions, thereby simulating real-world sound wave interactions with the human body. As such, an HRTF that is compliant with individual characteristics enhances the realism of the binaural simulation. This survey systematically examines the ML-based HRTF individualization works proposed in the literature. The analyzed works are organized according to the processing steps involved in the ML workflow, including the employed dataset, input and output types, data preprocessing operations, ML models, and model evaluation. In addition to categorizing the existing literature works, this survey discusses their achievements, identifies their limitations, and outlines aspects requiring further investigation at the crossroads of research communities in acoustics, audio signal processing, and machine learning.
Keywords: HRTF individualization; machine learning; spatial audio; binaural synthesis
Sector INFO-01/A - Computer Science
Funding: Transforming auditory-based social interaction and communication in AR/VR (SONICOM); European Commission; H2020; 101017743
2025
10-Jan-2025
https://ieeexplore.ieee.org/document/10836943/media#media
Article (author)
Files in this item:
A_Survey_on_Machine_Learning_Techniques_for_Head-Related_Transfer_Function_Individualization.pdf
Access: open access
Type: Publisher's version/PDF
Size: 2.64 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/1136255