IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

A Survey on Machine Learning Techniques for Head-Related Transfer Function Individualization / D. Fantini, M. Geronazzo, F. Avanzini, S. Ntalampiras. - In: IEEE OPEN JOURNAL OF SIGNAL PROCESSING. - ISSN 2644-1322. - 6:(2025), pp. 30-56. [10.1109/ojsp.2025.3528330]

A Survey on Machine Learning Techniques for Head-Related Transfer Function Individualization

D. Fantini (first author); M. Geronazzo; F. Avanzini (penultimate author); S. Ntalampiras (last author)

2025

Abstract

Machine learning (ML) has become pervasive in various research fields, including binaural synthesis personalization, which is crucial for sound in immersive virtual environments. Researchers have mainly addressed this topic by estimating the individual head-related transfer function (HRTF). HRTFs are utilized to render audio signals at specific spatial positions, thereby simulating real-world sound wave interactions with the human body. As such, an HRTF that is compliant with individual characteristics enhances the realism of the binaural simulation. This survey systematically examines the ML-based HRTF individualization works proposed in the literature. The analyzed works are organized according to the processing steps involved in the ML workflow, including the employed dataset, input and output types, data preprocessing operations, ML models, and model evaluation. In addition to categorizing the existing literature works, this survey discusses their achievements, identifies their limitations, and outlines aspects requiring further investigation at the crossroads of research communities in acoustics, audio signal processing, and machine learning.

Keywords: HRTF individualization; machine learning; spatial audio; binaural synthesis

Scientific-disciplinary sectors of the article (valid from 9 May 2024): Sector INFO-01/A - Informatica (Computer Science)

Project
Project title: Transforming auditory-based social interaction and communication in AR/VR (SONICOM)
Acronym: SONICOM
Funder: EUROPEAN COMMISSION
Funding programme: H2020
Contract no.: 101017743

Publication date: 2025
Ahead-of-print or print date: 10 Jan 2025
Journal in ANCE: IEEE OPEN JOURNAL OF SIGNAL PROCESSING
DOI: https://dx.doi.org/10.1109/ojsp.2025.3528330
URL: https://ieeexplore.ieee.org/document/10836943/media#media
Type: Article (author)
Appears in types: 01 - Journal article

Files in this record:

File	Size	Format
A_Survey_on_Machine_Learning_Techniques_for_Head-Related_Transfer_Function_Individualization.pdf (open access; type: Publisher's version/PDF)	2.64 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/1136255
