This thesis delves into topics related to modal sound synthesis, a technique that generates sounds by simulating the physical interactions within resonating objects. It proposes novel methods for analyzing audio recordings and extracting the modal parameters. A central algorithm, SAMPLE, estimates these modal parameters by finding trajectories in the spectrogram of the input audio. However, SAMPLE encounters challenges with specific sounds, like acoustic beats, where two close frequencies interact and create a beating effect. To overcome this limitation, the thesis introduces BeatsDROP, an auxiliary algorithm that complements SAMPLE and models the trajectories in the spectrogram as the amplitude and frequency modulations of beats. This thesis also details the reasons why most signal analysis models fail with beats. Furthermore, the thesis presents the Generalized Mixture Space (GMS) model, which aids in representing sounds with multiple channels. GMS allows SAMPLE's simplified analysis to be applied while retaining the original channel distribution information for later resynthesis. Beyond the theoretical framework, the thesis details the development of software tools to make these methods readily usable. SAMPLE is a Python package that implements the algorithms and models, along with additional functionalities previously defined in the audio DSP literature. The thesis also includes the re-engineering and expansion of the existing SDT (Sound Design Toolkit), written in C and available as an external library for Pure Data and Max. Functionalities are implemented within SDT to enable interoperability with other software, including the possibility to import modal analysis results from SAMPLE directly into SDT's modal synthesis models.

SIGNAL MODELS, ANALYSIS ALGORITHMS, AND SOFTWARE TOOLS FOR MODAL AUDIO RESYNTHESIS / M. Tiraboschi ; supervisor: F. Avanzini ; coordinator: R. Sassi. Dipartimento di Informatica Giovanni Degli Antoni, 2024 Jul 12. 36. ciclo, Anno Accademico 2022/2023.

SIGNAL MODELS, ANALYSIS ALGORITHMS, AND SOFTWARE TOOLS FOR MODAL AUDIO RESYNTHESIS

M. Tiraboschi
2024

Abstract

This thesis delves into topics related to modal sound synthesis, a technique that generates sounds by simulating the physical interactions within resonating objects. It proposes novel methods for analyzing audio recordings and extracting the modal parameters. A central algorithm, SAMPLE, estimates these modal parameters by finding trajectories in the spectrogram of the input audio. However, SAMPLE encounters challenges with specific sounds, like acoustic beats, where two close frequencies interact and create a beating effect. To overcome this limitation, the thesis introduces BeatsDROP, an auxiliary algorithm that complements SAMPLE and models the trajectories in the spectrogram as the amplitude and frequency modulations of beats. This thesis also details the reasons why most signal analysis models fail with beats. Furthermore, the thesis presents the Generalized Mixture Space (GMS) model, which aids in representing sounds with multiple channels. GMS allows SAMPLE's simplified analysis to be applied while retaining the original channel distribution information for later resynthesis. Beyond the theoretical framework, the thesis details the development of software tools to make these methods readily usable. SAMPLE is a Python package that implements the algorithms and models, along with additional functionalities previously defined in the audio DSP literature. The thesis also includes the re-engineering and expansion of the existing SDT (Sound Design Toolkit), written in C and available as an external library for Pure Data and Max. Functionalities are implemented within SDT to enable interoperability with other software, including the possibility to import modal analysis results from SAMPLE directly into SDT's modal synthesis models.
12-lug-2024
Settore INF/01 - Informatica
https://hdl.handle.net/2434/945288
https://doi.org/10.5281/zenodo.3898795
https://doi.org/10.1109/IEEECONF59510.2023.10335232
AVANZINI, FEDERICO
AVANZINI, FEDERICO
SASSI, ROBERTO
Doctoral Thesis
SIGNAL MODELS, ANALYSIS ALGORITHMS, AND SOFTWARE TOOLS FOR MODAL AUDIO RESYNTHESIS / M. Tiraboschi ; supervisor: F. Avanzini ; coordinator: R. Sassi. Dipartimento di Informatica Giovanni Degli Antoni, 2024 Jul 12. 36. ciclo, Anno Accademico 2022/2023.
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R12964.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 4.53 MB
Formato Adobe PDF
4.53 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1084008
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact