A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing seam carving, a well-known image processing technique, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show that this technique is well suited to sounds whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, both based on peak detection on spectrogram frames. Seam carving skips the peak detection step and turns out to be applicable to a variety of sounds, although it is more computationally expensive.
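As a rough illustration of the idea summarized in the abstract, the sketch below traces a single maximum-energy seam across the frames of a magnitude matrix using dynamic programming. It is a generic seam search under stated assumptions, not the authors' implementation: the function names `max_energy_seam` and `synthetic_glide`, the `max_jump` constraint of one bin per frame, the use of raw magnitude as the energy function, and the synthetic test matrix are all hypothetical choices made for this example.

```python
import numpy as np


def max_energy_seam(mag, max_jump=1):
    """Return one frequency-bin index per frame, following the path of
    maximal cumulative magnitude through a (n_bins, n_frames) matrix.

    Assumption: the seam may move by at most `max_jump` bins between
    consecutive frames.
    """
    n_bins, n_frames = mag.shape
    cost = np.empty((n_bins, n_frames), dtype=float)  # best cumulative energy
    back = np.zeros((n_bins, n_frames), dtype=int)    # best predecessor bin
    cost[:, 0] = mag[:, 0]
    for t in range(1, n_frames):
        for k in range(n_bins):
            lo = max(0, k - max_jump)
            hi = min(n_bins, k + max_jump + 1)
            prev = lo + int(np.argmax(cost[lo:hi, t - 1]))
            back[k, t] = prev
            cost[k, t] = mag[k, t] + cost[prev, t - 1]
    # Backtrack from the best end point in the last frame.
    seam = np.empty(n_frames, dtype=int)
    seam[-1] = int(np.argmax(cost[:, -1]))
    for t in range(n_frames - 1, 0, -1):
        seam[t - 1] = back[seam[t], t]
    return seam


def synthetic_glide(n_bins=64, n_frames=40):
    """Toy magnitude matrix: a single ridge that rises by one bin every
    two frames, on top of a weak random noise floor."""
    rng = np.random.default_rng(0)
    mag = 0.05 * rng.random((n_bins, n_frames))
    ridge = 10 + np.arange(n_frames) // 2
    mag[ridge, np.arange(n_frames)] += 1.0
    return mag, ridge


mag, ridge = synthetic_glide()
seam = max_energy_seam(mag)
```

On this toy input the seam follows the gliding ridge frame by frame without any per-frame peak picking. Extracting several components would presumably require removing or attenuating each seam before searching for the next one; that step is not shown here.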
Streams as Seams: Carving trajectories out of the time-frequency matrix / G. Capizzi, D. Rocchesso, S. Baldan - In: Proceedings of the 17th Sound and Music Computing Conference / edited by S. Spagnol, A. Valle. - [s.l] : Università degli Studi di Torino, 2020. - ISBN 978-88-945415-0-2. - pp. 442-449 (Paper presented at the 17th Sound and Music Computing Conference, held in Turin in 2020).
Streams as Seams: Carving trajectories out of the time-frequency matrix
D. Rocchesso; 2020
File | Access | Type | Size | Format
---|---|---|---|---
SMCCIM_2020_paper_60.pdf | Open access | Publisher's version/PDF | 6.25 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.