A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing a known image processing technique, known as seam carving, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show how this technique is well suited for sound whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, based on peak detection on spectrogram frames. The seam carving skips this step and turns out to be applicable to a variety of sounds,although being more computationally expensive.

Streams as Seams: Carving trajectories out of the time-frequency matrix / G. Capizzi, D. Rocchesso, S. Baldan - In: Proceedings of the 17th Sound and Music Computing Conference / [a cura di] S. Spagnol, A. Valle. - [s.l] : Università degli Studi di Torino, 2020. - ISBN 978-88-945415-0-2. - pp. 442-449 (( Intervento presentato al 17. convegno Sound and Music Computing Conference tenutosi a Torino nel 2020.

Streams as Seams: Carving trajectories out of the time-frequency matrix

D. Rocchesso
;
2020

Abstract

A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing a known image processing technique, known as seam carving, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show how this technique is well suited for sound whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, based on peak detection on spectrogram frames. The seam carving skips this step and turns out to be applicable to a variety of sounds,although being more computationally expensive.
Time-frequency analysis; Sinusoidal components; Seam carving
Settore INF/01 - Informatica
2020
https://zenodo.org/records/3903573
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
SMCCIM_2020_paper_60.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 6.25 MB
Formato Adobe PDF
6.25 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1034428
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact