Audio-to-score alignment (A2SA) is a multimodal task consisting in the alignment of audio signals to music scores. Recent literature confirms the benefits of Automatic Music Transcription (AMT) for A2SA at the frame-level. In this work, we aim to elaborate on the exploitation of AMT Deep Learning (DL) models for achieving alignment at the note-level. We propose a method which benefits from HMM-based score-to-score alignment and AMT, showing a remarkable advancement beyond the state-of-the-art. We design a systematic procedure to take advantage of large datasets which do not offer an aligned score. Finally, we perform a thorough comparison and extensive tests on multiple datasets.
Audio-to-Score Alignment Using Deep Automatic Music Transcription / F. Simonetta, S. Ntalampiras, F. Avanzini - In: 2021 IEEE 23rd International Workshop on Multimedia Signal Processing (MMSP)[s.l] : IEEE, 2021. - ISBN 978-1-6654-3288-7. - pp. 1-6 (( Intervento presentato al 23. convegno IEEE International Workshop on Multimedia Signal Processing (IEEE MMSP) tenutosi a Tampere nel 2021 [10.1109/MMSP53017.2021.9733531].
Audio-to-Score Alignment Using Deep Automatic Music Transcription
F. SimonettaPrimo
;S. NtalampirasSecondo
;F. AvanziniUltimo
2021
Abstract
Audio-to-score alignment (A2SA) is a multimodal task consisting in the alignment of audio signals to music scores. Recent literature confirms the benefits of Automatic Music Transcription (AMT) for A2SA at the frame-level. In this work, we aim to elaborate on the exploitation of AMT Deep Learning (DL) models for achieving alignment at the note-level. We propose a method which benefits from HMM-based score-to-score alignment and AMT, showing a remarkable advancement beyond the state-of-the-art. We design a systematic procedure to take advantage of large datasets which do not offer an aligned score. Finally, we perform a thorough comparison and extensive tests on multiple datasets.File | Dimensione | Formato | |
---|---|---|---|
2107.12854.pdf
accesso aperto
Descrizione: Accepted version with erratum
Tipologia:
Pre-print (manoscritto inviato all'editore)
Dimensione
2.29 MB
Formato
Adobe PDF
|
2.29 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.