Computational Auditory Scene Analysis (CASA) is typically achieved by statistical models trained offline on available data. Their performance relies heavily on the assumption that the process generating the data along with the recording conditions are stationary over time. Nowadays, the focus of CASA is moving from structured, well-defined scenarios to unrestricted scenes with realistic characteristics where the stationarity assumption might not be true. Therefore, there is a high demand for methodologies and tools dealing with a series of problems tightly coupled with such non-stationary conditions, such as changes in the recording conditions, reverberant effects, etc. This paper formulates these obstacles under the concept drift framework and explores the two fundamental adaptation approaches: active and passive. The overall aim is to learn online the statistical properties of the evolving data distribution and incorporate them into the recognition mechanism for boosting its performance. The proposed CASA system encompasses: (a) a concept drift detector and (b) an online adaptation module. The proposed framework was evaluated in the auditory analysis of three environments (office, meeting room, and lecture hall) with diverse characteristics (dimensions, reverberation times, etc.) The corpus is based on a combination of professional sound event collections. We report encouraging experimental results in terms of classification rate, false positive rate, false negative rate, and detection delay.

Generalized Sound Recognition in Reverberant Environments / S. Ntalampiras. - In: AES. - ISSN 1549-4950. - 67:10(2019 Oct), pp. 772-781.

Generalized Sound Recognition in Reverberant Environments

S. Ntalampiras
2019

Abstract

Computational Auditory Scene Analysis (CASA) is typically achieved by statistical models trained offline on available data. Their performance relies heavily on the assumption that the process generating the data along with the recording conditions are stationary over time. Nowadays, the focus of CASA is moving from structured, well-defined scenarios to unrestricted scenes with realistic characteristics where the stationarity assumption might not be true. Therefore, there is a high demand for methodologies and tools dealing with a series of problems tightly coupled with such non-stationary conditions, such as changes in the recording conditions, reverberant effects, etc. This paper formulates these obstacles under the concept drift framework and explores the two fundamental adaptation approaches: active and passive. The overall aim is to learn online the statistical properties of the evolving data distribution and incorporate them into the recognition mechanism for boosting its performance. The proposed CASA system encompasses: (a) a concept drift detector and (b) an online adaptation module. The proposed framework was evaluated in the auditory analysis of three environments (office, meeting room, and lecture hall) with diverse characteristics (dimensions, reverberation times, etc.) The corpus is based on a combination of professional sound event collections. We report encouraging experimental results in terms of classification rate, false positive rate, false negative rate, and detection delay.
Settore INF/01 - Informatica
ott-2019
AES
Article (author)
File in questo prodotto:
File Dimensione Formato  
JAES-D-19-00021HR.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 469.63 kB
Formato Adobe PDF
469.63 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/686225
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 11
social impact