Computational bioacoustics is a relatively young research area, yet it has increasingly received attention over the last decade because it can be used in a wide range of applications in a cost-effective manner. This work focuses on the problem of detecting the novel bird calls and songs associated with various species and individual birds. To this end, variational autoencoders, consisting of deep encoding-decoding networks, are employed. The encoder encompasses a series of convolutional layers leading to a smooth high-level abstraction of log-Mel spectrograms that characterise bird vocalisations. The decoder operates on this latent representation to generate each respective original observation. Novel species/individual detection is carried out by monitoring and thresholding the expected reconstruction probability. We thoroughly evaluate the proposed method on two different data sets, including the vocalisations of 11 North American bird species and 16 Athene noctua individuals.

Acoustic detection of unknown bird species and individuals / S. Ntalampiras, I. Potamitis. - In: CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY. - ISSN 2468-2322. - (2021). [Epub ahead of print] [10.1049/cit2.12007]

Acoustic detection of unknown bird species and individuals

S. Ntalampiras
Primo
;
2021

Abstract

Computational bioacoustics is a relatively young research area, yet it has increasingly received attention over the last decade because it can be used in a wide range of applications in a cost-effective manner. This work focuses on the problem of detecting the novel bird calls and songs associated with various species and individual birds. To this end, variational autoencoders, consisting of deep encoding-decoding networks, are employed. The encoder encompasses a series of convolutional layers leading to a smooth high-level abstraction of log-Mel spectrograms that characterise bird vocalisations. The decoder operates on this latent representation to generate each respective original observation. Novel species/individual detection is carried out by monitoring and thresholding the expected reconstruction probability. We thoroughly evaluate the proposed method on two different data sets, including the vocalisations of 11 North American bird species and 16 Athene noctua individuals.
Settore INF/01 - Informatica
mar-2021
Article (author)
File in questo prodotto:
File Dimensione Formato  
41 cit2.12007.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 1.95 MB
Formato Adobe PDF
1.95 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

Caricamento pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/2434/824369
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 5
social impact