Softcomputing (fuzzy logic and artificial neural networks) have been widely applied in several fields, above all control and pattern matching. With the fast development and the huge availability of extremely pervasive communication technologies such as Internet, new challenges are prompted. Due to large availability of multimedia data (audio, video, images, etc.), searching information is becoming an increasingly complex task because most of information is not available in text format. Audio information is widely spread in multimedia information and it is strictly related to video and image information. Audio classification is the first step in the developing of complete process that leads to upgrading current text-based search engine with signal-based information such as audio (sounds, music, and speech). Fuzzy logic and artificial neural networks fit optimally the classification problem of the audio information, due to the fuzzy and the neural nature of recognition of specific audio pattern in complex audio contexts (broadcast news, video, TV programs, advertising, etc.). Non linear nature of audio perception, audio pattern recognition, and audio information extraction from a mix of unknown sources (unmixing) have a perfect matching with fuzzy logic inference and with neural classification. Both fuzzy logic and neural networks will be discussed in three main audio processing areas, word spotting, speaker recognition, and music pattern recognition Audio features extraction algorithms are firstly explained, then the modelling of a fuzzy inference engine from feature distribution and the training of an artificial neural network for pattern classification are discussed.

Softcomputing methodologies applied to audio-based information retrieval / M. Malcangi - In: Advances in Mathematical and Computational Methods / [a cura di] L. Rogozea. - Stevens Poit, USA : WSEAS Press, 2010. - ISBN 9789604742431. (( Intervento presentato al 12. convegno WSEAS International Conference on Mathematical and Computational Methods in Science and Engineering (MACMESE'10) tenutosi a Faro - Portugal nel 2010.

Softcomputing methodologies applied to audio-based information retrieval

M. Malcangi
Primo
2010

Abstract

Softcomputing (fuzzy logic and artificial neural networks) have been widely applied in several fields, above all control and pattern matching. With the fast development and the huge availability of extremely pervasive communication technologies such as Internet, new challenges are prompted. Due to large availability of multimedia data (audio, video, images, etc.), searching information is becoming an increasingly complex task because most of information is not available in text format. Audio information is widely spread in multimedia information and it is strictly related to video and image information. Audio classification is the first step in the developing of complete process that leads to upgrading current text-based search engine with signal-based information such as audio (sounds, music, and speech). Fuzzy logic and artificial neural networks fit optimally the classification problem of the audio information, due to the fuzzy and the neural nature of recognition of specific audio pattern in complex audio contexts (broadcast news, video, TV programs, advertising, etc.). Non linear nature of audio perception, audio pattern recognition, and audio information extraction from a mix of unknown sources (unmixing) have a perfect matching with fuzzy logic inference and with neural classification. Both fuzzy logic and neural networks will be discussed in three main audio processing areas, word spotting, speaker recognition, and music pattern recognition Audio features extraction algorithms are firstly explained, then the modelling of a fuzzy inference engine from feature distribution and the training of an artificial neural network for pattern classification are discussed.
Information Retrieval ; Audio features ; Artificial Neural Networks; Fuzzy logic
Settore INF/01 - Informatica
2010
Universitade do Algarve
Centro de Investigacao Sobre
Book Part (author)
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/147867
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact