Softcomputing (fuzzy logic and artificial neural networks) have been widely applied in several fields, above all control and pattern matching. With the fast development and the huge availability of extremely pervasive communication technologies such as Internet, new challenges are prompted. Due to large availability of multimedia data (audio, video, images, etc.), searching information is becoming an increasingly complex task because most of information is not available in text format. Audio information is widely spread in multimedia information and it is strictly related to video and image information. Audio classification is the first step in the developing of complete process that leads to upgrading current text-based search engine with signal-based information such as audio (sounds, music, and speech). Fuzzy logic and artificial neural networks fit optimally the classification problem of the audio information, due to the fuzzy and the neural nature of recognition of specific audio pattern in complex audio contexts (broadcast news, video, TV programs, advertising, etc.). Non linear nature of audio perception, audio pattern recognition, and audio information extraction from a mix of unknown sources (unmixing) have a perfect matching with fuzzy logic inference and with neural classification. Both fuzzy logic and neural networks will be discussed in three main audio processing areas, word spotting, speaker recognition, and music pattern recognition Audio features extraction algorithms are firstly explained, then the modelling of a fuzzy inference engine from feature distribution and the training of an artificial neural network for pattern classification are discussed.
Softcomputing methodologies applied to audio-based information retrieval / M. Malcangi - In: Advances in Mathematical and Computational Methods / [a cura di] L. Rogozea. - Stevens Poit, USA : WSEAS Press, 2010. - ISBN 9789604742431. (( Intervento presentato al 12. convegno WSEAS International Conference on Mathematical and Computational Methods in Science and Engineering (MACMESE'10) tenutosi a Faro - Portugal nel 2010.
Softcomputing methodologies applied to audio-based information retrieval
M. MalcangiPrimo
2010
Abstract
Softcomputing (fuzzy logic and artificial neural networks) have been widely applied in several fields, above all control and pattern matching. With the fast development and the huge availability of extremely pervasive communication technologies such as Internet, new challenges are prompted. Due to large availability of multimedia data (audio, video, images, etc.), searching information is becoming an increasingly complex task because most of information is not available in text format. Audio information is widely spread in multimedia information and it is strictly related to video and image information. Audio classification is the first step in the developing of complete process that leads to upgrading current text-based search engine with signal-based information such as audio (sounds, music, and speech). Fuzzy logic and artificial neural networks fit optimally the classification problem of the audio information, due to the fuzzy and the neural nature of recognition of specific audio pattern in complex audio contexts (broadcast news, video, TV programs, advertising, etc.). Non linear nature of audio perception, audio pattern recognition, and audio information extraction from a mix of unknown sources (unmixing) have a perfect matching with fuzzy logic inference and with neural classification. Both fuzzy logic and neural networks will be discussed in three main audio processing areas, word spotting, speaker recognition, and music pattern recognition Audio features extraction algorithms are firstly explained, then the modelling of a fuzzy inference engine from feature distribution and the training of an artificial neural network for pattern classification are discussed.Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.