The applicability of soft computing to implementing text-to-speech conversion is subject to debate. Using neural networks for phoneme-level, text-to-speech conversion has several advantages over hard computing. Soft computing’s capacity to generalize makes it possible to map words missing from the database, as well as to reduce contradictions related to different pronunciations for the same word. Neural networks have been shown to optimally solve a large class of applied pattern-matching problems, but very little research has been done to match the requirements of pattern generation in machine-to-human interaction. An artificial speech synthesizer based on neural networks is being developed for application to deeply embedded systems for language-independent speech commands on hands-free interfaces. A feed-forward, backpropagation artificial neural network has been trained for this purpose using a custom-developed, regular expression-based, text-to-phones transcription engine to generate training patterns. Initial experimental results show the expected properties of language independence and in-system learning.
|Titolo:||A Language-Independent Neural Network-Based Speech Synthesizer|
MALCANGI, MARIO NATALINO (Primo)
|Parole Chiave:||Speech synthesis ; text-to-speech ; artificial neural network ; regular expressions|
|Settore Scientifico Disciplinare:||Settore INF/01 - Informatica|
|Data di pubblicazione:||2007|
|Tipologia:||Book Part (author)|
|Appare nelle tipologie:||03 - Contributo in volume|