Text-driven avatars based on artificial neural networks and fuzzy logic

Malcangi, M.N.

We discuss a new approach to driving avatars with synthetic speech generated from plain text. Lip and facial muscles are controlled by the information embedded in the utterance and by its associated expressiveness. Rule-based text-to-speech synthesis generates the phonetic and expression transcriptions of the text to be uttered by the avatar. Two artificial neural networks, one for text-to-phone transcription and the other for phone-to-viseme mapping, were trained on phonetic transcription data. Two fuzzy-logic engines were tuned for smoothed control of lip and face movement. Simulations were run to test the neural-fuzzy controls, using a parametric speech synthesizer to generate voices and a face synthesizer to generate facial movement. Experimental results show that soft computing offers a good solution for the smoothed control of avatars during the expressive utterance of text.
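
As a rough illustration of the phone-to-viseme and smoothing stages named in the abstract, the sketch below maps phones to viseme classes and blends lip opening over time with triangular membership functions and weighted-average (zero-order Sugeno-style) defuzzification. It is not the paper's method: the record does not describe the networks or the fuzzy engines, so a plain lookup table stands in for the trained phone-to-viseme network, and the phone set, viseme classes, opening values, and fixed phone duration are all hypothetical.

```python
# Illustrative sketch only. The table, viseme classes, opening values, and
# timing model are assumptions, not taken from the paper.

# Hypothetical lookup standing in for a trained phone-to-viseme network.
PHONE_TO_VISEME = {
    "p": "closed", "b": "closed", "m": "closed",
    "f": "lip_teeth", "v": "lip_teeth",
    "a": "open", "o": "rounded", "u": "rounded",
    "i": "spread", "e": "spread",
}

# Assumed target lip opening per viseme (0 = closed, 1 = fully open).
VISEME_OPENING = {
    "closed": 0.0, "neutral": 0.1, "lip_teeth": 0.2,
    "spread": 0.4, "rounded": 0.5, "open": 1.0,
}


def triangular(x, left, peak, right):
    """Triangular membership: 0 outside (left, right), 1 at peak."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


def phones_to_visemes(phones):
    """Map each phone to a viseme class; unknown phones become 'neutral'."""
    return [PHONE_TO_VISEME.get(p, "neutral") for p in phones]


def lip_opening(phones, t, phone_duration=0.1):
    """Smoothed lip opening at time t (seconds), assuming equal-length phones.

    Each phone contributes a rule "IF t is near this phone THEN opening is
    the phone's viseme target"; the output is the membership-weighted
    average of the targets.
    """
    num = den = 0.0
    for k, viseme in enumerate(phones_to_visemes(phones)):
        centre = (k + 0.5) * phone_duration
        mu = triangular(t, centre - phone_duration, centre,
                        centre + phone_duration)
        num += mu * VISEME_OPENING[viseme]
        den += mu
    # Outside the utterance no rule fires: fall back to the neutral pose.
    return num / den if den > 0 else VISEME_OPENING["neutral"]
```

With these overlapping memberships the lip opening moves gradually from one viseme target to the next instead of jumping at phone boundaries, which is the kind of smoothed control the abstract attributes to the fuzzy-logic engines.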

Text-driven avatars based on artificial neural networks and fuzzy logic / M.N. Malcangi. - In: INTERNATIONAL JOURNAL OF COMPUTERS. - ISSN 1998-4308. - 4:2(2010), pp. 61-69.


	Keywords
				speech-driven avatar; phone-to-viseme conversion; text-to-speech synthesis; artificial neural network; fuzzy logic

	Scientific-disciplinary sectors of the article (display only)
				Sector INF/01 - Computer Science

	Publication date
				2010

	Journal in ANCE
				INTERNATIONAL JOURNAL OF COMPUTERS

	URL
				http://www.naun.org/journals/computers/19-269.pdf

	Type
				Article (author)

	Appears in types:
				01 - Journal article

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/143902


IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca
