A lumped physical model of the glottal source is presented. Vocal folds are described as single masses, but vertical phase differences between upper and lower margins of the folds are taken into account by appropriately describing the non-linear interaction of the mechanical model with aerodynamics. This results in a modified one-mass model, or a "one-delayed-mass model". Analysis on numerical simulations shows that the system behaves qualitatively as higher-dimensional models (such as the two-mass model by Ishizaka and Flanagan); in particular, control over flow skewness is guaranteed, allowing for synthesis of realistic glottal flow waveforms. As only one degree of freedom (one mass) is needed in the model, structure and number of parameters are drastically reduced, thus making it suitable for real-Time synthesis applications.

One-delayed-mass model for efficient synthesis of glottal flow / F. Avanzini, P. Alku, M. Karjalainen - In: EUROSPEECH 2001 - Scandinavia[s.l] : International Speech Communication Association, 2001 Sep. - ISBN 8790834100. - pp. 51-54 (( Intervento presentato al 7. convegno European Conference on Speech Communication and Technology tenutosi a Aalborg nel 2001.

One-delayed-mass model for efficient synthesis of glottal flow

F. Avanzini;
2001

Abstract

A lumped physical model of the glottal source is presented. Vocal folds are described as single masses, but vertical phase differences between upper and lower margins of the folds are taken into account by appropriately describing the non-linear interaction of the mechanical model with aerodynamics. This results in a modified one-mass model, or a "one-delayed-mass model". Analysis on numerical simulations shows that the system behaves qualitatively as higher-dimensional models (such as the two-mass model by Ishizaka and Flanagan); in particular, control over flow skewness is guaranteed, allowing for synthesis of realistic glottal flow waveforms. As only one degree of freedom (one mass) is needed in the model, structure and number of parameters are drastically reduced, thus making it suitable for real-Time synthesis applications.
Communication; Linguistics and Language; Computer Science Applications1707 Computer Vision and Pattern Recognition; Software
Settore INF/01 - Informatica
Settore ING-INF/05 - Sistemi di Elaborazione delle Informazioni
set-2001
Aalborg City Council
Aalborg University
Det Obelske Familiefond
ELRA/ELDA
ELSNET
et al.
http://www.isca-speech.org/archive/eurospeech_2001/index.html
Book Part (author)
File in questo prodotto:
File Dimensione Formato  
avanzini_eurosp01_scanned.pdf

accesso riservato

Descrizione: Articolo principale
Tipologia: Publisher's version/PDF
Dimensione 2.11 MB
Formato Adobe PDF
2.11 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/642014
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 15
  • ???jsp.display-item.citation.isi??? ND
social impact