Autism Spectrum Disorder (ASD) is a term used to describe a constellation of early-onset social communication deficits and repetitive sensorimotor behaviours associated with a strong genetic component as well as other causes. This paper aims at creating a tool for automatically isolating segments of the speech useful for extract prosody features for identifying children with ASD. In particular, in this first phase of the research, we are interested in the creation of a large dataset of 'a' vowels of ASD and not ASD people. The 'a' vowel contains relevant information on the voice quality and emotional states. The proposed methodology is divided into 2 phases. In the former the input audio is analyzed to determine the vowel onset and offset points, useful to extract the vowel regions. Then a spectrogram graphically visualizing the identified vowels is provided as input to the second phase, where a convolutional neural network classifies whether the histogram represents the vowel 'a'. The convolutional network reaches an average accuracy of 95.00% (standard deviation ± 2.60%) on a dataset of 640 samples with Stratified 5-Fold Cross-Validation.
Automatic creation of a Vowel Dataset for performing Prosody Analysis in ASD screening / R. Francese, M. Frasca, M. Risi (IEEE SYMPOSIUM ON INFORMATION VISUALIZATION). - In: IV International Conference Information Visualisation / [a cura di] Banissi E., Ursyn A., McK. Bannatyne M.W., Pires J.M., Datia N., Huang M.L., Huang W., Nguyen Q.V., Nazemi K., Kovalerchuk B., Counsell J., Agapiou A., Khosrow-Shahi F., Chau H.-W., Li M., Laing R., Bouali F., Venturini G., Temperini M., Sarfraz M.. - [s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2021 Jul. - ISBN 9781665438278. - pp. 29-34 (( Intervento presentato al 25. convegno International Conference Information Visualisation : 5 through 9 July tenutosi a Sydney nel 2021 [10.1109/iv53921.2021.00015].
Automatic creation of a Vowel Dataset for performing Prosody Analysis in ASD screening
M. FrascaPenultimo
;
2021
Abstract
Autism Spectrum Disorder (ASD) is a term used to describe a constellation of early-onset social communication deficits and repetitive sensorimotor behaviours associated with a strong genetic component as well as other causes. This paper aims at creating a tool for automatically isolating segments of the speech useful for extract prosody features for identifying children with ASD. In particular, in this first phase of the research, we are interested in the creation of a large dataset of 'a' vowels of ASD and not ASD people. The 'a' vowel contains relevant information on the voice quality and emotional states. The proposed methodology is divided into 2 phases. In the former the input audio is analyzed to determine the vowel onset and offset points, useful to extract the vowel regions. Then a spectrogram graphically visualizing the identified vowels is provided as input to the second phase, where a convolutional neural network classifies whether the histogram represents the vowel 'a'. The convolutional network reaches an average accuracy of 95.00% (standard deviation ± 2.60%) on a dataset of 640 samples with Stratified 5-Fold Cross-Validation.| File | Dimensione | Formato | |
|---|---|---|---|
|
Automatic_creation_of_a_Vowel_Dataset_for_performing_Prosody_Analysis_in_ASD_screening.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
379.66 kB
Formato
Adobe PDF
|
379.66 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




