Automatic creation of a Vowel Dataset for performing Prosody Analysis in ASD screening

Francese, R.; Frasca, M.; Risi, M.

doi:10.1109/iv53921.2021.00015

Autism Spectrum Disorder (ASD) is a term used to describe a constellation of early-onset social communication deficits and repetitive sensorimotor behaviours associated with a strong genetic component as well as other causes. This paper aims at creating a tool for automatically isolating segments of the speech useful for extract prosody features for identifying children with ASD. In particular, in this first phase of the research, we are interested in the creation of a large dataset of 'a' vowels of ASD and not ASD people. The 'a' vowel contains relevant information on the voice quality and emotional states. The proposed methodology is divided into 2 phases. In the former the input audio is analyzed to determine the vowel onset and offset points, useful to extract the vowel regions. Then a spectrogram graphically visualizing the identified vowels is provided as input to the second phase, where a convolutional neural network classifies whether the histogram represents the vowel 'a'. The convolutional network reaches an average accuracy of 95.00% (standard deviation ± 2.60%) on a dataset of 640 samples with Stratified 5-Fold Cross-Validation.

Automatic creation of a Vowel Dataset for performing Prosody Analysis in ASD screening / R. Francese, M. Frasca, M. Risi (IEEE SYMPOSIUM ON INFORMATION VISUALIZATION). - In: IV International Conference Information Visualisation / [a cura di] Banissi E., Ursyn A., McK. Bannatyne M.W., Pires J.M., Datia N., Huang M.L., Huang W., Nguyen Q.V., Nazemi K., Kovalerchuk B., Counsell J., Agapiou A., Khosrow-Shahi F., Chau H.-W., Li M., Laing R., Bouali F., Venturini G., Temperini M., Sarfraz M.. - [s.l] : Institute of Electrical and Electronics Engineers (IEEE), 2021 Jul. - ISBN 9781665438278. - pp. 29-34 (( Intervento presentato al 25. convegno International Conference Information Visualisation : 5 through 9 July tenutosi a Sydney nel 2021 [10.1109/iv53921.2021.00015].

Automatic creation of a Vowel Dataset for performing Prosody Analysis in ASD screening

Francese, Rita;M. Frasca^Penultimo;Risi, Michele

2021

Abstract

Autism Spectrum Disorder (ASD) is a term used to describe a constellation of early-onset social communication deficits and repetitive sensorimotor behaviours associated with a strong genetic component as well as other causes. This paper aims at creating a tool for automatically isolating segments of the speech useful for extract prosody features for identifying children with ASD. In particular, in this first phase of the research, we are interested in the creation of a large dataset of 'a' vowels of ASD and not ASD people. The 'a' vowel contains relevant information on the voice quality and emotional states. The proposed methodology is divided into 2 phases. In the former the input audio is analyzed to determine the vowel onset and offset points, useful to extract the vowel regions. Then a spectrogram graphically visualizing the identified vowels is provided as input to the second phase, where a convolutional neural network classifies whether the histogram represents the vowel 'a'. The convolutional network reaches an average accuracy of 95.00% (standard deviation ± 2.60%) on a dataset of 640 samples with Stratified 5-Fold Cross-Validation.

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				Audio signals; autism spectrum disorder; convolutional neural network; deep neural network
			
	Settori scientifico-disciplinari del contributo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
			
	Data di pubblicazione
	
				lug-2021
			
	Enti collegati al convegno
	
				Institute of Electrical and Electronics Engineers (IEEE)
			
	DOI
	
				https://dx.doi.org/10.1109/iv53921.2021.00015
			
	URL
	
				https://ieeexplore.ieee.org/abstract/document/9582722
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
Automatic_creation_of_a_Vowel_Dataset_for_performing_Prosody_Analysis_in_ASD_screening.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 379.66 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	379.66 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1148788

Citazioni

ND

1

1

ND

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca