Combining Public Human Activity Recognition Datasets to Mitigate Labeled Data Scarcity

Presotto, R.; Ek, S.; Civitarese, G.; Portet, F.; Lalanda, P.; Bettini, C.

doi:10.1109/SMARTCOMP58114.2023.00022

The use of supervised learning for Human Activity Recognition (HAR) on mobile devices leads to strong classification performance. Such an approach, however, requires large amounts of labeled data, both for the initial training of the models and for their customization on specific clients (whose data often differ greatly from the training data). Such data are impractical to obtain because annotation is costly, intrusive, and time-consuming. Moreover, even with a significant amount of labeled data, models deployed on heterogeneous clients have difficulty generalizing to unseen data. Other domains, like Computer Vision or Natural Language Processing, have proposed the notion of pre-trained models, leveraging large corpora, to reduce the need for annotated data and better manage heterogeneity. This promising approach has not been implemented in the HAR domain so far because of the lack of public datasets of sufficient size. In this paper, we propose a novel strategy to combine publicly available datasets with the goal of learning a generalized HAR model that can be fine-tuned using a limited amount of labeled data on an unseen target domain. Our experimental evaluation, which covers different state-of-the-art neural network architectures, shows that combining public datasets can significantly reduce the number of labeled samples required to achieve satisfactory performance on an unseen target domain.

Combining Public Human Activity Recognition Datasets to Mitigate Labeled Data Scarcity / R. Presotto, S. Ek, G. Civitarese, F. Portet, P. Lalanda, C. Bettini. - In: 2023 IEEE International Conference on Smart Computing (SMARTCOMP). - [s.l.] : IEEE, 2023. - ISBN 979-8-3503-2281-1. - pp. 33-40. (SMARTCOMP conference held in Nashville in 2023) [10.1109/SMARTCOMP58114.2023.00022].

R. Presotto (first author); Sannara Ek; G. Civitarese; François Portet; Philippe Lalanda; C. Bettini (last author)

2023

	Keywords
	
			human activity recognition; mobile/wearable computing; transfer learning
		
	Scientific-disciplinary sectors of the contribution
	
			Sector INF/01 - Computer Science
		
	Publication date
	
			2023
		
	DOI
	
			https://dx.doi.org/10.1109/SMARTCOMP58114.2023.00022
		
	Type
	
			Book Part (author)
		
	Appears in types:
	
			03 - Contribution in volume

Files in this item:

File	Access	Type	Size	Format
Combining_Public_Human_Activity_Recognition_Datasets_to_Mitigate_Labeled_Data_Scarcity (1).pdf	restricted access	Pre-print (manuscript submitted to the publisher)	1.54 MB	Adobe PDF
Combining_Public_Human_Activity_Recognition_Datasets_to_Mitigate_Labeled_Data_Scarcity.pdf	open access	Publisher's version/PDF	1.21 MB	Adobe PDF

Documents in IRIS are protected by copyright, and all rights are reserved unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/991948

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca