Human Activity Recognition (HAR) based on the sensors of mobile/wearable devices aims to detect the physical activities performed by humans in their daily lives. Although supervised learning methods are the most effective in this task, their effectiveness is constrained to using a large amount of labeled data during training. While collecting raw unlabeled data can be relatively easy, annotating data is challenging due to costs, intrusiveness, and time constraints. To address these challenges, this paper explores alternative approaches for accurate HAR using a limited amount of labeled data. In particular, we have adapted recent Self-Supervised Learning (SSL) algorithms to the HAR domain and compared their effectiveness. We investigate three state-of-the-art SSL techniques of different families: contrastive, generative, and predictive. Additionally, we evaluate the impact of the underlying neural network on the recognition rate by comparing state-of-the-art CNN and transformer architectures. Our results show that a Masked Auto Encoder approach significantly outperforms other SSL approaches, including SimCLR, commonly considered one of the best-performing SSL methods in the HAR domain. The code and the pre-trained SSL models are publicly available for further research and development.
Comparing self-supervised learning techniques for wearable human activity recognition / S. Ek, R. Presotto, G. Civitarese, F. Portet, P. Lalanda, C. Bettini. - In: CCF TRANSACTIONS ON PERVASIVE COMPUTING AND INTERACTION. - ISSN 2524-521X. - (2025), pp. 1-18. [Epub ahead of print] [10.1007/s42486-024-00182-9]
Comparing self-supervised learning techniques for wearable human activity recognition
R. PresottoSecondo
;G. Civitarese;C. BettiniUltimo
2025
Abstract
Human Activity Recognition (HAR) based on the sensors of mobile/wearable devices aims to detect the physical activities performed by humans in their daily lives. Although supervised learning methods are the most effective in this task, their effectiveness is constrained to using a large amount of labeled data during training. While collecting raw unlabeled data can be relatively easy, annotating data is challenging due to costs, intrusiveness, and time constraints. To address these challenges, this paper explores alternative approaches for accurate HAR using a limited amount of labeled data. In particular, we have adapted recent Self-Supervised Learning (SSL) algorithms to the HAR domain and compared their effectiveness. We investigate three state-of-the-art SSL techniques of different families: contrastive, generative, and predictive. Additionally, we evaluate the impact of the underlying neural network on the recognition rate by comparing state-of-the-art CNN and transformer architectures. Our results show that a Masked Auto Encoder approach significantly outperforms other SSL approaches, including SimCLR, commonly considered one of the best-performing SSL methods in the HAR domain. The code and the pre-trained SSL models are publicly available for further research and development.| File | Dimensione | Formato | |
|---|---|---|---|
|
CCF___SSL__No_Edits_.pdf
embargo fino al 30/03/2026
Tipologia:
Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione
5.9 MB
Formato
Adobe PDF
|
5.9 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
|
s42486-024-00182-9.pdf
accesso riservato
Tipologia:
Publisher's version/PDF
Dimensione
5.87 MB
Formato
Adobe PDF
|
5.87 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.




