2S-SGCN: A two-stage stratified graph convolutional network model for facial landmark detection on 3D data

Burger, J.; Blandano, G.; Facchi, G.M.; Lanzarotti, R.

doi:10.1016/j.cviu.2024.104227

Facial Landmark Detection (FLD) algorithms play a crucial role in numerous computer vision applications, particularly in tasks such as face recognition, head pose estimation, and facial expression analysis. While FLD on images has long been the focus, the emergence of 3D data has led to a surge of interest in FLD on it due to its potential applications in various fields, including medical research. However, automating FLD in this context presents significant challenges, such as selecting suitable network architectures, refining outputs for precise landmark localization and optimizing computational efficiency. In response, this paper presents a novel approach, the 2-Stage Stratified Graph Convolutional Network (2S-SGCN), which addresses these challenges comprehensively. The first stage aims to detect landmark regions using heatmap regression, which leverages both local and long-range dependencies through a stratified approach. In the second stage, 3D landmarks are precisely determined using a new post-processing technique, namely MSE-over-mesh. 2S-SGCN ensures both efficiency and suitability for resource-constrained devices. Experimental results on 3D scans from the public Facescape and Headspace datasets, as well as on point clouds derived from FLAME meshes collected in the DAD-3DHeads dataset, demonstrate that the proposed method achieves state-of-the-art performance across various conditions. Source code is accessible at https://github.com/gfacchi-dev/CVIU-2S-SGCN

2S-SGCN: A two-stage stratified graph convolutional network model for facial landmark detection on 3D data / J. Burger, G. Blandano, G.M. Facchi, R. Lanzarotti. - In: COMPUTER VISION AND IMAGE UNDERSTANDING. - ISSN 1077-3142. - 250:(2025 Jan), pp. 104227.1-104227.11. [10.1016/j.cviu.2024.104227]

2S-SGCN: A two-stage stratified graph convolutional network model for facial landmark detection on 3D data

J. Burger^Primo;G. Blandano^Secondo;G.M. Facchi^Penultimo;R. Lanzarotti^Ultimo

2025

Abstract

Facial Landmark Detection (FLD) algorithms play a crucial role in numerous computer vision applications, particularly in tasks such as face recognition, head pose estimation, and facial expression analysis. While FLD on images has long been the focus, the emergence of 3D data has led to a surge of interest in FLD on it due to its potential applications in various fields, including medical research. However, automating FLD in this context presents significant challenges, such as selecting suitable network architectures, refining outputs for precise landmark localization and optimizing computational efficiency. In response, this paper presents a novel approach, the 2-Stage Stratified Graph Convolutional Network (2S-SGCN), which addresses these challenges comprehensively. The first stage aims to detect landmark regions using heatmap regression, which leverages both local and long-range dependencies through a stratified approach. In the second stage, 3D landmarks are precisely determined using a new post-processing technique, namely MSE-over-mesh. 2S-SGCN ensures both efficiency and suitability for resource-constrained devices. Experimental results on 3D scans from the public Facescape and Headspace datasets, as well as on point clouds derived from FLAME meshes collected in the DAD-3DHeads dataset, demonstrate that the proposed method achieves state-of-the-art performance across various conditions. Source code is accessible at https://github.com/gfacchi-dev/CVIU-2S-SGCN

Scheda breve

Scheda completa

Scheda completa (DC)

	Parole chiave
	
				3D facial landmark detection; Graph convolutional network; Heatmap regression; Stratified coarse-to-fine approach; Facescape and Headspace datasets; GCN lightweight model
			
	Settori scientifico-disciplinari dell'articolo (validi dal 09/05/2024)
	
				Settore INFO-01/A - Informatica
Settore IINF-05/A - Sistemi di elaborazione delle informazioni
			
	Data di pubblicazione
	
				gen-2025
			
	Data ahead of print o data di stampa
	
				12-nov-2024
			
	Rivista in ANCE
	
				COMPUTER VISION AND IMAGE UNDERSTANDING
			
	DOI
	
				https://dx.doi.org/10.1016/j.cviu.2024.104227
			
	Tipologia
	
				Article (author)
			
	Appare nelle tipologie:
	
				01 - Articolo su periodico

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S1077314224003084-main(1).pdf accesso aperto Tipologia: Publisher's version/PDF Licenza: Creative commons Dimensione 3.16 MB Formato Adobe PDF Visualizza/Apri	3.16 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/1118272

Citazioni

ND

5

3

6

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca