Bayesian Integration of Face and Low-level Cues for Foveated Video Coding

Boccignone, G.; Marcelli, A.; Napoletano, P.; Di Fiore, G.; Iacovoni, G.; Morsa, S.

doi:10.1109/TCSVT.2008.2005798

We present a Bayesian model that automatically generates fixations/foveations and can be exploited for compression purposes. The aim of this work is twofold: to investigate how high-level perceptual cues provided by human faces occurring in the video can enhance the compression process without reducing the perceived quality of the video, and to validate this assumption with an extensive and principled experimental protocol. To this end, the model integrates top-down and bottom-up cues to choose the fixation point on a video frame: at the highest level, a fixation is driven by prior information and by relevant objects, namely human faces, within the scene; at the same time, local saliency, together with novel and abrupt visual events, contributes by triggering lower-level control. The performance of the resulting video compression system was evaluated with respect to both the perceived quality of foveated video clips and the compression gain, in an extensive evaluation campaign that ultimately involved 200 subjects.
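The cue integration described in the abstract can be illustrated with a toy sketch. This is not the authors' model: it assumes, purely for illustration, that the top-down cue is a per-pixel prior map that is higher where a face was detected, that the bottom-up cue is a per-pixel saliency map used as a likelihood, and that the fixation is taken at the maximum of their normalized product. The function names (`fixation_posterior`, `choose_fixation`, `foveation_weights`) and the Gaussian falloff are hypothetical choices, not taken from the paper.

```python
import numpy as np


def fixation_posterior(face_prior, saliency, eps=1e-12):
    """Combine a top-down prior map with a bottom-up saliency map.

    Assumption: both inputs are non-negative 2-D arrays of the same shape.
    The result is the normalized product (Bayes' rule on a pixel grid).
    """
    prior = face_prior / (face_prior.sum() + eps)
    unnormalized = prior * saliency
    return unnormalized / (unnormalized.sum() + eps)


def choose_fixation(posterior):
    """Return the (row, col) of the maximum a posteriori location."""
    row, col = np.unravel_index(np.argmax(posterior), posterior.shape)
    return int(row), int(col)


def foveation_weights(shape, fixation, sigma):
    """Gaussian falloff around the fixation (assumed, not from the paper).

    The weight is 1 at the fixation and decays toward 0 in the periphery;
    an encoder could use such a map to spend fewer bits away from the fovea.
    """
    rows, cols = np.indices(shape)
    squared_distance = (rows - fixation[0]) ** 2 + (cols - fixation[1]) ** 2
    return np.exp(-squared_distance / (2.0 * sigma ** 2))


# Toy 5x5 frame: a face region around (1, 1) and a salient spot at (3, 3).
face_prior = np.ones((5, 5))
face_prior[1, 1] = 5.0          # hypothetical face-detector boost
saliency = np.full((5, 5), 0.1)
saliency[1, 1] = 0.5
saliency[3, 3] = 0.9            # strongest low-level cue

posterior = fixation_posterior(face_prior, saliency)
fixation = choose_fixation(posterior)
weights = foveation_weights(posterior.shape, fixation, sigma=1.5)
```

In this toy frame the face prior outweighs the stronger low-level saliency elsewhere, so the fixation lands on the face location; with a flat prior it would fall on the most salient pixel instead.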

Bayesian Integration of Face and Low-level Cues for Foveated Video Coding / G. BOCCIGNONE, A. MARCELLI, P. NAPOLETANO, G. DI FIORE, G. IACOVONI, S. MORSA. - In: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY. - ISSN 1051-8215. - 18:12(2008 Dec), pp. 1727-1740 (article no. 4630762).



	Keywords

			Face detection; Foveated video coding; Foveation filtering; Image coding; Video quality measurement

	Scientific-disciplinary sectors of the article

			Sector ING-INF/05 - Information Processing Systems

	Publication date

			Dec-2008

	Journal in ANCE

			IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

	DOI

			https://dx.doi.org/10.1109/TCSVT.2008.2005798

	Type

			Article (author)

	Appears in types:

			01 - Journal article

Files in this item:

File	Size	Format
Tcsvt08Pre.pdf (open access; type: pre-print, manuscript submitted to the publisher)	1.15 MB	Adobe PDF


Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/52577


IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca