PAC-Bayesian Inequalities for Martingales

Seldin, Y.; Laviolette, F.; Cesa-Bianchi, N.; Shawe-Taylor, J.; Auer, P.

doi:10.1109/TIT.2012.2211334

We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian (probably approximately correct) analysis in learning theory from the i.i.d. setting to martingales, opening the way for its application to importance-weighted sampling, reinforcement learning, and other interactive learning domains, as well as to many other domains in probability theory and statistics where martingales are encountered. We also present a comparison inequality that bounds the expectation of a convex function of a martingale difference sequence shifted to the [0, 1] interval by the expectation of the same function of independent Bernoulli random variables. This inequality is applied to derive a tighter analog of Hoeffding-Azuma's inequality.
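For reference, the classical Hoeffding-Azuma inequality mentioned in the last sentence of the abstract can be stated as follows. This is the standard textbook form, not the tighter analog derived in the paper; the symbols $Z_i$, $M_n$, $c_i$, and $t$ are notation assumed here for illustration and do not come from this record.

```latex
% Classical Hoeffding-Azuma inequality (standard textbook form).
% Assumed notation: Z_1, ..., Z_n is a martingale difference sequence,
% i.e. E[Z_i | Z_1, ..., Z_{i-1}] = 0, with |Z_i| <= c_i almost surely.
\[
  M_n = \sum_{i=1}^{n} Z_i ,
  \qquad
  \Pr\left( M_n \ge t \right)
  \le \exp\left( - \frac{t^2}{2 \sum_{i=1}^{n} c_i^2} \right)
  \quad \text{for all } t > 0 .
\]
```

With $c_i = 1$ for every $i$, the right-hand side reduces to $\exp(-t^2 / (2n))$, so deviations of order $\sqrt{n}$ are the natural scale of the bound.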

PAC-Bayesian Inequalities for Martingales / Y. Seldin, F. Laviolette, N. Cesa-Bianchi, J. Shawe-Taylor, P. Auer. - In: IEEE TRANSACTIONS ON INFORMATION THEORY. - ISSN 0018-9448. - 58:12(2012), pp. 7086-7093, art. no. 6257492. [10.1109/TIT.2012.2211334]

Keywords: Bernstein's inequality; Hoeffding-Azuma's inequality; martingales; PAC-Bayesian bounds

Scientific-disciplinary sectors of the article (display only): Sector INF/01 - Computer Science

Publication date: 2012

Journal in ANCE: IEEE TRANSACTIONS ON INFORMATION THEORY

DOI: https://dx.doi.org/10.1109/TIT.2012.2211334

Type: Article (author)

Appears in types: 01 - Journal article

Files in this item:

There are no files associated with this item.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2434/223243

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca