Improved second-order bounds for prediction with expert advice

Cesa-Bianchi, N.; Mansour, Y.; Stoltz, G.

doi:10.1007/11503415_15

This work studies external regret in sequential prediction games with both positive and negative payoffs. External regret measures the difference between the payoff obtained by the forecasting strategy and the payoff of the best action. In this setting, we derive new and sharper regret bounds for the well-known exponentially weighted average forecaster and for a new forecaster with a different multiplicative update rule. Our analysis has two main advantages: first, no preliminary knowledge about the payoff sequence is needed, not even its range; second, our bounds are expressed in terms of sums of squared payoffs, replacing larger first-order quantities appearing in previous bounds. In addition, our most refined bounds have the natural and desirable property of being stable under rescalings and general translations of the payoff sequence.

Improved second-order bounds for prediction with expert advice / N. Cesa-Bianchi, Y. Mansour, G. Stoltz - In: Learning Theory: 18th Annual Conference on Learning Theory, COLT 2005 : Bertinoro, Italy, June 27-30, 2005 : Proceedings / [a cura di] P. Auer, R. Meir. - Berlin : Springer, 2005. - ISBN 3540265562. - pp. 217-232 (( Intervento presentato al 18. convegno Annual Conference on Learning Theory - COLT 2005 tenutosi a Bertinoro nel 2005.

Improved second-order bounds for prediction with expert advice

N. Cesa-Bianchi^Primo;Y. Mansour;G. Stoltz

2005

Abstract

This work studies external regret in sequential prediction games with both positive and negative payoffs. External regret measures the difference between the payoff obtained by the forecasting strategy and the payoff of the best action. In this setting, we derive new and sharper regret bounds for the well-known exponentially weighted average forecaster and for a new forecaster with a different multiplicative update rule. Our analysis has two main advantages: first, no preliminary knowledge about the payoff sequence is needed, not even its range; second, our bounds are expressed in terms of sums of squared payoffs, replacing larger first-order quantities appearing in previous bounds. In addition, our most refined bounds have the natural and desirable property of being stable under rescalings and general translations of the payoff sequence.

Scheda breve

Scheda completa

Scheda completa (DC)

	Settori scientifico-disciplinari del contributo (sola visualizzazione)
	
				Settore INF/01 - Informatica
			
	Data di pubblicazione
	
				2005
			
	DOI
	
				https://dx.doi.org/10.1007/11503415_15
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/9906

Citazioni

ND

24

16

IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca