IRIS Institutional Research Information System - AIR Archivio Istituzionale della Ricerca

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz and semi-Lipschitz losses with regret bounds improving on the known bounds for standard bandit feedback. Our analysis combines novel results for contextual second-price auctions with a novel algorithmic approach based on chaining. When the context space is Euclidean, our chaining approach is efficient and delivers an even better regret bound.

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning / N. Cesa-Bianchi, P. Gaillard, C. Gentile, S. Gerchinovitz (Proceedings of Machine Learning Research). - In: Proceedings of the 2017 Conference on Learning Theory / [a cura di] S. Kale, O. Shamir. - [s.l] : PMLR, 2017. - pp. 465-481 (( convegno Conference on Learning Theory tenutosi a Amsterdam nel 2017.

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

N. Cesa-Bianchi^Primo;P. Gaillard;C. Gentile;S. Gerchinovitz

2017

Abstract

We investigate contextual online learning with nonparametric (Lipschitz) comparison classes under different assumptions on losses and feedback information. For full information feedback and Lipschitz losses, we design the first explicit algorithm achieving the minimax regret rate (up to log factors). In a partial feedback model motivated by second-price auctions, we obtain algorithms for Lipschitz and semi-Lipschitz losses with regret bounds improving on the known bounds for standard bandit feedback. Our analysis combines novel results for contextual second-price auctions with a novel algorithmic approach based on chaining. When the context space is Euclidean, our chaining approach is efficient and delivers an even better regret bound.

Scheda breve

Scheda completa

Scheda completa (DC)

	Settori scientifico-disciplinari del contributo (sola visualizzazione)
	
				Settore INF/01 - Informatica
			
	Data di pubblicazione
	
				2017
			
	URL
	
				http://proceedings.mlr.press/v65/cesa-bianchi17a.html
			
	Tipologia
	
				Book Part (author)
			
	Appare nelle tipologie:
	
				03 - Contributo in volume

File in questo prodotto:

File	Dimensione	Formato
cesa-bianchi17a.pdf accesso riservato Tipologia: Publisher's version/PDF Dimensione 382.91 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	382.91 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/519327

Citazioni

ND

27

ND

ND

social impact