We study the problem of online multiclass classification in a setting where the learner’s feedback is determined by an arbitrary directed graph. While including bandit feedback as a special case, feedback graphs allow a much richer set of applications, including filtering and label efficient classification. We introduce GAPPLETRON, the first online multiclass algorithm that works with arbitrary feed- back graphs. For this new algorithm, we prove surrogate regret bounds that hold, both in expectation and with high probability, for a large class of surrogate losses. Our bounds are of order B√ρKT , where B is the diameter of the prediction space, K is the number of classes, T is the time horizon, and ρ is the domination number (a graph-theoretic parameter affecting the amount of exploration). In the full in- formation case, we show that GAPPLETRON achieves a constant surrogate regret of order B2K. We also prove a general lower bound of order max {B2K, √T } showing that our upper bounds are not significantly improvable. Experiments on synthetic data show that for various feedback graphs our algorithm is competitive against known baselines.

Beyond Bandit Feedback in Online Multiclass Classification / D. van der Hoeven, F. Fusco, N. Cesa Bianchi (ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS). - In: Advances in Neural Information Processing Systems / [a cura di] M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan. - [s.l] : Curran Associates, 2021. - ISBN 9781713845393. - pp. 13280-13291 (( Intervento presentato al 34. convegno Neural Information Processing Systems tenutosi a virtual nel 2021.

Beyond Bandit Feedback in Online Multiclass Classification

D. van der Hoeven
Primo
;
N. Cesa Bianchi
Ultimo
2021

Abstract

We study the problem of online multiclass classification in a setting where the learner’s feedback is determined by an arbitrary directed graph. While including bandit feedback as a special case, feedback graphs allow a much richer set of applications, including filtering and label efficient classification. We introduce GAPPLETRON, the first online multiclass algorithm that works with arbitrary feed- back graphs. For this new algorithm, we prove surrogate regret bounds that hold, both in expectation and with high probability, for a large class of surrogate losses. Our bounds are of order B√ρKT , where B is the diameter of the prediction space, K is the number of classes, T is the time horizon, and ρ is the domination number (a graph-theoretic parameter affecting the amount of exploration). In the full in- formation case, we show that GAPPLETRON achieves a constant surrogate regret of order B2K. We also prove a general lower bound of order max {B2K, √T } showing that our upper bounds are not significantly improvable. Experiments on synthetic data show that for various feedback graphs our algorithm is competitive against known baselines.
No
English
Settore INF/01 - Informatica
Intervento a convegno
Esperti anonimi
Ricerca di base
Pubblicazione scientifica
   European Learning and Intelligent Systems Excellence (ELISE)
   ELISE
   EUROPEAN COMMISSION
   H2020
   951847
Advances in Neural Information Processing Systems
M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan
Curran Associates
2021
13280
13291
12
9781713845393
34
Volume a diffusione internazionale
Gold
0
Neural Information Processing Systems
virtual
2021
34
Convegno internazionale
Intervento inviato
https://proceedings.neurips.cc/paper/2021/file/6e79ed05baec2754e25b4eac73a332d2-Paper.pdf
DSRC - Data science research center
manual
Aderisco
D. van der Hoeven, F. Fusco, N. Cesa Bianchi
Book Part (author)
open
273
Beyond Bandit Feedback in Online Multiclass Classification / D. van der Hoeven, F. Fusco, N. Cesa Bianchi (ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS). - In: Advances in Neural Information Processing Systems / [a cura di] M. Ranzato, A. Beygelzimer, Y. Dauphin, P.S. Liang, J. Wortman Vaughan. - [s.l] : Curran Associates, 2021. - ISBN 9781713845393. - pp. 13280-13291 (( Intervento presentato al 34. convegno Neural Information Processing Systems tenutosi a virtual nel 2021.
info:eu-repo/semantics/bookPart
3
Prodotti della ricerca::03 - Contributo in volume
File in questo prodotto:
File Dimensione Formato  
NeurIPS-2021-beyond-bandit-feedback-in-online-multiclass-classification-Paper.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 284.73 kB
Formato Adobe PDF
284.73 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/906093
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 1
social impact