Tracking the Best Hyperplane with a Simple Budget Perceptron / N. Cesa-Bianchi, G. Cavallanti, C. Gentile. - In: MACHINE LEARNING. - ISSN 0885-6125. - 69:2-3(2007 Feb 02), pp. 143-167. [10.1007/s10994-007-5003-0]

Tracking the Best Hyperplane with a Simple Budget Perceptron

N. Cesa-Bianchi (first author); G. Cavallanti (second author)
2007

Abstract

Shifting bounds for on-line classification algorithms ensure good performance on any sequence of examples that is well predicted by a sequence of changing classifiers. When proving shifting bounds for kernel-based classifiers, one also faces the problem that the number of stored support vectors can grow without bound unless an eviction policy keeps it under control. In this paper, we show that shifting and on-line learning on a budget can be combined surprisingly well. First, we introduce and analyze a shifting Perceptron algorithm that achieves the best known shifting bounds while using an unlimited budget. Second, we show that applying the simplest possible eviction policy to the Perceptron algorithm, namely discarding a random support vector each time a new one comes in, achieves a shifting bound close to the one obtained with no budget restrictions. More importantly, we show that our randomized algorithm strikes the optimal trade-off between the budget B and the norm U of the largest classifier in the comparison sequence. Experiments are presented comparing several linear-threshold algorithms on chronologically ordered textual datasets. These experiments support our theoretical findings in that they show to what extent randomized budget algorithms are more robust than deterministic ones when learning from data streams with shifting targets.
Keywords: Budget algorithms; Mistake bounds; Pattern classification; Perceptron algorithm
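The eviction policy described in the abstract can be illustrated with a minimal sketch. It assumes only what the abstract states: a kernel Perceptron that stores at most B support vectors and discards one uniformly at random whenever a new one arrives with the budget full. The class name `RandomBudgetPerceptron`, the `linear_kernel` helper, the `seed` parameter, and the rule of updating whenever the margin is non-positive are assumptions made for illustration. Details of the paper's actual algorithm, such as any shrinking or weighting of the stored vectors, are not reproduced here.

```python
import random
from typing import Callable, List, Optional, Sequence, Tuple

Vector = Sequence[float]


def linear_kernel(a: Vector, b: Vector) -> float:
    """Plain dot product; any other kernel function can be passed instead."""
    return sum(x * y for x, y in zip(a, b))


class RandomBudgetPerceptron:
    """Hypothetical sketch: kernel Perceptron with at most `budget` support vectors."""

    def __init__(
        self,
        budget: int,
        kernel: Callable[[Vector, Vector], float] = linear_kernel,
        seed: Optional[int] = None,
    ) -> None:
        if budget < 1:
            raise ValueError("budget must be at least 1")
        self.budget = budget
        self.kernel = kernel
        self.rng = random.Random(seed)
        # Each support vector is stored together with its label in {-1, +1}.
        self.support: List[Tuple[Vector, int]] = []

    def margin(self, x: Vector) -> float:
        """Signed score of x under the current set of support vectors."""
        return sum(y * self.kernel(sv, x) for sv, y in self.support)

    def predict(self, x: Vector) -> int:
        return 1 if self.margin(x) >= 0 else -1

    def update(self, x: Vector, y: int) -> bool:
        """Process one labelled example; return True if an update was made."""
        if y * self.margin(x) > 0:
            return False  # correct with positive margin: nothing to store
        if len(self.support) >= self.budget:
            # Budget is full: evict one stored support vector uniformly at random.
            self.support.pop(self.rng.randrange(len(self.support)))
        self.support.append((x, y))
        return True
```

Calling `update(x, y)` once per example of a data stream keeps the memory footprint at `budget` stored vectors at most, no matter how many mistakes occur. Passing a fixed `seed` makes the random evictions reproducible.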
Academic discipline: INF/01 - Informatica (Computer Science)
2 Feb 2007
Article (author)
Files in this record:
J29.pdf: open access; pre-print (manuscript submitted to the publisher); Adobe PDF, 518.61 kB
Use this identifier to cite or link to this document: https://hdl.handle.net/2434/26350