Playing monotone games to understand learning behaviors / B. Apolloni, S. Bassis, S. Gaito, D. Malchiodi, I. Zoppis. - In: THEORETICAL COMPUTER SCIENCE. - ISSN 0304-3975. - 411:25(2010), pp. 2384-2405.
Playing monotone games to understand learning behaviors
B. Apolloni; S. Bassis; S. Gaito; D. Malchiodi; I. Zoppis
2010
Abstract
We deal with a special class of games against nature which correspond to subsymbolic learning problems where we know a local descent direction in the error landscape but not the amount gained at each step of the learning procedure. Namely, Alice and Bob play a game where the probability of victory grows monotonically, by unknown amounts, with the resources each player employs. For a fixed effort on Alice's part, Bob increases his resources on the basis of the outcomes of the individual contests (victory, tie, or defeat). Quite unlike the usual aims in game theory, his goal is to stop as soon as the defeat probability falls below a given threshold with high confidence. We adopt such a game policy as an archetypal remedy for the general overtraining threat of learning algorithms. Specifically, we recast the original game in a computational learning framework analogous to the Probably Approximately Correct formulation. Therein, a wise use of a special inferential mechanism (known as the twisting argument) highlights relevant statistics for managing different trade-offs between observability and controllability of the defeat probability. With similar statistics, we discuss an analogous trade-off underlying the stopping criterion of subsymbolic learning procedures. In conclusion, we propose a principled stopping rule based solely on the behavior of the training session, hence without diverting examples to a test set.
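To make the flavor of Bob's stopping policy concrete, here is a minimal, purely illustrative Python sketch. It assumes a hypothetical contest whose victory probability grows monotonically with Bob's resources, and it replaces the paper's twisting-argument statistics with a naive Hoeffding-style upper confidence bound computed as if the outcomes were i.i.d.; the function names, the parameters epsilon and delta, and the resource-update rule are illustrative assumptions, not the authors' algorithm.

```python
import math
import random

def defeat_upper_bound(defeats, trials, delta):
    """(1 - delta)-confidence upper bound on the defeat probability,
    via a Hoeffding bound that treats the outcomes as i.i.d.
    (The paper's twisting-argument statistics are sharper and account
    for the defeat probability shrinking as resources grow.)"""
    p_hat = defeats / trials
    return p_hat + math.sqrt(math.log(1.0 / delta) / (2.0 * trials))

def play_contest(bob_resources, alice_resources=10.0):
    """Hypothetical contest: Bob's victory probability grows
    monotonically with his resources; ties are folded into defeats."""
    p_win = bob_resources / (bob_resources + alice_resources)
    return random.random() < p_win   # True = victory, False = defeat/tie

def bob_plays(epsilon=0.1, delta=0.05, max_rounds=100_000):
    """Bob raises his resources after each unfavorable outcome and stops
    as soon as the defeat probability is below epsilon with confidence
    at least 1 - delta."""
    resources, defeats, trials = 1.0, 0, 0
    while trials < max_rounds:
        trials += 1
        if not play_contest(resources):
            defeats += 1
            resources += 1.0        # invest more after a bad outcome
        if defeat_upper_bound(defeats, trials, delta) <= epsilon:
            return resources, trials
    return resources, trials

if __name__ == "__main__":
    res, rounds = bob_plays()
    print(f"stopped after {rounds} contests with resources {res}")
```

In this toy setting the i.i.d. bound is conservative, since early defeats (incurred when Bob's resources were small) keep inflating the empirical defeat rate; the paper's statistics are designed precisely to exploit the monotone structure instead of ignoring it.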